0.21.0 2024-12-14
- Remove support for Python 3.8.
-
artists
: split artists properly, both for tracks and albums. -
comments
: Add a new configuration optiontruncate_comments
which handles very long release descriptions that may crash MPD.
-
album
:- Remove string free download from the album name.
-
mediums
:- Fix incorrect mediums count.
-
title
:- Fix formatting of titles where remix is delimited by a dash or an em-dash.
- For some releases, handle track names where artist and title may be provided in reverse order.
- Address an issue where track names containing
-
inside parentheses get wrongly split into artist and title. - Stop stripping dashes from the beginning of track titles.
-
year
/month
/day
:- Take the timezone into account for the parsed release date.
-
album
:- Clean up/remove artist such as Various.
- Only remove artists and label if they are clearly separated out in the album name.
- Detect artists that may be separated with commas in the album, but with an ampersand in the artist field, and vice versa.
- Remove the original albumartist unless that's the label name.
- Remove w/ Some Remix reliably.
-
artist
:- Ensure that artists are properly separated when a comma is present in the field.
- Try to keep artists in the same format as they were found on the release.
- Extract lead artists from track titles and use them in the albumartist field.
- Ignore instagram handles in the description.
- Do not remove the featuring artist unless they are present in one of the track titles.
- If there is a single albumartist in the release and it's one of the track artists, just use it and ignore the rest of track artists.
- Deduplicate remix artists in the track artist field.
-
catalognum
:- Prevent matches followed by lowercase ep, lp and va.
- Prevent matches based on label name preceded by of.
-
comment
:- Do not include media description if it duplicates the release description.
-
title
:- Reformat titles such as "Title" by Artist to Artist - Title.
- Do not move remixes to the end of the title.
- Remove the need to parenthesize remixes and handle original titles as they are.
0.20.0 2024-11-04
- Drop support for
beets<1.5
.
-
album
:- Keep remix artist in place within 'remix' parentheses, such as Album (Artist Remix).
- When a release has the same name as the album artist, do not clean/adjust it.
- Keep album artist in album when it is immediately followed by a dot.
- Do not remove EP or LP from the beginning of the album name.
- Remove
V.A
from the beginning of the album name, in the same way we removeVA
. - Do not split album with a year range into
albumartist
andalbum
.
-
cleanup:
- Remove
(... preview)
,free dl
,Name Your Price:
,just out!
,- Album
,(Selected by ...)
from album and track names. - Remove unicode HTML whitespace from incoming data.
- Remove
-
albumtype
:- Identify LP / album type from vinyl media descriptions.
- Remove some funky description parsing logic responsible for multiple ep false positives.
- Resolve either ep or lp to add to
albumtypes
, and never both. - Check for single album type (expecting a single track only) before anything else.
-
artist
: handle remix releases with a single title and its remixes. Instead of trying to determine the artist from the titles, detect such release and use the given album artist. -
catalognum
:- Fix false positives:
- Exclude very short matches like OP-1, SK-1 and BBC6.
- Exclude label name, and label name without spaces.
- Exclude matches followed by a comma. This excludes many artists from release descriptions that happen to have names that look like catalogue numbers.
- Exclude matches followed by a single quote. This used to wrongly match vinyl disc
titles like LABEL 12 here:
LABEL 12'' Black Vinyl
. - Prevent album artists becoming catalogue numbers.
- Remove pattern responsible for many false positives that contain a space, like DOOM 3, ONLY 1 and NIGHT 3 etc.
- Do not any more assume that artist [DRAKEN49] is a catalogue number.
- Instead of using a pattern like
[A-Z]+-[0-9]+
(more than one capital letter — dash — more than one number), explicitly specify how many letters and numbers are expected for the most common variations, like TAR30, RM12012, HEY-101 etc.
- Fix false positives:
-
media
: ignore subscription type Bandcamp media format which returns a duplicate digital media. -
title
:- Remove track number from the beginning of the title more reliably.
- Remove label name from anywhere if it inside brackets or from the end of the title if preceded by a dash or a colon.
-
track
:artist
/title
/track_alt
: Handle edge cases wheretrack_alt
is followed by a single dash. Some instances were previously ignored.artist
/title
/track_alt
: fix several artists and titles which had pieces incorrectly identified astrack_alt
.artist
/title
: In releases where every track has the same title, check whether this title may actually be the artist name. If so, move it to the artist field.artist
/title
: use characters[|-–—]
explicitly for splitting artists and titles.artist
/title
: recover some of the original titles which contain-
and got split intoartist
andtitle
.- Return an empty tracklist for releases that have no tracks.
- Auto-Tagger: Wider search analogous to cli search query as a fallback for bad or missing meta data.
- Add artist list fields support for
beets==2.0.0
.
-
album
:- When album name is wrapped in brackets, [ALBUM], keep the brackets in place.
- Uncover some release names in the description preceded by Title :.
-
albumartist
:- Remove notes about remixes, like (incl. ABC remix) and similar.
-
albumtype
:- Improve accuracy of identifying EP and LP release types from the description.
- Include remix albumtype to the release when remixed track count is one less than the track count.
- Check album names that may end with E.P. instead of EP.
-
catalognum
:-
Add support for new formats:
UVB76-023
,SOP 061-1233
,a+w lp029
,SK11X015
. -
Parse label-like catalogue numbers for singletons too.
-
When searching for a catalogue number which is prefixed by the label name
- Take two variations of the label name
- Original one
- Without Records, Recordings, Productions, Music endings
- Form prefixes from each variation
- The original variation
- With punctuation and spaces removed
- Its acronym when it has multiple words
- Lastly, if the original label has multiple words, use the first word as another possible prefix.
For example, for a label named Diffuse Reality Records, the plugin is able to recognize the following catalogue numbers (case insensitively)
- Diffuse Reality Records001
- DiffuseRealityRecords001
- DRR001
- Diffuse Reality001
- DiffuseReality001
- DR001
- Diffuse001
- Take two variations of the label name
-
Parse catalogue number from the description when the header is followed by a hash symbol, like CAT#: ABC-123.
-
Properly catch catalogue number suffix RP.
-
Relax the rule that looks for a catalogue number within brackets in the release title.
-
-
track
:- For tracks named like [Remixer] - Artist - Title move the remixer to the end: Artist - Title [Remixer].
0.19.3 2024-10-17
- CI: Use
poetry
in the build workflow. - CI: Use
pull_request_target
trigger to make sure secrets are passed to runs in forks.
exclude_extra_fields
: A typo that prevented exclude configurations from being applied correctly
0.19.2 2024-08-04
search
:- properly escape query strings for better results with special characters
- change HTTP client implementation to avoid Bandcamp "403 Forbidden" responses
0.19.1 2024-05-10
- (#58) Relax
beets
dependency requirement.
0.19.0 2024-05-07
- (#56) Support parsing URLs that do not end with .com in item comments when importing music that was bought on Bandcamp.
- Add a new flag to the command line application for searching Bandcamp:
[-p PAGE, --page PAGE]
to enable seeing further search results
0.18.0 2024-04-28
- Dropped support for
python 3.7
.
- (#52) genre: do not fail parsing a release without any keywords, for example https://amniote-editions.bandcamp.com/album/ae-mj-0011-the-collective-capsule-vol-1
- (#54) Ensure that our genre matching rules also apply to genres delimited by a dash, not only space.
-
album
:- handle some edge cases when string EP or LP is followed with data relevant to the album
- do not remove artist or label when it is preceded by
x
or followed by characters'
,_
and&
, or words EP, LP and deluxe - handle apostrophes more reliably
- Do not remove VA or V/A from the beginning when followed by a word or a number
-
album
/title
:- Remove
Various -
from album and track names - Handle this [album sent to us by the devil himself]
- Remove
-
catalognum
:- allow catalogue numbers like Dystopian LP01
- parse a range of catalogue numbers when it is present, for example TFT013SR - TFT-016SR
-
comments
: use valueNone
when there are no comments. In contrast to returning an empty string, this way during beets import the previous comment on the track will be kept if the Bandcamp release does not have a description. -
label
: label is now correctly obtained for single releases when it is available. -
title
:- consider with and w/ as markers for collaborating artists
- remove
bonus -
Artist - Title (bonus - something)
->Artist - Title (something)
[album sent to us by the devil himself]: https://examine-archive.bandcamp.com/album/va-examine-archive-international-sampler-xmn01
0.17.2 2023-08-09
- (#50) Fix art fetching functionality which has essentially been disabled until now.
0.17.1 2023-05-20
- (#44) fix an issue with bundle media formats exclusion logic which would wrongly exclude albums that have bundle in their names
0.17.0 2023-05-20
-
album
:-
Handling unnamed (after removal of catalognum and artist names) split EPs that have two artists. In accordance with title guidelines, the EP is named by separating the artists with a slash.
-
Following the title guidelines, the standard series format now applies to Vol/Vol., Volume, Pt too. Previously we only considered Part.
- Compilation - Volume 2
- Compilation Volume 2 -> Compilation, Volume 2
- If series is in the beginning of the album, it is moved to the end
- Vol. 2 - Compilation -> Compilation, Vol 2
- We also ensure the delimiter for abbreviations, space, and removal of leading zeroes
- Vol02 -> Vol. 2
-
Replace (Remixes) -> Remixes
-
-
albumtype
: the EP albumtype is recognized for split EPs.
-
album
:- Remove + Some remix
- Album
+ Someone's Remix
- Album
- Remove + Some remix
-
catalognum
: do not treat RD-9 (Behringer RD-9) as a catalognum -
title
:- Remove Presented by...
- Title
[Presented by Artist] - Title
(Presented by Artist)
- Title
- Remove preceding number prefix when all album tracks have it and there are two numbers
- 01 Title, Other Title
- 1 Title, 2 Other Title
01Title,02Other Title.
- Remove Presented by...
-
All zero width space characters (
\u200b
) are now removed before parsing. -
album
:- Add many cases of missing EP and LP bits when they are found in the comments
- Fix series numbering format: when it is delimited by some character, keep it.
Otherwise, separate it with a comma
- Album - Part 2
- Album Part 2 -> Album, Part 2
- Tackled some edge cases where label name wrongly stayed on the album
Label:AlbumLabel -Album
- Remove Bonus
0.16.3 2023-02-13
- (#41) fix
re.error: nothing to repeat
caused by missing regex escape
-
album
- Remove ft. and alike
- Album
ft. Another Artist
- Album
- Remove non-alphanumeric chars following VA in the beginning
- VA
-Album
- VA
- Remove V/A from the beginning, same as VA
V/AAlbum
- Remove by and vs
- Album
by Albumartist - Album
by Albumartist vs Another Albumartist
- Album
- Remove Unicode quotes (“”) when they wrap the album name (same as quotes before)
“Album”
- Remove split w when it precedes the albumartist
- Album
Split W Albumartist
- Album
- Keep albumartist when it's preceded by of
- 25 years of
Albumartist-> 25 years of Albumartist
- 25 years of
- Parse album part information and place it within parentheses at the end of album name,
as per MB title guidelines
- Album - Part 123 -> Album (Part 123)
- Remove ft. and alike
-
artist
:- Handle some edge cases of digital-only track title cleanup, like Unreleased Bonus Track or Bonus Track 1. These would previously end up in the artist name
- Remove digital-only artifacts from the artist name too
-
track_alt
: parse track alts like B.1
0.16.2 2022-12-28
- (#40) Improve overall search reliability (#37) and handle alternative domain names, thanks @shagr4th.
- internal/build: make sure tests and linting run on pull requests.
0.16.1 2022-12-17
- (#36) Fix bug with some URLs
- (#37) Fix KeyError: 'url' / searching parsing recaptcha URLs
title
:- When album lists titles in the quoted form (Artist "Title"), split artist from the title correctly.
- Address a long-standing issue with track names of the form Title - Some Mix where we would parse Title as the artist and Some Mix as the title. Such name now gets replaced by Title (Some Mix) which is then attributed correctly.
- Handle remix album where titles of the remixes are not delimited in any way.
album
:- Remove Various Artists (optionally followed by some numbers) from the album name
catalognum
:- Handling some rare edge cases of both false positives and false negatives
title
:- Add missing closing parenthesis for mix/edit titles: Title (Some Mix -> Title (Some Mix)
0.16.0 2022-08-19
- (#34) Handle URLs like
https://bandcamp.materiacollective.com
, thanks @Serene-Arc
- Dropped support for
python 3.6
.
- CLI search: index search results and add flag
-o, --open=INDEX
to open the given result in the browser
-
album
:- remove brackets if the entire album name is wrapped in them
- remove (Single)
- do not remove label from the front if it is not followed by a space
- fix some false positives found in the comments when the album name is followed by EP or LP
-
artist
:- keep the original artist separator in releases with a single track
- replace
//
separator with,
in all cases
-
release_date
: in rare cases when it is not found, use the last modified date
0.15.1 2022-06-19
- Python 3.6 support will end with the next non-patch version update.
-
During import, albums can now again be obtained by their IDs. This functionality has been broken since
v0.14.0
. -
album
: when album name contains Album (Label something), the Label is kept in place -
albumartist
: remove catalogue number from the album artist when it's enclosed in brackets -
catalognum
: handle an edge case where several words from the description get assumed for the catalog number when cat and numbers are both found in the same line. -
genre
: exclude label name, unless it maps to a valid MusicBrainz genre -
title
: handle an edge case where one of the track names contains a utf-8 equivalent of a dash
0.15.0 2022-05-16
-
search:
-
you can now search from the command line:
beetcamp [ [-alt] QUERY | RELEASE-URL ]
-
Search is activated with an argument that does not start with https://. It queries Bandcamp with the provided QUERY and returns a JSON list with all search results from the first page, sorted by relevancy.
-
Flags -a, -l and -t can be used to search for album, label/artist or track specifically.
-
Run
beetcamp -h
to see more details. Example: searching for anything called black sands:$ beetcamp 'black sands' | jq '.[:2]' [ { "type": "album", "name": "Black Sands", "artist": "Bonobo", "date": "2010 March 29", "tracks": "12", "url": "https://bonobomusic.bandcamp.com/album/black-sands", "label": "bonobomusic", "similarity": 1 }, { "type": "album", "name": "Black Sands", "artist": "Appalachian Terror Unit", "date": "2011 August 01", "tracks": "4", "url": "https://appalachianterrorunit.bandcamp.com/album/black-sands", "label": "appalachianterrorunit", "similarity": 1 } ]
-
-
search:
- if
label
field is available, the plugin now takes it into account when it ranks search results. albumartist
field is not used to rank compilations anymore since some labels use label name, some use the list of artists, and others a variation of Various Artists - we cannot reliably tell.label
is used instead.
- if
-
album
: track titles are read to see whether they contain the album name. There are cases where titles have the following format: Title [Album Name EP] -
catalognum
:- search track titles
- do not match if preceded by ] character
- allow catalogue numbers like o-ton 113
- allow a pair, if separated by a slash
/
- removed a pattern responsible for a fair bit of false positives
-
albumtype
:- to determine whether a release is a compilation, check comments for string compilation
- check if all track titles are remixes; if so - include remix albumtype into
albumtypes
-
albumtypes
:- remix: check for string rmx in album name
- compilation: even if a release is an ep, check whether it's also a compilation and include it
-
search: fixed searching of singletons, where the plugin now actually performs search instead of immediately returning the currently selected singleton when option E was selected during the import process
-
album art fetching functionality has been broken for a while - it should now work fine
-
album
: simplified album name clean-up logic and thus fixed a couple of edge cases -
albumartist
: remove , more from the end -
catalognum
: in rare cases, if the track list was given in the comments, one of the track titles would get assumed for the catalognum and subsequently cleaned up. From now on this will only apply if all track names include the match (usually delimited by brackets at the end) -
title
:- track parsing has been refactored, therefore many of previously removed bits from the title are now kept in place, such as bits in parentheses, double quotes (unless they wrap the entire title) or non-alphanumeric characters at the end
- allow titles to start with an opening parentheses 🤯
- when the title is found as (Some Remix) Title, it becomes Title (Some Remix)
-
artist
:- featuring artists given in square brackets are now parsed correctly
- de-duplication now ignores the case
- when only one of the track artists in the release is missing, try splitting the name with - (no spaces) to account for bad formatting
-
track_alt
: track alts with numbers above 6 (like A7) and letters A and B on their own are now extracted successfully
0.14.0 2022-04-18
-
media
:-
previously, we picked the very first Vinyl, CD etc. media available and the rest did not exist from the plugin's point of view. This is now updated: every media which maps to tracks in the same release will get returned, similar to Discogs and MusicBrainz autotaggers.
Therefore,
bandcamp.preferred_media
configuration option does not have any effect anymore and it can be safely removed from your configuration. Instead, use the globalpreferred.media
option and adjust distance weights correspondingly. -
If Vinyl format track list is found in its description, then
track_alt
,medium
,medium_index
andmedium_total
are adjusted accordingly.
-
-
new field
albumtypes
which contains secondary release types, such as remix, live, soundtrack etc. Applies tobeets >= 1.5
-
internal: Tests clean up: pytest fixture-spaghetti has been replaced with JSON files that contain the expected output data.
-
albumtype
: some accuracy improvements- For Vinyl media, all
disctitle
s are checked for EP or LP presence - EP, LP and album words in release and media descriptions are counted and decide the album type
- If compilation or best of or anniversary is present in the album name, then the release is a compilation
- A single album with an original track and several remixes now has album albumtype, not single
- For Vinyl media, all
-
albumartist
: every release with more than 3 artists will now have Various Artists (orva_name
) albumartist
album
:- Remove EP or LP from the beginning more reliably
- Only remove VA if album name starts or ends with it
0.13.2 2022-04-03
- Fixed importing of officially purchased Bandcamp tracks which have Visit {label_url} in their
COMMENT
field (at least for FLAC files) when the album name does not contain a single ASCII alphanumeric character. We here use the album name to guess the release url, and in this case the plugin has previously been failing to take into account this edge case and failed the import process immediately.
0.13.1 2022-04-03
- search: changes introduced in 0.13.0 broke searching functionality for
python 3.6-3.8
due to changes in the built-indifflib
library. This has been fixed and tests for the searching logic are now added. Thanks to @emanuele-virgillito for reporting the issue.
0.13.0 2022-03-22
-
search: considerable improvements in search results accuracy
- Release name and artist is parsed for each found release
- They are compared to what's being queried and sorted by best matches
- Therefore, from now on we will check the first search results page only which should yield up to 18 results.
search_max
parameter is now 2 by default, - in most cases you should get by fine with it being set to 1. This will make the search nearly instant and reduce the loads that Bandcamp need to deal with.
-
Python 3.10 is now supported.
-
album
-
search priority: step by step until the first one which is found:
- Whatever follows Title: in the release description
- Something in single or double quotes in the release title
- If EP or LP is in the release title, whatever precedes it having removed
catalognum
and artists - Whatever is left in the release title having removed
catalognum
and artists - Whatever precedes EP or LP string in the release description
catalognum
- The entire initial release title
-
remove (digital album) and (album) from the album name
Some Album (album) -> Some Album
-
-
label
: strip quotes if sourced from the description -
artist
/albumartist
: remove remixers from artists fieldstitle: Choone (Some Remix) -> title: Choone (Some Remix) artist: Artist, Some -> artist: Artist
-
artist
/title
: featuring artists are moved fromtitle
to theartist
fieldartist: Artist -> Artist ft. Some title: Title ft. Some -> Title
-
singleton
: do not populatealbumstatus
,index
,medium_index
,medium
,medium_total
fields
-
artist
/track_alt
:- artists like B2 and A4 are not anymore assumed to be
track_alt
whentrack_alt
is not present in any other track in that release.
# name: B2 - Some Title title: Some Title -> Some Title track_alt: B2 -> artist: -> B2
- and other way around,
track_alt
like A or B are correctly parsed iftrack_alt
was found for the rest of the tracks
- artists like B2 and A4 are not anymore assumed to be
-
catalognum
: catalogue numbers starting with VA are not anymore ignored, unless VA is followed by numbers. VA02 is still ignored while VAHELLO001 is now parsed correctly. -
Fixed Github workflow which tests the package across various python and
beets
versions: they should now fail reliably. Dependencies are from now on cached, so they run fairly quickly. -
Clarified that
preferred_media
should include Digital Media (not Digital) in the README.
0.12.0 2022-02-10
-
album
: following MusicBrainz title format specification, strings EP and LP are from now on kept in place in album names. -
catalognum
: To find the catalog number, we have previously been looking at the release title and pointers such as Catalogue Number: within the release description.In addition to the above, we now apply a generic search pattern across the rest of the text, including media title, media description and the rest of the release description.
For those interested, at a high level the pattern used in the search looks like below
( [A-Z .]+\d{3} # HANDS D300 | [A-z ][ ]0\d{2,3} # Persephonic Sirens 012 | [A-Z-]{2,}\d+ # RIV4 | [A-Z]+[A-Z.$-]+\d{2,} # USE202, HEY-101, LI$025 | [A-Z.]{2,}[ ]\d{1,3} # OBS.CUR 9 | \w+[A-z]0\d+ # 1ØPILLS018, fa036 | [a-z]+(cd|lp)\d+ # ostgutlp45 | [A-z]+\d+-\d+ # P90-003 ) ( # optionally followed by [ ]?[A-Z] # IBM001V | [.][0-9]+ # ISMVA002.1 | -?[A-Z]+ # PLUS8024CD )?
-
albumtype
: similar to thecatalognum
, the descriptions are searched for EP and LP strings presence to find out thealbumtype
. -
track
: Support for tracks that do not use dash (-
) but some other character to separate pieces of information in track names. For example, consider the following tracklist:A1 | WHITESHADOWHURTS x TOXICSPIKEBACK | Arcadia A2 | WHITESHADOWHURTS | Corrupted Entity A3 | WHITESHADOWHURTS | Colosseo B1 | TOXICSPIKEBACK | Eclipse B2 | TOXICSPIKEBACK | Eclipse [DJ LINT's Tribe Mix] B3 | WHITESHADOWHURTS | Corrupted Entity [OAT.M's Oldschool Mix]
beetcamp
now finds that|
is being used as the delimiter and parses values fortrack_alt
,artist
andtitle
accordingly.
- singleton:
album
andalbumartist
fields are not anymore populated. catalognum
: artists like PROCESS 404 are not assumed to be catalogue numbers anymore.track_alt
: allow non-capital letters, like a1 to be parsed and convert them to capitals.albumartist
: use Various Artists (or equivalent) when a release includes more than four different artists. Until now we've only done so for compilations.genre
: genres are now sorted alphabetically
- Support for
beets<1.5
has been broken since0.11.0
, - it should now work fine. However, fields such ascomments
andlyrics
are not available, and album-like metadata likecatalognum
is not available for singletons. Thanks @zane-schaffer for reporting this issue (Closes #22). singleton
:catalognum
, if found, is now reliably removed from the title.track.title
:-
delimiter is handled more appropriately when it is found in the song title.albumartist
: for the Various Artists releases, the plugin will now use the globally configuredva_name
field instead of hard-coding Various Artists.artist
: Recent bandcamp updates of the JSON data removed artists names from most of compilations, therefore we are again having a peek at the raw HTML data to fetch the data from there.
0.11.0 2021-11-12
-
An entrypoint for
beetcamp
: if the package is in your$PATH
, bandcamp metadata can be obtained directly as a JSONbeetcamp <bandcamp-url> # {"album": "some album", ..., "tracks": [{"title": ...}, ... ]}
This has mostly been useful in scripts: for example, in my case it bridges the metadata gap between mpd and a last.fm scrobbler in those cases when music has not yet made it into the beets library.
-
Two more MusicBrainz fields now get populated:
style
: the tag/genre that bandcamp categorize the release asgenre
: comma-delimited list of release keywords that match any musicbrainz genres.
This comes with some configuration options, see the defaults below:
bandcamp: ... genre: capitalise: no maximum: 0 # no limit always_include: [] mode: progressive # classical, progressive or psychedelic
See the readme for information about the different options.
-
New configuration option
comments_separator
to separate release, media descriptions and credits. Default:\n---\n
. Comments formatting has been changing with every release without a good reason - this should stop. Ultimately it is one's personal choice how they want the formatting to look like.
-
excluded_extra_fields
configuration option has been extended to support every track field and most of album fields. See the readme for more information. -
The hook for additional data has been removed since
lyrics
anddescription
are now retrieved immediately. They can be inspected like every other field, through, for example, theedit (C)andidates
action during the import. -
track_alt
: allow B2 Title where B2 is followed by a space -
catalognum
: include Catalog: as a valid header when parsing the description -
track.title
digital-only cleanup, remove:- DIGITAL and Bonus from the front of the title
- digital-only and (digital) from the end
lyrics
: instead of parsing the HTML, lyrics are now reliably retrieved from the JSON data and added to each track where applicable.- Nowadays it is possible that the
datePublished
field is not given in the release JSON data - this is now handled gracefully.
0.10.1 2021-09-13
-
Fixed #18 by handling cases when a track duration is not given.
-
Fixed #19 where artist names like SUNN O))) would get incorrectly mistreated by the album name cleanup logic due to multiple consecutive parentheses. The fix involved adding some rules around it: they are now deduplicated only if
- they are preceded with a space
- or they enclose remix / edit info and are the last characters in the album name
-
Fixed #20 where dynamically obtained label names used in further REs caused
re.error
since they were not appropriately escaped (for example,label: /m\ records
).
Thanks @arogl for reporting each of the above!
album
: Keep label in the album name if it's immediately followed by an apostrophe. An example scenario:label: Mike
album: Mike's Creations
-
General
- Fixed the logic which fetches the additional data fields (
comments
andlyrics
). It used to cause unwanted behavior since it wrote the files whenwrite: yes
was enabled in the beets config. Now, it's activated through theimport_task_apply
hook and adjusts the metadata (beets db) without ever touching the files directly. - Unexpected errors are now printed instead of causing
beets
to quit immediately.
- Fixed the logic which fetches the additional data fields (
-
track.track_alt
: handleA1 - Title
andA1 - Artist - Title
when alt index is not followed by a full stop. -
track.title
:- Handle cases like Artist -Title / Artist- Title when there is no space between the dash and the title or artist
- Fixed digital only cleaner which would previously remove the string Only when it's found on its own
- Accept ¯\(ツ)/¯ as valid title
- Clean up ( Remix ) -> (Remix)
-
country
: Washington, D.C. and South Korea have not been parsed correctly and thus releases have been defaulting to XW. This is now fixed.
-
catalognum
:- Treat VA[0-9]+, vinyl [0-9]+, triple [0-9]+, ep 12 as invalid (case insensitive)
- Handle single digits (like ROAD4) as valid (until now we required at least two)
- Handle catalognums in parentheses, like (ISM001)
- Handle a period or a dash in the non-digit part, like OBS.CUR 12, O-TON 113
- Allow a single capital letter after the digits, like IBM001V
- Allow the catalognum to start with a non-capital letter, like fa010
-
album
andtrack.title
: little clean up: replace multiple consecutive spaces with a single one and remove all double quotes -
album
:- Only remove label from the album name if
albumtype
is not a compilation - Remove (FREE), (FREE DL), VA, (Incl. some artists remixes) and alike
- Improved the way Various Artists are cleaned up when catalognum is available
- Only remove label from the album name if
-
albumartist
:- If various is specified as the albumartist, make it Various Artists
- When the label have set their name as the albumartist in every field, and if the actual albumartist could be inferred from the album name, use the inferred name.
- If all release tracks have the same artist, assume they are the albumartist
-
albumtype
: treat 4-track release as a valid candidate for a compilation / VA albumtype
- Bandcamp json updates:
release_date
:datePublished
field now tells the correct release date so now we use it instead of parsing the plain html.label
: some releases embed therecordLabel
field into the json data - it now gets prioritized over the publisher name when it is available.
track.title
: clean up*digital only*
properly. Previously we did not account for asterisks
- A regression from
0.9.2
which caused double initialization of the plugin. If your initial tracks metadata has the album name, the results should again be returned instantly. - Searching by release ID where the ID is not a bandcamp URL should now be ignored by the plugin. Thanks @arogl.
- Thanks @arogl for fixing a FutureWarning apparent thrown in Python 3.7.
- Thanks @brianredbeard for reporting that the plugin writes file metadata even when this is disabled globally. This is now fixed.
- singleton album/artist: cases when the release name contains only the track name are now parsed correctly.
- Removed deprecated
lyrics
configuration option.
- Added a github action to run CI for
master
anddev
branches. For now it's just a minimal configuration and will probably get updated soon.
album.albumstatus
: If the release date is today, use Official and not Promotional.album.albumtype
:- Until now we have only set single track releases to have the single type. This has been fixed regarding the MusicBrainz description: release composed of the same title and multiple remixes is a single.
- Use
ep
only if EP is mentioned either in the album name or the disc title.
album.catalognum
: Make the DISCTITLE uppercase before looking for the catalog number.album.media
: Exclude anything that contains bundle in their names. These usually contain additional releases that we do not need.track.title
: Clean- DIGITAL ONLY
(and similar) when it's preceded by a dash and not enclosed by parentheses or square brackets.track.track_alt
: Having witnessed a very creative track title E7-E5, limit thetrack_alt
field number to the range 0-6.- Committed a JSON test case which was supposed to be part of
0.9.0
.
- Extend
url2json
with--tracklist-for-tests
to ease adding new test cases.
- If track artist is given in the
byArtist
field of the track JSON resource, it is used. (Fixes #13, thanks @xeroxcat). - Parse cases like
Catalogue:CAT-000
from the description correctly when the space is missing.
- The
comments
field now includes the media description and credits. - The description is searched for artist and album names in addition to the catalog number.
-
All test cases are now pretty JSON files - this should bring more transparency around the adjustments that Bandcamp make in the future (once they get updated). The
url2json
tool has-u
flag that updates them automatically. -
Parsing
(FREE)
,(free download)
-like strings are now removed from the track names.[Vinyl]
is excluded from album names.
-
Responded to bandcamp html updates:
artist_id
now lies underpublisher
resource (previouslybyArtist
) in the/track/<name>
output when the track is part of an album.url
field has disappeared from track objects - using@id
instead.country
andlabel
fields are now found in the JSON data and thus we make use of it- Updated and truncated test html files since we now only need to see the beginning of the document.
-
Parsing / logic:
- Token
feat.
is now recognized as a valid member of theartist
field. free download
,[EP|LP]
,(EP|LP)
,E.P.
,LP
are now cleaned from the album name.- Updated
albumtype
logic: in somecompilation
cases track artists would go missing and get set to Various Artists - instead it now defaults to the originalalbumartist
. - Handling a couple of edge cases in the track name / title, and catalognum parsers.
- Token
-
Package:
- Moved
beets
from main to dev dependencies. - Updated supported python versions range (3.6.x-3.9.x)
- Added pylint.
- Removed dependency on
packaging
- usingpkg_resources
instead.
- Moved
-
Internal:
- Reintroduced
@cached_property
across most of the fields having found how often certain ones get called.
- Reintroduced
- Release description is now checked for the catalog number.
- Added a test based on parsing the JSON output directly without having to parse the entire HTML. Bandcamp have been moving away from HTML luckily, so let's hope the trend continues.
- Added a tiny cmd-line tool
url2json
which simply outputs either a compacted or a human version of the JSON data that is found for the given bandcamp URL.
- Fixed singleton regression where track list was getting read incorrectly.
-
For those who use
beets >= 1.5.0
, singleton tracks are now enriched with similar metadata to albums (depending on whether they are found of course):album
: Artist - Track usuallyalbumartist
albumstatus
albumtype
:single
catalognum
country
label
medium
,medium_index
,medium_total
- release date:
year
,month
,day
-
Album names get cleaned up. The following, if found, are removed:
- Artist name (unless it's a singleton track)
- Label name
- Catalog number
- Strings
- Various Artists
- limited edition
- EP (only if it is preceded by a space)
- If any of the above are preceded/followed by - or | characters, they are removed together with spaces around them (if they are found)
- If any of the above (except EP) are enclosed in parentheses or square brackets, they are also removed.
Examples:
Album - Various Artists -> Album Various Artists - Album -> Album Album EP -> Album [Label] Album EP -> Album Artist - Album EP -> Album Label | Album -> Album Album (limited edition) -> Album
-
Added recommended installation method in the readme.
-
Added tox tests for
beets < 1.5
andbeets > 1.5
for python versions from 3.6 up to 3.9. -
Sped up re-importing bandcamp items by checking whether the URL is already available before searching.
-
Parsing: If track's name includes bandcamp digital (bonus|only) etc., bandcamp part gets removed as well.
- Internal simplifications regarding
beets
version difference handling.
- Parsing: country/location name parser now takes into account punctuation such as in
St. Louis
- it previously ignored full stops.
-
Until now, the returned fields have been limited by what's available in search-specific
TrackInfo
andAlbumInfo
objects. The marks the first attempt of adding information to library items that are available at later import stages.If the
comments
field is empty or containsVisit <artist-page>
, the plug-in populates this field with the release description. This can be reverted by including it in a newexclude_extra_fields
list option.
lyrics
configuration option is now deprecated and will be removed in one of the upcoming releases (0.8.0 / 0.9.0 - before stable v1 goes out). If lyrics aren't needed, it should be added to theexclude_extra_fields
list.
- The
albumartist
that would go missing for thebeets 1.5.0
import stage has now safely returned.
- For the case when a track or an album is getting imported through the id / URL mode, we now check whether the provided URL is a Bandcamp link. In some cases parsing foreign URLs results in decoding errors, so we'd like to catch those URLs early. Thanks @arogl for spotting this.
-
Bandcamp updated their html format which broke track duration parsing. This is now fixed and test html files are updated.
-
Fixed track name parser which would incorrectly parse a track name like
24 hours
, ignoring the numbers from the beginning of the string. -
Locations that have non-ASCII characters in their names would not be identified (something like Montreal, Québec) - now the characters are converted and
pycountry
does understand them. -
Fixed an edge case where an EP would be incorrectly misidentified as an album.
- Catalog number parser now requires at least two digits to find a good match.
- Country name overrides for Russia and The Netherlands which deviate from the official names.
- Track names:
- If digital and exclusive are found in the name, it means it's digital-only.
- Artist / track splitting logic now won't split them on the dash if it doesn't have spaces on both sides.
track_alt
field may now contain numerical values if track names start with them. Previously, only vinyl format was supported with theA1
/B2
notation.
- Previously skipped, not-yet-released albums are now handled appropriately. In such
cases,
albumstatus
gets set to Promotional, and the release date will be a future date instead of past.
-
Handle a sold-out release where the track listing isn't available, which would otherwise cause a KeyError.
-
Catalog number parser should now forget that cassette types like C30 or C90 could be valid catalog numbers.
- Brought dev dependencies up-to-date.
- For data that is parsed directly from the html, ampersands are now correctly unescaped.
- On Bandcamp merch is listed in the same list together with media - this is now taken into account and merch is ignored. Previously, some albums would fail to be returned because of this.
- Fixed readme headings where configuration options were shown in capitals on
PyPI
.
- Added some functionality to exclude digital-only tracks for media that aren't
Digital Media. A new configuration option
include_digital_only_tracks
, if set toTrue
will include all tracks regardless of the media, and if set toFalse
, will mind, for example, a Vinyl media and exclude tracks that have some sort of digital only flag in their names, likeDIGI
,[Digital Bonus]
,[Digital Only]
and alike. These flags are also cleared from the track names.
- For LP Vinyls, the disc count and album type are now corrected.
release_date
search pattern now looks for a specific date format, guarding it against similar matches that could be found in the description, thanks @noahsager.
- Handled a
KeyError
that would come up when looking for an album/track where the block describing available media isn't found. Thanks @noahsager.
- Info logs are now
DEBUG
logs so that they're not printed without the verbose mode, thanks @arogl.
catalognum
parser used to parseVol.30
orChristmas 2020
as catalog number - these are now excluded. It's likely that additional patterns will come up later.
- Added the changelog.
- Fixed installation instructions in the readme.
- The pipeline now uses generators, therefore the plug-in searches until it finds a good fit and won't continue further (same as the musicbrainz autotagger)
- Extended the parsing functionality with data like catalog number, label, country etc. The full list is given in the readme.