Releases: snejus/beetcamp
0.12.0 - 2022-02-10
Added
album
: following MusicBrainz title format specification, strings EP and LP are from now on kept in place in album names.catalognum
: To find the catalog number, we have previously been looking at the release title and pointers such as Catalogue Number: within the release description.
In addition to the above, we now apply a generic search pattern across the rest of the text, including media title, media description and the rest of the release description.
For those interested, at a high level the pattern used in the search looks like below
(
[A-Z .]+\d{3} # HANDS D300
| [A-z ][ ]0\d{2,3} # Persephonic Sirens 012
| [A-Z-]{2,}\d+ # RIV4
| [A-Z]+[A-Z.$-]+\d{2,} # USE202, HEY-101, LI$025
| [A-Z.]{2,}[ ]\d{1,3} # OBS.CUR 9
| \w+[A-z]0\d+ # 1ØPILLS018, fa036
| [a-z]+(cd|lp)\d+ # ostgutlp45
| [A-z]+\d+-\d+ # P90-003
)
( # optionally followed by
[ ]?[A-Z] # IBM001V
| [.][0-9]+ # ISMVA002.1
| -?[A-Z]+ # PLUS8024CD
)?
-
albumtype
: similar to thecatalognum
, the descriptions are searched for EP and LP strings presence to find out thealbumtype
. -
track
: Support for tracks that do not use dash (-
) but some other character to separate pieces of information in track names. For example, consider the following tracklist:
A1 | WHITESHADOWHURTS x TOXICSPIKEBACK | Arcadia
A2 | WHITESHADOWHURTS | Corrupted Entity
A3 | WHITESHADOWHURTS | Colosseo
B1 | TOXICSPIKEBACK | Eclipse
B2 | TOXICSPIKEBACK | Eclipse [DJ LINT's Tribe Mix]
B3 | WHITESHADOWHURTS | Corrupted Entity [OAT.M's Oldschool Mix]
beetcamp
now finds that |
is being used as the delimiter and parses values for track_alt
, artist
and title
accordingly.
Updated
- singleton:
album
andalbumartist
fields are not anymore populated. catalognum
: artists like PROCESS 404 are not assumed to be catalogue numbers anymore.track_alt
: allow non-capital letters, like a1 to be parsed and convert them to capitals.albumartist
: use Various Artists (or equivalent) when a release includes more than four different artists. Until now we've only done so for compilations.genre
: genres are now sorted alphabetically
Fixed
- Support for
beets<1.5
has been broken since0.11.0
, - it should now work fine. However, fields such ascomments
andlyrics
are not available, and album-like metadata likecatalognum
is not available for singletons. Thanks @zane-schaffer for reporting this issue (Closes #22). singleton
:catalognum
, if found, is now reliably removed from the title.track.title
:-
delimiter is handled more appropriately when it is found in the song title.albumartist
: for the Various Artists releases, the plugin will now use the globally configuredva_name
field instead of hard-coding Various Artists.artist
: Recent bandcamp updates of the JSON data removed artists names from most of compilations, therefore we are again having a peek at the raw HTML data to fetch the data from there.
0.11.0 - 2021-11-12 - Added genre, entrypoint
0.11.0 2021-11-12
Added
-
An entrypoint for
beetcamp
: if the package is in your$PATH
, bandcamp
metadata can be obtained directly as a JSONbeetcamp <bandcamp-url> # {"album": "some album", ..., "tracks": [{"title": ...}, ... ]}
This has mostly been useful in scripts: for example, in my case it bridges the metadata
gap between mpd and a last.fm scrobbler in those cases when music has not yet made it
into the beets library. -
Two more MusicBrainz fields now get populated:
style
: the tag/genre that bandcamp categorize the release asgenre
: comma-delimited list of release keywords that match any musicbrainz
genres.
This comes with some configuration options, see the defaults below:
bandcamp: ... genre: capitalise: no maximum: 0 # no limit always_include: [] mode: progressive # classical, progressive or psychedelic
See the readme for information about the different options.
-
New configuration option
comments_separator
to separate release, media
descriptions and credits. Default:\n---\n
. Comments formatting has been
changing with every release without a good reason - this should stop. Ultimately it is
one's personal choice how they want the formatting to look like.
Updated
-
excluded_extra_fields
configuration option has been extended to support every track
field and most of album fields. See the readme for more information. -
The hook for additional data has been removed since
lyrics
anddescription
are now
retrieved immediately. They can be inspected like every other field, through, for
example, theedit (C)andidates
action during the import. -
track_alt
: allow B2 Title where B2 is followed by a space -
catalognum
: include Catalog: as a valid header when parsing the description -
track.title
digital-only cleanup, remove:- DIGITAL and Bonus from the front of the title
- digital-only and (digital) from the end
Fixed
lyrics
: instead of parsing the HTML, lyrics are now reliably retrieved from the JSON
data and added to each track where applicable.- Nowadays it is possible that the
datePublished
field is not given in the release JSON
data - this is now handled gracefully.
0.10.1 - 2021-09-13
Fixed
-
Fixed #18 by handling cases when a track duration is not given.
-
Fixed #19 where artist names like SUNN O))) would get incorrectly mistreated by the album name cleanup logic due to multiple consecutive parentheses. The fix involved adding some rules around it: they are now deduped only if
- they are preceded with a space
- or they enclose remix / edit info and are the last characters in the album name
-
Fixed #20 where dynamically obtained label names used in further REs caused
re.error
since they were not appropriately escaped (for example,label: /m\ records
).
Thanks @arogl for reporting each of the above!
-
album
: Keep label in the album name if it's immediately followed by an apostrophe. An example scenario:label: Mike
album: Mike's Creations
0.10.0 - 2021-09-10
Fixed
-
General
- Fixed the logic which fetches the additional data fields (
comments
andlyrics
). It
used to cause unwanted behavior since it wrote the files whenwrite: yes
was
enabled in the beets config. Now, it's activated through theimport_task_apply
hook
and adjusts the metadata (beets db) without ever touching the files directly. - Unexpected errors are now printed instead of causing
beets
to quit immediately.
- Fixed the logic which fetches the additional data fields (
-
track.track_alt
: handleA1 - Title
andA1 - Artist - Title
when alt index is not
followed by a full stop. -
track.title
:- Handle cases like Artist -Title / Artist- Title when there is no space between
the dash and the title or artist - Fixed digital only cleaner which would previously remove the string Only when
it's found on its own - Accept ¯\(ツ)/¯ as valid title
- Clean up ( Remix ) -> (Remix)
- Handle cases like Artist -Title / Artist- Title when there is no space between
-
country
: Washington, D.C. and South Korea have not been parsed correctly and
thus releases have been defaulting to XW. This is now fixed.
Updated
-
catalognum
:- Treat VA[0-9]+, vinyl [0-9]+, triple [0-9]+, ep 12 as invalid (case
insensitive) - Handle single digits (like ROAD4) as valid (until now we required at least two)
- Handle catalognums in parentheses, like (ISM001)
- Handle a period or a dash in the non-digit part, like OBS.CUR 12, O-TON 113
- Allow a single capital letter after the digits, like IBM001V
- Allow the catalognum to start with a non-capital letter, like fa010
- Treat VA[0-9]+, vinyl [0-9]+, triple [0-9]+, ep 12 as invalid (case
-
album
andtrack.title
: little clean up: replace multiple consecutive spaces with a
single one and remove all double quotes -
album
:- Only remove label from the album name if
albumtype
is not a compilation - Remove (FREE), (FREE DL), VA, (Incl. some artists remixes) and alike
- Improved the way Various Artists are cleaned up when catalognum is available
- Only remove label from the album name if
-
albumartist
:- If various is specified as the albumartist, make it Various Artists
- When the label have set their name as the albumartist in every field, and if the
actual albumartist could be inferred from the album name, use the inferred name. - If all release tracks have the same artist, assume they are the albumartist
-
albumtype
: treat 4-track release as a valid candidate for a compilation / VA albumtype
0.9.3 - 2021-08-01 Handle non-bandcamp URLs, use updated release_date and label name
Updated
- Bandcamp json updates:
release_date
:datePublished
field now tells the correct release date so now we use
it instead of parsing the plain html.label
: some releases embed therecordLabel
field into the json data - it now gets
prioritized over the publisher name when it is available.
track.title
: clean up*digital only*
properly. Previously we did not account for
asterixes
Fixed
- A regression from
0.9.2
which caused double initialization of the plugin. If your
initial tracks metadata has the album name, the results should again be returned
instantly. - Searching by release ID where the ID is not a bandcamp URL should now be ignored by the
plugin. Thanks @arogl.
0.9.2 - Fixed writing of the metadata
Fixed
- Thanks @arogl for fixing a FutureWarning apparent thrown in Python 3.7.
- Thanks @brianredbeard for reporting that the plugin writes file metadata even when this
is disabled globally. This is now fixed. - singleton album/artist: cases when the release name contains only the track name are now
parsed correctly.
Removed
- Removed deprecated
lyrics
configuration option.
Added
- Added a github action to run ci for
master
anddev
branches. For now it's just a minimal
configuration and will probably get updated soon.
0.9.1 - 2021-06-04
Fixed
album.albumstatus
: If the release date is today, use Official and not Promotional.album.albumtype
:- Until now we have only set single track releases to have the single type. This has been fixed regarding the MusicBrainz description: release composed of the same title and multiple remixes is a single.
- Use
ep
only if EP is mentioned either in the album name or the disc title.
album.catalognum
: Make the DISCTITLE uppercase before looking for the catalogue number.album.media
: Exclude anything that contains bundle in their names. These usually contain additional releases that we do not need.track.title
: Clean- DIGITAL ONLY
(and similar) when it's preceded by a dash and not enclosed by parens or square brackets.track.track_alt
: Having witnessed a very creative track title E7-E5, limit thetrack_alt
field number to the range 0-6.- Committed a JSON testcase which was supposed to be part of
0.9.0
.
Added
- Extend
url2json
with--tracklist-for-tests
to ease adding new testcases.
0.9.0 - 2021-06-01
Fixed
- If track artist is given in the
byArtist
field of the track JSON resource, it is used.
(Fixes #13, thanks @xeroxcat). - Parse cases like
Catalogue:CAT-000
from the description correctly when the space is missing.
Added
- The
comments
field now includes the media description and credits. - The description is searched for artist and album names in addition to the catalogue
number.
Updated
-
All testcases are now pretty JSON files - this should bring more transparency around
the adjustments that Bandcamp make in the future (once they get updated). Theurl2json
tool has-u
flag that updates them automatically. -
Parsing
-
(FREE)
,(free download)
-like strings are now removed from the track names. -
[Vinyl]
is excluded from album names.
0.8.0 - 2021-04-20
0.8.0 - 2021-04-20
Fixed
-
Responded to bandcamp html updates:
artist_id
now lies underpublisher
resource (previouslybyArtist
) in the
/track/<name>
output when the track is part of an album.url
field has disappeared from track objects - using@id
instead.country
andlabel
fields are now found in the JSON data and thus we make use of it- Updated and truncated test html files since we now only need to see the beginning of
the document.
-
Parsing / logic:
- Token
feat.
is now recognised as a valid member of theartist
field. free download
,[EP|LP]
,(EP|LP)
,E.P.
,LP
are now cleaned from the album name.- Updated
albumtype
logic: in somecompilation
cases track artists would go missing
and get set to Various Artists - instead it now defaults to the original
albumartist
. - Handling a couple of edge cases in the track name / title, and catalognum parsers.
- Token
Updated
-
Package:
- Moved
beets
from main to dev dependencies. - Updated supported python versions range (3.6.x-3.9.x)
- Added pylint.
- Removed dependency on
packaging
- usingpkg_resources
instead.
- Moved
-
Internal:
- Reintroduced
@cached_property
across most of the fields having found how often certain
ones get called.
- Reintroduced
Added
- Release description is now checked for the catalogue number.
- Added a test based on parsing the JSON output directly without having to parse the
entire HTML. Bandcamp have been moving away from HTML luckily, so let's hope the trend
continues. - Added a tiny cmd-line tool
url2json
which simply outputs either a compacted or a
human version of the JSON data that is found for the given bandcamp URL.
0.7.1 - 2021-03-15
Fixed
- Fixed singleton regression where track list was getting read incorrectly.