Skip to content

Releases: AbsaOSS/atum-service

v0.3.0

25 Oct 14:22
1b3a032
Compare
Choose a tag to compare

Breaking Changes 💥

  • Additional data methods of AtumContext uses REST API v2 (incompatibility of Agent of 0.3.0+ with server 0.2.0) by @lsulak in #283
  • Full Flyway integration developed by @benedeki in #276

New Features 🎉

  • Atum server REST API v2 developed by @salamonpavel, @TebaleloS, @lsulak, @benedeki in #140

    • GET /partitionings?partitioning=serializedPartitioning in #268
    • GET /partitionings/{partId}/additional-data in #227
    • PATCH /partitionings/{partId}/additional-data in #221
    • there are many more, but they are silent live for now
  • Introduced response envelopes providing additional metadata (requestId) for REST API v2 endpoints by @salamonpavel in #197

  • Replaced Json4s and Jackson serialization libraries with Circe by @TebaleloS, @salamonpavel, @benedeki in #214

  • Introduced health API endpoint in a form StatusBoard projects expects by @salamonpavel in #282

  • Dockerfile and application configuration verified for deployment with ZIO and Http4s web server by @salamonpavel in #274

  • Dockerfile adjusted to ZIO framework and custom configuration now being passed during docker run, i.e. independent of the sbt build and docker build by @lsulak in #279

Silent Live 🤫

  • Introduced the Reader module to make reading of information stored in Atum server easy. by @benedeki in #248 (not publsiehd yet, only in code-base)
  • Atum server REST API v2 endpoints developed by @salamonpavel, @TebaleloS, @lsulak, @benedeki in #140
    • There are numerous other endpoints implemented beside those mentioned above. We yet discourage from their usage though, as they are subject to change, particularly their payloads.

Known Issues ⚠️

  • Dependency shading might be needed when using the Agent in Spark environment (especially when some Hadoop dependencies are in use as well, for example if you package the application that contains the Agent and use such JAR in Spark Submit command). Here's the suggested project code snippet for Maven:
            <plugin>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-shade-plugin</artifactId>
                <version>${maven.shade.plugin.version}</version>
                <executions>
                    <execution>
                        <phase>package</phase>
                        <goals>
                            <goal>shade</goal>
                        </goals>
                    </execution>
                </executions>
                <configuration>
                    <relocations>
                        <relocation>
                            <pattern>okhttp3</pattern>
                            <shadedPattern>shaded.okhttp3</shadedPattern>
                        </relocation>
                        <relocation>
                            <pattern>okio</pattern>
                            <shadedPattern>shaded.okio</shadedPattern>
                        </relocation>
                        <relocation>
                            <pattern>sttp</pattern>
                            <shadedPattern>shaded.sttp</shadedPattern>
                        </relocation>
                        <relocation>
                            <pattern>cats</pattern>
                            <shadedPattern>shaded.cats</shadedPattern>
                        </relocation>
                        <relocation>
                            <pattern>shapeless</pattern>
                            <shadedPattern>shaded.shapeless</shadedPattern>
                        </relocation>
                        <relocation>
                            <pattern>kotlin</pattern>
                            <shadedPattern>shaded.kotlin</shadedPattern>
                        </relocation>
                    </relocations>
                    <filters>
                        <filter>
                            <artifact>*:*</artifact>
                            <excludes>
                                <exclude>META-INF/*.SF</exclude>
                                <exclude>META-INF/*.DSA</exclude>
                                <exclude>META-INF/*.RSA</exclude>
                            </excludes>
                        </filter>
                    </filters>
                </configuration>
            </plugin>

Full Changelog

v0.2.0...v0.3.0

v0.1.1

03 Sep 11:41
cefe63c
Compare
Choose a tag to compare

This version fixes the configuration of the application in the Dockerized environment.

Bugfixes 🛠

  • Wrong format of application.properties file by @lsulak in #130
  • Renaming the application.properties file to be just a template and not the real one, the docker image MUST provide it by @lsulak in #131
  • Bugfix/remove hardcoded application.properties references from the code and adding ability to use config from SPRING_CONFIG_LOCATION env var by @lsulak in #132

Full Changelog: v0.1.0...v0.1.1

v0.2.0

08 Jun 06:58
e4e0420
Compare
Choose a tag to compare

Breaking Changes 💥

  • Dropped support of Spark 2.4 by @benedeki, @lsulak, @salamonpavel in #193
  • Server moved from Spring to Zio/Tapir by @salamonpavel in #145
    • As the application has Http4s Blaze server backend included now there is no need for any servlet container like Tomcat.
    • The application is packaged as JAR file and run directly using java-jar.
  • Server requires Java 11 platform by @salamonpavel in #151
  • The groupId of the libraries changed from za.co.absa to za.co.absa.atum-service.atum-service

New Features 🎉

Bugfixes 🛠

Full Changelog

v0.1.0...v0.2.0

v0.1.0

17 Nov 21:35
ee7b7de
Compare
Choose a tag to compare

Initial release of the Atum service

Server

  • has two endpoints
    • /api/v1/createPartitioning to register or retrieve a partitioning and optionally establish a a relation with another partitioning
    • /api/v1/createCheckpoint to record measurement data
  • connects to Postgres DB that stores the data
  • newly created partitioning automatically contains the count function to measure

Agent

  • spawn context based on key provided (partitioning)
  • add measuring functions; supported now are:
    • count
    • distinctCount
    • aggregatedTotal - sum of values in the column
    • absAggregatedTotal
    • hashCrc32
  • provides interfaces to measure data completeness on DataFrames (create checkpoints)

Database

  • created, including DB Roles and an ownership model of the database objects for the Roles
  • stores and processes data related to:
    • Partitioning
    • Additional Data
    • Measurement
    • Measure Definition
    • Checkpoint
    • Flow a concept how to describe the data as they go through the systems and how different partitionings relate.