#188: new Flow service in API v2 #195

lsulak · 2024-04-30T15:28:55Z

created new controller, service, and repository: Flow
new REST API, v2: get checkpoints (with measurements) of a flow, based on a partitioning
tested with Postman and wrote many integration tests as well as unit tests
APIv2: following kebab-case format for API paths, APIv1: keeping camelCase for backward compatibility

Closes #188

Release notes:

Implements a new REST API endpoint: POST api/v2/get-flow-checkpoints for getting checkpoints (with measurements) of a flow; flow being identifiable based on an input partitioning (v2).
Using kebab-case convention for our API paths in v2 (v1 is kept camelCase as it was before, for backward compatibility reasons).

…support and new DTO

…e/188-server-part-of-get-flow-checkpoints # Conflicts: # server/src/test/scala/za/co/absa/atum/server/api/TestData.scala

github-actions · 2024-04-30T15:44:29Z

JaCoCo server module code coverage report - scala 2.13.11

Build Failed

…oints

…a model and tests

… (will need refactoring)

…e/188-server-part-of-get-flow-checkpoints

lsulak · 2024-05-23T21:19:26Z

server/src/main/scala/za/co/absa/atum/server/model/CheckpointFromDB.scala

+    } yield MeasureResultDTO(mainValue, supportValues)
+
+    measureResultOrErr match {
+      case Left(err) => throw err


@salamonpavel I was thinking what to do here, perhaps this is not 'best-zio-practice' - would you have a better idea how to handle this? Also, check FlowServiceImpl where it is used please

I think you can and should return Either instead of throwing exception which in this version of code in not handled properly. I would go with this code (only small change of yours).

def toCheckpointDTO( partitioning: PartitioningDTO, checkpointQueryResult: CheckpointFromDB ): Either[DecodingFailure, CheckpointDTO] = { val measureResultOrErr = checkpointQueryResult.measurementValue.as[MeasureResultDTO] measureResultOrErr match { case Left(err) => Left(err) case Right(measureResult) => Right( CheckpointDTO( id = checkpointQueryResult.idCheckpoint, name = checkpointQueryResult.checkpointName, author = checkpointQueryResult.author, measuredByAtumAgent = checkpointQueryResult.measuredByAtumAgent, partitioning = partitioning, processStartTime = checkpointQueryResult.checkpointStartTime, processEndTime = checkpointQueryResult.checkpointEndTime, measurements = Set( MeasurementDTO( measure = MeasureDTO( measureName = checkpointQueryResult.measureName, measuredColumns = checkpointQueryResult.measuredColumns ), result = measureResult ) ) ) ) } }

salamonpavel · 2024-05-24T10:43:15Z

server/src/main/scala/za/co/absa/atum/server/model/CheckpointFromDB.scala

+
+object CheckpointFromDB {
+
+  private def extractMainValue(json: Json): Either[Error, MeasureResultDTO.TypedValue] = {


This method is not needed. You can directly deserialize into MeasureResultDTO in toCheckpointDTO method.

checkpointQueryResult.measurementValue.as[MeasureResultDTO]

hmm, that's true now. I had a few iterations of this code - thanks! Will change it

salamonpavel · 2024-05-24T10:43:30Z

server/src/main/scala/za/co/absa/atum/server/model/CheckpointFromDB.scala

+    json.as[MeasureResultDTO].map(_.mainValue)
+  }
+
+  private def extractSupportValues(json: Json): Either[Error, Map[String, MeasureResultDTO.TypedValue]] =


This method is not needed. You can directly deserialize into MeasureResultDTO in toCheckpointDTO method.

checkpointQueryResult.measurementValue.as[MeasureResultDTO]

salamonpavel · 2024-05-24T11:52:12Z

server/src/main/scala/za/co/absa/atum/server/api/service/FlowServiceImpl.scala

+class FlowServiceImpl(flowRepository: FlowRepository)
+  extends FlowService with BaseService {
+
+  override def getFlowCheckpoints(checkpointQueryDTO: CheckpointQueryDTO): IO[ServiceError, Seq[CheckpointDTO]] = {


In this version the exception coming from the deserialization is not handled. In some other comment I am mentioning you should return rather either in the serde code. Then you could create zios from those eithers and collect them. See bellow two version of the same, one with sequential processing (the one with foreach), another one with parallel processing (collectAll). I would personally choose the foreach variant.

def getFlowCheckpointsCollectAll(checkpointQueryDTO: CheckpointQueryDTO): IO[ServiceError, Seq[CheckpointDTO]] = { for { checkpointsFromDB <- repositoryCall(flowRepository.getFlowCheckpoints(checkpointQueryDTO), "getFlowCheckpoints") checkpointDTOs <- ZIO.collectAll { checkpointsFromDB.map { checkpointFromDB => ZIO.fromEither(CheckpointFromDB.toCheckpointDTO(checkpointQueryDTO.partitioning, checkpointFromDB)) .mapError(error => ServiceError(error.getMessage)) } } } yield checkpointDTOs } def getFlowCheckpointsForeach(checkpointQueryDTO: CheckpointQueryDTO): IO[ServiceError, Seq[CheckpointDTO]] = { for { checkpointsFromDB <- repositoryCall(flowRepository.getFlowCheckpoints(checkpointQueryDTO), "getFlowCheckpoints") checkpointDTOs <- ZIO.foreach(checkpointsFromDB) { checkpointFromDB => ZIO.fromEither(CheckpointFromDB.toCheckpointDTO(checkpointQueryDTO.partitioning, checkpointFromDB)) .mapError(error => ServiceError(error.getMessage)) } } yield checkpointDTOs }

Actually now reading the documentation I can see that you could also use foreachPar for parallel processing. So the difference between foreach(Par) and collectAll is mainly in the fact that collectAll takes sequence of effects on the input whereas foreach takes a normal collection and function to convert the elements into zio. The return value is the same, and both return failed effect if any of the zios fail.

Thanks, it's quite educative, I'll give it some reading.

I think I'll stick with the sequential processing - the only 'parallelism' would be what happens in toCheckpointDTO and that's not that 'slow' (i.e. I like performance / multiprocessing optimizations where it significantly impacts performance, on the other hand if it doesn't, parallelism can introduce additional overhead and, god forbids, debugging of problems is a bit more difficult)

…oints

salamonpavel · 2024-06-12T06:48:47Z

server/src/main/scala/za/co/absa/atum/server/api/database/DoobieImplicits.scala


+  implicit val encodeResultValueType: Encoder[MeasureResultDTO.ResultValueType] = Encoder.encodeString.contramap {


What's the motivation to place json related encoders/decoders alongside doobie implicits?

Not sure I understand the question, those JSON related SerDe code was already there & I needed those MeasureResult DTOs to be serialized/deserialized as well

DoobieImplicits object is there for defining Put/Get/Read/Write instances for Doobie. Then we have PlayJsonImplicits for Reads/Writes/Format type classes for Play Json. And what you have defined is actually related to Circe.

Aaah. Yes, I'm sorry I understand now. I'll move them to CirceImplicits.scala

I know that I could move them directly to CheckpointFromDB.scala, but I anticipate that @TebaleloS will create a bunch of them later as well, so it might be a good idea for them to be centralized

It's customary to place them in companion objects.

moved into companion object of a given DTO, thanks for the recommendation

salamonpavel · 2024-06-12T06:55:06Z

.../src/main/scala/za/co/absa/atum/server/api/database/flows/functions/GetFlowCheckpoints.scala

+
+  override def sql(values: CheckpointQueryDTO)(implicit read: Read[CheckpointFromDB]): Fragment = {
+    val partitioning = PartitioningForDB.fromSeqPartitionDTO(values.partitioning)
+    val partitioningNormalized = Json.toJson(partitioning).toString


Maybe we could already serialize it into Json from Circe instead of using String derived by play json.

I don't want to do that in this PR, it's been open for far too long and we have a ticket for this already. Let's make it later & in one bunch, not in pieces in these feature PRs I think

#200

salamonpavel · 2024-06-12T06:56:27Z

server/src/main/scala/za/co/absa/atum/server/api/http/Endpoints.scala

@@ -54,6 +54,15 @@ trait Endpoints extends BaseEndpoints {
      .out(jsonBody[AdditionalDataSubmitDTO])
  }

+  protected val getFlowCheckpointsEndpoint
+    : PublicEndpoint[CheckpointQueryDTO, ErrorResponse, Seq[CheckpointDTO], Any] = {


Let's merge #199 before this PR so you can incorporate the envelope.

Okay, happy to do that - I approved #199 just now

salamonpavel · 2024-06-12T06:59:11Z

...er/src/test/scala/za/co/absa/atum/server/api/controller/FlowControllerIntegrationTests.scala

+import zio.test.Assertion.failsWithA
+import zio.test._
+
+object FlowControllerIntegrationTests extends ZIOSpecDefault with TestData {


It's a unit test and should be executed as such. Please rename to FlowControllerUnitTests.

salamonpavel · 2024-06-12T07:00:24Z

...er/src/test/scala/za/co/absa/atum/server/api/repository/FlowRepositoryIntegrationTests.scala

+import zio.test._
+import zio.test.junit.ZTestJUnitRunner
+
+@RunWith(classOf[ZTestJUnitRunner])


Remove the annotation and make it an object. Also as above, it's a unit test and should be executed as such. Please rename the object to FlowRepositoryUnitTests

Okay, I'll actually make these changes in the whole repo. I was not paying particular attention to it, but it's time to change it

@lsulak
Maybe you could also rename all other test files where the suffix 'IntegrationTests' was incorrectly used?

naturally, I also did it already :)

salamonpavel · 2024-06-12T07:00:53Z

server/src/test/scala/za/co/absa/atum/server/api/service/FlowServiceIntegrationTests.scala

+import zio.test._
+import zio.test.junit.ZTestJUnitRunner
+
+@RunWith(classOf[ZTestJUnitRunner])


salamonpavel

This PR showcase high-quality code that adheres to best practices and coding standards. However, it would be beneficial to add automated tests for newly created endpoint. While manual testing via tools like Postman is useful for exploratory testing and very specific scenarios, it cannot replace automated tests.

…oints

lsulak · 2024-06-12T15:07:42Z

This PR showcase high-quality code that adheres to best practices and coding standards. However, it would be beneficial to add automated tests for newly created endpoint. While manual testing via tools like Postman is useful for exploratory testing and very specific scenarios, it cannot replace automated tests.

Thanks, appreciate it!

Re API tests: actually it's a good idea. Ticket here: #210 for the future as it's quite difficult to implement all the good ideas immediately :D

…e/188-server-part-of-get-flow-checkpoints # Conflicts: # database/src/main/postgres/flows/V1.9.1__get_flow_checkpoints.sql # project/Dependencies.scala # server/src/main/scala/za/co/absa/atum/server/Constants.scala # server/src/main/scala/za/co/absa/atum/server/api/database/DoobieImplicits.scala # server/src/main/scala/za/co/absa/atum/server/api/http/BaseEndpoints.scala # server/src/main/scala/za/co/absa/atum/server/api/http/Endpoints.scala # server/src/main/scala/za/co/absa/atum/server/api/http/Routes.scala # server/src/main/scala/za/co/absa/atum/server/api/repository/BaseRepository.scala # server/src/main/scala/za/co/absa/atum/server/api/service/BaseService.scala # server/src/main/scala/za/co/absa/atum/server/model/CheckpointFromDB.scala # server/src/main/scala/za/co/absa/atum/server/model/PlayJsonImplicits.scala # server/src/test/scala/za/co/absa/atum/server/api/TestData.scala # server/src/test/scala/za/co/absa/atum/server/api/controller/CheckpointControllerUnitTests.scala # server/src/test/scala/za/co/absa/atum/server/api/service/PartitioningServiceUnitTests.scala

…-merge conflict resolution

lsulak · 2024-06-13T15:21:47Z

Release notes

Implements a new REST API endpoint: POST api/v2/get-flow-checkpoints for getting checkpoints (with measurements) of a flow; flow being identifiable based on an input partitioning (v2).
Using kebab-case convention for our API paths in v2 (v1 is kept camelCase as it was before, for backward compatibility reasons).

salamonpavel · 2024-06-14T07:37:35Z

This PR showcase high-quality code that adheres to best practices and coding standards. However, it would be beneficial to add automated tests for newly created endpoint. While manual testing via tools like Postman is useful for exploratory testing and very specific scenarios, it cannot replace automated tests.

Thanks, appreciate it!

Re API tests: actually it's a good idea. Ticket here: #210 for the future as it's quite difficult to implement all the good ideas immediately :D

There are also these unit tests for endpoints that could be implemented also for the new endpoint.

…ject of a given DTO

lsulak · 2024-06-14T11:20:24Z

This PR showcase high-quality code that adheres to best practices and coding standards. However, it would be beneficial to add automated tests for newly created endpoint. While manual testing via tools like Postman is useful for exploratory testing and very specific scenarios, it cannot replace automated tests.

Thanks, appreciate it!
Re API tests: actually it's a good idea. Ticket here: #210 for the future as it's quite difficult to implement all the good ideas immediately :D

There are also these unit tests for endpoints that could be implemented also for the new endpoint.

I personally don't like covering everything with all types of tests - but in this case I'll add them, perhaps it's a good enough balance to have at least 1 test for each endpoint type / service, and since here I introduced 'flow' service / functionality, it might be nice to have it.

lsulak added 4 commits April 30, 2024 17:28

#188: adding new controller / service / repository Flow, with server …

86397c2

…support and new DTO

#188: further PoC-ing

dab8751

Merge remote-tracking branch 'refs/remotes/origin/master' into featur…

9678071

…e/188-server-part-of-get-flow-checkpoints # Conflicts: # server/src/test/scala/za/co/absa/atum/server/api/TestData.scala

merge conflict resolution

1fea27b

lsulak self-assigned this Apr 30, 2024

lsulak added the work in progress Work on this item is not yet finished (mainly intended for PRs) label Apr 30, 2024

lsulak added 5 commits May 7, 2024 18:33

Merge branch 'master' into feature/188-server-part-of-get-flow-checkp…

61f8f47

…oints

Merge branch 'master' into feature/188-server-part-of-get-flow-checkp…

c1c10ce

…oints

#188: final implementation of desired functionality with adjusted dat…

a8c8ead

…a model and tests

#188: implementing decoding of SupportValues as well, in measurements…

95b4ba6

… (will need refactoring)

#188: final refactoring

520cba8

lsulak marked this pull request as ready for review May 23, 2024 11:51

lsulak requested review from benedeki, TebaleloS, Zejnilovic, dk1844 and salamonpavel as code owners May 23, 2024 11:51

lsulak added 5 commits May 23, 2024 23:06

#188: tests, tests, tests

f840e5a

Merge remote-tracking branch 'refs/remotes/origin/master' into featur…

09249cb

…e/188-server-part-of-get-flow-checkpoints

merge conflict resolution

4bf50a1

#188: removing temporary code I used

163de13

remove

5b21021

lsulak commented May 23, 2024

View reviewed changes

salamonpavel reviewed May 24, 2024

View reviewed changes

post-review improvements

dcac162

lsulak mentioned this pull request May 27, 2024

Server endpoints v2 returning the checkpoins data of a partitioning: 190 #194

Merged

Merge branch 'master' into feature/188-server-part-of-get-flow-checkp…

9e0731b

…oints

salamonpavel reviewed Jun 12, 2024

View reviewed changes

lsulak added 2 commits June 12, 2024 16:54

Merge branch 'master' into feature/188-server-part-of-get-flow-checkp…

4805305

…oints

post-review improvement

52972c3

lsulak added 11 commits June 13, 2024 13:12

post-merge changes

bddf570

#188: API changes in v2

1ce658b

#188: fix

0676967

#188: making APIv1 compatible again

d0ca3ae

#188: fixing mocked API tests

b7310f7

#188: fixing test data, removing duplicates - part of the recent post…

e346077

…-merge conflict resolution

#188: fixing some ITs

3c53a89

#188: fixing all UTs for the server

464752a

#188: optimization / refactoring

92065f6

#188: fixing the API & ITs

d6fe29d

#188: moving the circe-related implicit conversions to a companion ob…

8d777f6

…ject of a given DTO

lsulak added 2 commits June 14, 2024 13:31

#188: one more integration test for the new API

8e8b565

#188: not ITs, everything is being mocked and runs locally

38ca1bc

salamonpavel approved these changes Jun 14, 2024

View reviewed changes

lsulak merged commit 470e091 into master Jun 14, 2024
6 of 7 checks passed

lsulak deleted the feature/188-server-part-of-get-flow-checkpoints branch June 14, 2024 11:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#188: new Flow service in API v2 #195

#188: new Flow service in API v2 #195

lsulak commented Apr 30, 2024 •

edited by miroslavpojer

Loading

github-actions bot commented Apr 30, 2024 •

edited

Loading

lsulak May 23, 2024

salamonpavel May 24, 2024

salamonpavel May 24, 2024

lsulak May 24, 2024

salamonpavel May 24, 2024

salamonpavel May 24, 2024 •

edited

Loading

salamonpavel May 24, 2024

lsulak May 24, 2024 •

edited

Loading

salamonpavel Jun 12, 2024

lsulak Jun 12, 2024

salamonpavel Jun 14, 2024 •

edited

Loading

lsulak Jun 14, 2024

salamonpavel Jun 14, 2024

lsulak Jun 14, 2024 •

edited

Loading

salamonpavel Jun 12, 2024

lsulak Jun 12, 2024

salamonpavel Jun 12, 2024

lsulak Jun 12, 2024

salamonpavel Jun 12, 2024

lsulak Jun 12, 2024

salamonpavel Jun 12, 2024

lsulak Jun 12, 2024

salamonpavel Jun 14, 2024 •

edited

Loading

lsulak Jun 14, 2024

salamonpavel Jun 12, 2024

salamonpavel left a comment

lsulak commented Jun 12, 2024 •

edited

Loading

lsulak commented Jun 13, 2024 •

edited

Loading

salamonpavel commented Jun 14, 2024 •

edited

Loading

lsulak commented Jun 14, 2024


		object CheckpointFromDB {

		private def extractMainValue(json: Json): Either[Error, MeasureResultDTO.TypedValue] = {


		implicit val encodeResultValueType: Encoder[MeasureResultDTO.ResultValueType] = Encoder.encodeString.contramap {

#188: new Flow service in API v2 #195

#188: new Flow service in API v2 #195

Conversation

lsulak commented Apr 30, 2024 • edited by miroslavpojer Loading

github-actions bot commented Apr 30, 2024 • edited Loading

JaCoCo server module code coverage report - scala 2.13.11

Build Failed

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

salamonpavel May 24, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lsulak May 24, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

salamonpavel Jun 14, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

lsulak Jun 14, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

salamonpavel Jun 14, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

salamonpavel left a comment

Choose a reason for hiding this comment

lsulak commented Jun 12, 2024 • edited Loading

lsulak commented Jun 13, 2024 • edited Loading

salamonpavel commented Jun 14, 2024 • edited Loading

lsulak commented Jun 14, 2024

lsulak commented Apr 30, 2024 •

edited by miroslavpojer

Loading

github-actions bot commented Apr 30, 2024 •

edited

Loading

salamonpavel May 24, 2024 •

edited

Loading

lsulak May 24, 2024 •

edited

Loading

salamonpavel Jun 14, 2024 •

edited

Loading

lsulak Jun 14, 2024 •

edited

Loading

salamonpavel Jun 14, 2024 •

edited

Loading

lsulak commented Jun 12, 2024 •

edited

Loading

lsulak commented Jun 13, 2024 •

edited

Loading

salamonpavel commented Jun 14, 2024 •

edited

Loading