[Mobile Benchmark Test] Add os, job_arn, and job_conclusion to artifact #6371

yangw-dev · 2025-03-06T19:18:35Z

Details:

print os in test spect printout section
store os, job_arn (mobile job) and job_conclusion to each artifact metadata. Notice this is not github job conclusion, this is mobile job conclusion.
wrap post-test logics into ReportProcessor,
- pass aws client as parameter for test-driven purpose
- add unit test for ReportProcessor

vercel · 2025-03-06T19:18:39Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Skipped Deployment

Name	Status	Preview	Updated (UTC)
torchci	⬜️ Ignored (Inspect)	Visit Preview	Mar 7, 2025 0:33am

tools/device-farm-runner/test_run_on_aws_devicefarm.py

ZainRizvi · 2025-03-07T16:28:45Z

tools/device-farm-runner/test_run_on_aws_devicefarm.py

+        self.assertEqual(m_df.getMockClient().list_suites.call_count, 2)
+        self.assertEqual(m_df.getMockClient().list_tests.call_count, 4)
+        self.assertEqual(download_artifact_mock.call_count, 12)


these numbers are very precise and look like they might break the test when the code changes slightly. Can these constraints be relaxed a bit to just checking the most critical api calls and perhaps using greaterThan/lessThan type of comparisons?

The idea is to make the test more resilient to design changes in the code (so that future authors don't have to go around updating a bunch of tests) while still verifying that the core functionality desired is maintained.

ZainRizvi · 2025-03-07T16:32:59Z

tools/device-farm-runner/test_run_on_aws_devicefarm.py

+        self.assertEqual(a2["name"], "test spec output")
+
+    @mock.patch("run_on_aws_devicefarm.download_artifact")
+    def test_reportProcessor_debug(self, download_artifact_mock):


nit: Can you please rename this test to match the "test_when_then" format the tests further down have?

That would help explain to future maintainers what exactly this test is intended to validate (which TBH isn't clear to me right now)

ZainRizvi · 2025-03-07T16:35:45Z

tools/device-farm-runner/test_run_on_aws_devicefarm.py

+        self.assertEqual(m_df.getMockClient().list_suites.call_count, 2)
+        self.assertEqual(m_df.getMockClient().list_tests.call_count, 4)
+        self.assertEqual(m_s3.getMockClient().upload_file.call_count, 12)
+        self.assertEqual(m_s3.getMockClient().upload_file.call_count, 12)
+        self.assertEqual(download_artifact_mock.call_count, 12)
+        self.assertEqual(m_s3.getMockClient().upload_file.call_count, 12)


same as comment below, about making these more resilient

ZainRizvi · 2025-03-07T16:37:28Z

tools/device-farm-runner/run_on_aws_devicefarm.py

+        run_report = self._to_run_report(report)
+        self.run_report = run_report


very minor nit, but why not:

Suggested change

run_report = self._to_run_report(report)

self.run_report = run_report

self.run_report = self._to_run_report(report)

(feel free to ignore if you prefer the current format)

huydhn · 2025-03-08T01:44:24Z

tools/device-farm-runner/run_on_aws_devicefarm.py

@@ -196,6 +200,8 @@ def parse_args() -> Any:
        help="an optional file to write the list of artifacts from AWS in JSON format",
    )

+    parser.add_argument("--debug", action="store_true")


Oh, I didn't realize until I read your comment later that debug mode doesn't upload the artifacts to S3. This is a nice feature, but I think a help description is needed here to make that clearer. You could also make some parameters like --workflow-id or --workflow-attempt optional in debug mode

On the other hand, another option I think would be useful here is --quiet. I know that I print too many messages like INFO:root:Run mobile-job-android-13711065730-1-2025-03-07-wrRJMDcB in state RUNNING in the console log, which could be silent

huydhn · 2025-03-08T01:59:52Z

tools/device-farm-runner/run_on_aws_devicefarm.py

            project_arn,
            unique_prefix,
            args.android_instrumentation_test,
            args.test_spec,
        )

+    if test_to_run == {}:


Strictly speaking, the test suite is not a requirement because Device Farm can fuzz the app to make sure it doesn't crash. So, only the app is required. All the current use cases on CI need an test suite though, so having this check is ok. However, you could achieve this just by setting required=True in the test_group mutual exclusive group in argparse instead of doing an explicit check here

huydhn · 2025-03-08T02:05:13Z

Does the console log from the mobile test jobs look correct https://github.com/pytorch/test-infra/actions/runs/13711065730/job/38347515619?pr=6371?

You could also test this on ExecuTorch side (where it really matters) by quickly submitting a mock PR to ExecuTorch to use your branch addSpec:

huydhn · 2025-03-08T02:08:32Z

Look like a small bug on this part https://github.com/pytorch/test-infra/actions/runs/13711065730/job/38347515619?pr=6371#step:15:81, f-string something I guess

INFO:root:{' ' * indent} start gathering artifacts

yangw-dev added 4 commits March 6, 2025 16:55

remove log

d77265b

remove log

ee31193

remove log

0be4b8b

remove log

e875370

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 6, 2025

github-advanced-security bot found potential problems Mar 6, 2025

View reviewed changes

tools/device-farm-runner/test_run_on_aws_devicefarm.py Fixed Show fixed Hide fixed

remove log

1bae658

yangw-dev changed the title ~~[Test]~~ [Mobile Benchmark Test] Add os, job_arn, and job_conclusion to artifact Mar 6, 2025

yangw-dev marked this pull request as ready for review March 6, 2025 21:40

yangw-dev requested review from huydhn, Camyll and ZainRizvi March 6, 2025 21:40

yangw-dev added 5 commits March 7, 2025 06:56

fix lint

874fd1e

fix lint

b353400

fix lint

6b5ca0e

fix lint

0b3654b

fix org

5d51791

ZainRizvi approved these changes Mar 7, 2025

View reviewed changes

huydhn reviewed Mar 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Mobile Benchmark Test] Add os, job_arn, and job_conclusion to artifact #6371

[Mobile Benchmark Test] Add os, job_arn, and job_conclusion to artifact #6371

yangw-dev commented Mar 6, 2025 •

edited

Loading

vercel bot commented Mar 6, 2025 •

edited

Loading

ZainRizvi Mar 7, 2025

ZainRizvi Mar 7, 2025

ZainRizvi Mar 7, 2025

ZainRizvi Mar 7, 2025

huydhn Mar 8, 2025

huydhn Mar 8, 2025 •

edited

Loading

huydhn Mar 8, 2025 •

edited

Loading

huydhn commented Mar 8, 2025

huydhn commented Mar 8, 2025

		run_report = self._to_run_report(report)
		self.run_report = run_report

	run_report = self._to_run_report(report)
	self.run_report = run_report
	self.run_report = self._to_run_report(report)

[Mobile Benchmark Test] Add os, job_arn, and job_conclusion to artifact #6371

Are you sure you want to change the base?

[Mobile Benchmark Test] Add os, job_arn, and job_conclusion to artifact #6371

Conversation

yangw-dev commented Mar 6, 2025 • edited Loading

vercel bot commented Mar 6, 2025 • edited Loading

ZainRizvi Mar 7, 2025

Choose a reason for hiding this comment

ZainRizvi Mar 7, 2025

Choose a reason for hiding this comment

ZainRizvi Mar 7, 2025

Choose a reason for hiding this comment

ZainRizvi Mar 7, 2025

Choose a reason for hiding this comment

huydhn Mar 8, 2025

Choose a reason for hiding this comment

huydhn Mar 8, 2025 • edited Loading

Choose a reason for hiding this comment

huydhn Mar 8, 2025 • edited Loading

Choose a reason for hiding this comment

huydhn commented Mar 8, 2025

huydhn commented Mar 8, 2025

yangw-dev commented Mar 6, 2025 •

edited

Loading

vercel bot commented Mar 6, 2025 •

edited

Loading

huydhn Mar 8, 2025 •

edited

Loading

huydhn Mar 8, 2025 •

edited

Loading