Test/add multimodal audio test #382

mdimado · 2025-01-22T13:24:56Z

Purpose

add test for multimodal audio support

Proposed Changes

added a new GetAudioToolInput class
created a new GetAudioTool class
modified the test_multimodal_messages function
added audio testing scenario

Issues

feat: add support for base64 encoded audio files #380 ensure api complience with audio for multimodal messages #373

rachwalk

Hello, thank you for working on this.

This PR suffers from the same issue as #380
The failing test provides the following output:

 =================================== FAILURES ===================================
_______________________ test_to_langchain_ai_multimodal ________________________

    def test_to_langchain_ai_multimodal():
        payload = HRIPayload(text="Response", images=["img"], audios=["audio"])
        message = HRIMessage(payload=payload, message_author="ai")
    
>       with pytest.raises(
            ValueError
        ):  # NOTE: update when https://github.com/RobotecAI/rai/issues/370 is resolved
E       Failed: DID NOT RAISE <class 'ValueError'>

This one also has a pre-commit formatting issue. You can reproduce these errors on your local machine by running:
pytest -m "not billable" - to reproduce the test output - Note: running just pytestwill trigger all tests,pre-commit run --all- to see the formatting issues detected by the pre-commit. You can also runpre-commit installin your local rai directory, which will automatically runpre-commit` on all the files changed before you make a commit.

I won't be able to merge the PR unless all existing tests pass and the formatting kept up. Hope the feedback I provided helps you out.

rachwalk · 2025-01-23T16:23:03Z

src/rai/rai/messages/multimodal.py

@@ -56,6 +54,19 @@ def __init__(
                for image in self.images
            ]
            _content.extend(_image_content)
+
+        # aduio content handling (used audio/wav as MIME type)


Suggested change

# aduio content handling (used audio/wav as MIME type)

# audio content handling (used audio/wav as MIME type)

mb, I'll change that

rachwalk · 2025-01-24T12:27:24Z

tests/messages/test_multimodal.py

+    def _run(self, name: str):
+        # simple audio signal (1 second of 440Hz sine wave)
+        sample_rate = 44100
+        duration = 1.0
+        t = np.linspace(0, duration, int(sample_rate * duration))
+        audio_signal = np.sin(2 * np.pi * 440 * t)


A better solution would be to add an actual test.wav file, instead of mocking the input to the too like so.

mdimado added 2 commits January 21, 2025 11:59

feat: add support for base64 encoded audio files

ed872c4

add tests for multimodal audio support

64397b0

rachwalk reviewed Jan 24, 2025

View reviewed changes

rachwalk mentioned this pull request Jan 24, 2025

feat: add support for base64 encoded audio files #380

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Test/add multimodal audio test #382

Test/add multimodal audio test #382

mdimado commented Jan 22, 2025

rachwalk left a comment

rachwalk Jan 23, 2025

mdimado Jan 25, 2025

rachwalk Jan 24, 2025

	# aduio content handling (used audio/wav as MIME type)
	# audio content handling (used audio/wav as MIME type)

Test/add multimodal audio test #382

Are you sure you want to change the base?

Test/add multimodal audio test #382

Conversation

mdimado commented Jan 22, 2025

Purpose

Proposed Changes

Issues

rachwalk left a comment

Choose a reason for hiding this comment

rachwalk Jan 23, 2025

Choose a reason for hiding this comment

mdimado Jan 25, 2025

Choose a reason for hiding this comment

rachwalk Jan 24, 2025

Choose a reason for hiding this comment