Update tn #74

Tankya2 · 2024-10-04T07:24:39Z

Hi,

This PR is to address issue #36.

Added memory management code and set the Pathfinding model used to CUTENSOR.

updates: - [github.com/asottile/pyupgrade: v3.16.0 → v3.17.0](asottile/pyupgrade@v3.16.0...v3.17.0)

updates: - [github.com/psf/black: 24.4.2 → 24.8.0](psf/black@24.4.2...24.8.0)

for more information, see https://pre-commit.ci

alecandido

The eval.py contains highly duplicated code, and this is manifest in this PR, since the same update has been repeated over and over.

Before merging this PR, increasing even more the maintenance burden, it would be worth to refactor the eval.py file (in this, or even in another PR), to avoid incurring is such a redundant diff.

src/qibotn/backends/cutensornet.py

src/qibotn/eval.py

alecandido · 2024-10-04T12:45:15Z

src/qibotn/eval.py

-    operands = myconvertor.state_vector_operands()
+        operands = myconvertor.state_vector_operands()
+    else:
+        operands = None


What's the actual purpose of this?

If rank != 0, qibo_circ is fully ignored...

Even if it is somehow meaningful (I'm not seeing how, but that may be my limitation), the result could only be trivial, so you could even return immediately, without executing all the other operations...

Each rank needs the same initial set of operands for computation. Here, the operands are created in Rank 0, for all other rank the operands are just set to None. In line 86, the operands created in Rank 0 is then broadcasted to all other ranks.

alecandido · 2024-10-04T12:45:52Z

src/qibotn/eval.py

@@ -62,6 +62,7 @@ def dense_vector_tn_MPI(qibo_circ, datatype, n_samples=8):
        Dense vector of quantum circuit.
    """

+    import cuquantum.cutensornet as cutn


Any reason to keep the imports within the functions? (instead of top-level)

I know it was like this even before this PR...

The reason was that not all functions require the import, specifically dense_vector_tn(), expectation_pauli_tn(), dense_vector_mps(), pauli_string_gen(). Do you think it is better to bring them to the top-level?

alecandido · 2024-10-04T12:46:47Z

src/qibotn/eval.py

-    # Assign the device for each process.
-    device_id = rank % getDeviceCount()


Do you remember why it was repeated before?

The comment may be still useful, and you could lift to the line above.

alecandido · 2024-10-04T12:47:24Z

src/qibotn/eval.py

@@ -136,6 +150,7 @@ def dense_vector_tn_nccl(qibo_circ, datatype, n_samples=8):
    Returns:
        Dense vector of quantum circuit.
    """
+    import cuquantum.cutensornet as cutn


Same as above

alecandido · 2024-10-04T12:47:31Z

src/qibotn/eval.py

@@ -200,6 +230,9 @@ def dense_vector_tn_nccl(qibo_circ, datatype, n_samples=8):
        stream_ptr,
    )

+    del network


Same as above

src/qibotn/eval.py

…tly already there at the function end

scarrazza · 2025-01-16T04:43:54Z

@alecandido @Tankya2 what is the plan for this PR?

Tankya2 · 2025-01-16T07:20:40Z

@alecandido @Tankya2 what is the plan for this PR?

Hi @scarrazza, @alecandido , @liweintu , I believe most issues in this PR have been addressed. The remaining concern is the presence of duplicated code across various computation modes. I propose to close this PR first if there are no other further issues and to tackle these duplicated codes in another PR.

liweintu · 2025-01-16T10:12:38Z

I tested this PR branch on ASPIRE 2A+, using test_cuquantum_cutensor_backend.py. Most cases passed, but there's 1 assertion failure below.

        # Test Cuquantum
        cutn_time, result_tn = time(
            lambda: qibotn.eval.dense_vector_tn(qibo_circ, dtype).flatten()
        )

>       assert 1e-2 * qibo_time < cutn_time < 1e2 * qibo_time
E       assert 0.38179754093289375 < (100.0 * 0.00212192814797163)

Seems the time taken is too long to pass the assertion? @Tankya2

alecandido · 2025-01-16T12:16:23Z

My main concern was that the code was extended, introducing even further duplication, without reducing the existing one.

Tests and performance benchmarks are not automated, so little can be said.

However, if you manually test the code, and this is needed soon, of course feel free to proceed. Just take note of the project needs, and schedule them for later.

Tankya2 and others added 10 commits July 4, 2024 13:40

Fix bug

17813a6

Add configuration and free memory explicitly

9d37020

correct missing mempool initialization

bbacc26

Update NCCL

f358d0e

Update dense_vector_tn_MPI

1aaa838

Update dense vector tn nccl

fb5e0e7

[pre-commit.ci] pre-commit autoupdate

c1dd326

updates: - [github.com/asottile/pyupgrade: v3.16.0 → v3.17.0](asottile/pyupgrade@v3.16.0...v3.17.0)

[pre-commit.ci] pre-commit autoupdate

f4d00c4

updates: - [github.com/psf/black: 24.4.2 → 24.8.0](psf/black@24.4.2...24.8.0)

Comment

8aef3af

Format

fe83631

Tankya2 requested review from liweintu, scarrazza and alecandido October 4, 2024 07:24

Tankya2 assigned Tankya2 and Vinitha-balachandran Oct 4, 2024

[pre-commit.ci] auto fixes from pre-commit.com hooks

c3a845c

for more information, see https://pre-commit.ci

alecandido requested changes Oct 4, 2024

View reviewed changes

Tankya2 and others added 5 commits October 25, 2024 17:37

Merge branch 'main' into update_tn

f33c44e

Remove import quantum

f9e74fe

Remove duplication

289c8e2

Remove the unnecessary deletion because automatic deletion is implici…

191949d

…tly already there at the function end

Remove redundant mem pool

27b378b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update tn #74

Update tn #74

Tankya2 commented Oct 4, 2024

alecandido left a comment

alecandido Oct 4, 2024

Tankya2 Oct 30, 2024 •

edited

Loading

alecandido Oct 4, 2024

Tankya2 Oct 30, 2024

alecandido Oct 4, 2024

alecandido Oct 4, 2024

alecandido Oct 4, 2024

alecandido Oct 4, 2024

scarrazza commented Jan 16, 2025

Tankya2 commented Jan 16, 2025

liweintu commented Jan 16, 2025

alecandido commented Jan 16, 2025

		# Assign the device for each process.
		device_id = rank % getDeviceCount()

Update tn #74

Are you sure you want to change the base?

Update tn #74

Conversation

Tankya2 commented Oct 4, 2024

alecandido left a comment

Choose a reason for hiding this comment

alecandido Oct 4, 2024

Choose a reason for hiding this comment

Tankya2 Oct 30, 2024 • edited Loading

Choose a reason for hiding this comment

alecandido Oct 4, 2024

Choose a reason for hiding this comment

Tankya2 Oct 30, 2024

Choose a reason for hiding this comment

alecandido Oct 4, 2024

Choose a reason for hiding this comment

alecandido Oct 4, 2024

Choose a reason for hiding this comment

alecandido Oct 4, 2024

Choose a reason for hiding this comment

alecandido Oct 4, 2024

Choose a reason for hiding this comment

scarrazza commented Jan 16, 2025

Tankya2 commented Jan 16, 2025

liweintu commented Jan 16, 2025

alecandido commented Jan 16, 2025

Tankya2 Oct 30, 2024 •

edited

Loading