[ENH] ICA LiNGAM algorithm in casual_discovery by Jatinbhardwaj-093 · Pull Request #2998 · pgmpy/pgmpy

Jatinbhardwaj-093 · 2026-03-16T10:09:22Z

The following checklist is mandatory.

Your PR will be closed if you remove the checklist or do not answer the questions to a satisfactory level. Use of LLM is strictly forbidden for any part of this checklist (even for improving language).

Your checklist for this pull request

Have you followed all the steps from our Contributing Guide?
Does the PR fully address the linked issue and is within its defined scope? If you are still working on the PR, mark it as draft.
Are all the GitHub Actions checks passing? If not, mark your PR as draft while you fix it.

Please answer the following questions:

Did you use an LLM for any assistance? Please describe how and what you used it for?
Yes. I used Gemini 3.1 primarily for assistance in understanding and summarizing parts of the LiNGAM research paper. In particular, it helped clarify
The algorithm for permutation of the weight matrix.
The overall steps of the LiNGAM algorithm.
The statistical pruning methods described in Section 6 of the paper (Wald test and A Chi-Square Test for Evaluating the Overall Fit of the Estimated Model), which are not yet implemented in the current code.
The implementation itself was written by me based on the original paper and my understanding of the algorithm.
What steps have you taken to verify that the changes correctly address the issue? And what edge cases have you considered?
The implementation was written based on the LiNGAM paper and verified by running the algorithm on synthetic datasets with known causal structures.
A small number of tests were generated with the help of an AI tool and then manually reviewed. Currently, two simple test cases are included to verify that the discovered causal graph matches the expected structure.
Has the LLM added try-except blocks? They will need to be removed; any error handling must be explicit.
No.
Have you used LLM for generating tests? They need to be compressed into a smaller number of tests without reducing coverage.
Some initial test scaffolding was generated with the help of an AI tool and then reviewed manually. However, the test suite is not finalized in this PR. A more complete and compact set of tests will be added in a subsequent update.

Issue number(s) that this pull request fixes

Issue Add LiNGAM Causal Discovery Algorithm #2936

List of changes to the codebase in this pull request

Added the algorithm and few test.

Signed-off-by: Jatin Bhardwaj <[email protected]>

Jatinbhardwaj-093 · 2026-03-17T07:46:29Z

Remaining Tasks

Section 6: Statistical Tests for Pruning Edges.
- Wald Test for Examining Significance of Edges
Tests
- Add tests on real datasets
- Add few more synthetic/random tests with known causal structures.
Add proper documentation

codecov · 2026-03-17T07:51:38Z

Codecov Report

❌ Patch coverage is 95.95376% with 7 lines in your changes missing coverage. Please review.
✅ Project coverage is 95.57%. Comparing base (95637a5) to head (6570834).
✅ All tests successful. No failed tests found.

Files with missing lines	Patch %	Lines
pgmpy/causal_discovery/LiNGAM.py	94.06%	7 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##              dev    #2998    +/-   ##
========================================
  Coverage   95.57%   95.57%            
========================================
  Files         504      506     +2     
  Lines       29122    29295   +173     
========================================
+ Hits        27833    27999   +166     
- Misses       1289     1296     +7

Files with missing lines	Coverage Δ
pgmpy/causal_discovery/__init__.py	`100.00% <100.00%> (ø)`
pgmpy/tests/test_causal_discovery/test_LiNGAM.py	`100.00% <100.00%> (ø)`
pgmpy/causal_discovery/LiNGAM.py	`94.06% <94.06%> (ø)`

ankurankan

@Jatinbhardwaj-093 I see that you are also trying to include expert knowledge in the implementation. Did you find a reference paper for it? If not, I would suggest keeping the initial implementation without it.

Jatinbhardwaj-093 · 2026-03-17T11:09:01Z

@Jatinbhardwaj-093 I see that you are also trying to include expert knowledge in the implementation. Did you find a reference paper for it? If not, I would suggest keeping the initial implementation without it.

I initially assumed that expert knowledge was a native feature in pgmpy, which is why I didn’t explore it in depth during the first draft. I will look for relevant reference papers to support its inclusion. If I’m unable to find sufficient backing, I’ll remove or revise this part accordingly.

Jatinbhardwaj-093 · 2026-03-17T11:14:56Z

@ankurankan

I was thinking about the variant parameter you suggested. Please correct me if I’m wrong, but my understanding is that this would correspond to different types of LiNGAM algorithms, such as ICA, Direct, VAR, Kernel, and others.

If supporting multiple variants is a primary goal, would it make sense to create a single master wrapper class that selects the appropriate LiNGAM implementation based on the variant parameter? Which will have this parameter.

ankurankan · 2026-03-17T19:23:29Z

@Jatinbhardwaj-093 In the case of LiNGAM, I think the two main algorithms are Direct and ICA. I don't think there is a significant overlap between the two, so might be better to implement them in separate classes and not have a variant argument. Sorry, if I suggested to add that earlier.

ankurankan · 2026-03-17T19:25:46Z

pgmpy/causal_discovery/LiNGAM.py

+        # Step 1: Apply an ICA algorithm to obtain a decomposition X = AS where S has
+        # the same size as X and contains in its rows the independent components.
+        # From here on, we will exclusively work with W = A^-1.
+        ica = FastICA(random_state=self.random_state, max_iter=1000)


I would suggest taking an instance of the FastICA as the argument itself. This makes the method composable, and users can fine-tune the FastICA hyperparameters without us having to design our method in a way that allows users to specify that.

Signed-off-by: Jatin Bhardwaj <[email protected]>

Jatinbhardwaj-093 added 4 commits March 17, 2026 12:49

Added the algorithm. One radom test

8241448

Signed-off-by: Jatin Bhardwaj <[email protected]>

Added the algorithm. One radom test

ff52c09

Signed-off-by: Jatin Bhardwaj <[email protected]>

Added one more test

38a3342

Signed-off-by: Jatin Bhardwaj <[email protected]>

Added wald test for pruning edges which are part of association path.

6570834

Signed-off-by: Jatin Bhardwaj <[email protected]>

Jatinbhardwaj-093 force-pushed the causal_discovery/ICALiNGAM branch from 23a75bd to 6570834 Compare March 17, 2026 07:20

ankurankan reviewed Mar 17, 2026

View reviewed changes

Jatinbhardwaj-093 added 7 commits March 21, 2026 18:10

remove expert knowledge.

4d060b9

Signed-off-by: Jatin Bhardwaj <[email protected]>

taking fast_ica for decompostiion as a parameter.

e715dd0

Signed-off-by: Jatin Bhardwaj <[email protected]>

remove variation.

0dc2e7b

Signed-off-by: Jatin Bhardwaj <[email protected]>

Merge branch 'dev' into causal_discovery/ICALiNGAM

555927d

Merge branch 'dev' into causal_discovery/ICALiNGAM

253deeb

add large graph test

52f87a0

Signed-off-by: Jatin Bhardwaj <[email protected]>

Merge branch 'dev' into causal_discovery/ICALiNGAM

38c7cec

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ENH] ICA LiNGAM algorithm in casual_discovery#2998

[ENH] ICA LiNGAM algorithm in casual_discovery#2998
Jatinbhardwaj-093 wants to merge 11 commits intopgmpy:devfrom
Jatinbhardwaj-093:causal_discovery/ICALiNGAM

Jatinbhardwaj-093 commented Mar 16, 2026 •

edited

Loading

Uh oh!

Jatinbhardwaj-093 commented Mar 17, 2026

Uh oh!

codecov bot commented Mar 17, 2026

Uh oh!

ankurankan left a comment

Uh oh!

Jatinbhardwaj-093 commented Mar 17, 2026

Uh oh!

Jatinbhardwaj-093 commented Mar 17, 2026

Uh oh!

ankurankan commented Mar 17, 2026 •

edited

Loading

Uh oh!

ankurankan Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Jatinbhardwaj-093 commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

The following checklist is mandatory.

Your checklist for this pull request

Issue number(s) that this pull request fixes

List of changes to the codebase in this pull request

Uh oh!

Jatinbhardwaj-093 commented Mar 17, 2026

Remaining Tasks

Uh oh!

codecov bot commented Mar 17, 2026

Codecov Report

Uh oh!

ankurankan left a comment

Choose a reason for hiding this comment

Uh oh!

Jatinbhardwaj-093 commented Mar 17, 2026

Uh oh!

Jatinbhardwaj-093 commented Mar 17, 2026

Uh oh!

ankurankan commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ankurankan Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Jatinbhardwaj-093 commented Mar 16, 2026 •

edited

Loading

ankurankan commented Mar 17, 2026 •

edited

Loading