Difference between passing data directly or via formula for `glm` #450

rikhuijzer · 2021-10-11T16:51:32Z

Does it make sense that glm(X, y, ...) passes here, but glm(formula, data, ...) fails?

julia> using GLM

julia> data = (a = [1, 4, 9], b = [2, 5, 7], c = [3, 6, 11], y = [1, 1, 0])
(a = [1, 4, 9], b = [2, 5, 7], c = [3, 6, 11], y = [1, 1, 0])

julia> X = [data.a data.b data.c]
3×3 Matrix{Int64}:
 1  2   3
 4  5   6
 9  7  11

julia> glm(X, data.y, Bernoulli(), LogitLink());

julia> form = @formula(y ~ a + b + c);

julia> glm(form, data, Bernoulli(), LogitLink());
ERROR: PosDefException: matrix is not positive definite; Cholesky factorization failed.
[...]

Version: GLM v1.5.1

The text was updated successfully, but these errors were encountered:

andreasnoack · 2021-10-11T17:02:05Z

The formula based version adds a constant so the two are not equivalent.

andreasnoack closed this as completed Oct 11, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Difference between passing data directly or via formula for `glm` #450

Difference between passing data directly or via formula for `glm` #450

rikhuijzer commented Oct 11, 2021

andreasnoack commented Oct 11, 2021

Difference between passing data directly or via formula for glm #450

Difference between passing data directly or via formula for glm #450

Comments

rikhuijzer commented Oct 11, 2021

andreasnoack commented Oct 11, 2021

Difference between passing data directly or via formula for `glm` #450

Difference between passing data directly or via formula for `glm` #450