It looks as though the CPU has two pieces of iTLB hardware. One for huge pages and one for small pages. If that's true, we could benefit from having the interpreter use the huge pages, and the JIT use the small pages all for itself. This is a patch that we could also potentially upstream. It could benefit CRuby and MJIT as well.
See Intel doc and Intel sample implementation.