Publications by Tags

, , , , , ,

Selected Papers

Masked Generative Nested Transformers with Decode Time Scaling

Sahil Goyal, Debapriya Tula, Gagan Jain, Pradeep Shenoy, Prateek Jain, Sujoy Paul

Under Review

TLDR: An efficient framework for progressive decoding with nested models for faster inference.

, ,

Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health

Arpan Dasgupta, Gagan Jain, Arun Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja

Under Review

TLDR: An improved collaborative bandits approach with bayesian regret derivation for a special case!

, ,

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul

NeurIPS 2024

TLDR: Token-wise routing between nested experts for tackling redundancy in visual modalities.

, , ,

LookupViT: Compressing Visual Information to a smaller number of tokens

Rajat Koner, Gagan Jain, Prateek Jain, Volker Tresp, Sujoy Paul

ECCV 2024

TLDR: An asyncronous version of attention with sub-quadratic scaling and superior performance.

,

Conditional Computation

Masked Generative Nested Transformers with Decode Time Scaling

Sahil Goyal, Debapriya Tula, Gagan Jain, Pradeep Shenoy, Prateek Jain, Sujoy Paul

Under Review

TLDR: An efficient framework for progressive decoding with nested models for faster inference.

, ,

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul

NeurIPS 2024

TLDR: Token-wise routing between nested experts for tackling redundancy in visual modalities.

, , ,

Efficiency

Masked Generative Nested Transformers with Decode Time Scaling

Sahil Goyal, Debapriya Tula, Gagan Jain, Pradeep Shenoy, Prateek Jain, Sujoy Paul

Under Review

TLDR: An efficient framework for progressive decoding with nested models for faster inference.

, ,

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul

NeurIPS 2024

TLDR: Token-wise routing between nested experts for tackling redundancy in visual modalities.

, , ,

LookupViT: Compressing Visual Information to a smaller number of tokens

Rajat Koner, Gagan Jain, Prateek Jain, Volker Tresp, Sujoy Paul

ECCV 2024

TLDR: An asyncronous version of attention with sub-quadratic scaling and superior performance.

,

Mixture of Experts

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul

NeurIPS 2024

TLDR: Token-wise routing between nested experts for tackling redundancy in visual modalities.

, , ,

Reinforcement Learning

Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health

Arpan Dasgupta, Gagan Jain, Arun Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja

Under Review

TLDR: An improved collaborative bandits approach with bayesian regret derivation for a special case!

, ,

Societal Impact

Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health

Arpan Dasgupta, Gagan Jain, Arun Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja

Under Review

TLDR: An improved collaborative bandits approach with bayesian regret derivation for a special case!

, ,

Theory

Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health

Arpan Dasgupta, Gagan Jain, Arun Suggala, Karthikeyan Shanmugam, Milind Tambe, Aparna Taneja

Under Review

TLDR: An improved collaborative bandits approach with bayesian regret derivation for a special case!

, ,

Vision

Masked Generative Nested Transformers with Decode Time Scaling

Sahil Goyal, Debapriya Tula, Gagan Jain, Pradeep Shenoy, Prateek Jain, Sujoy Paul

Under Review

TLDR: An efficient framework for progressive decoding with nested models for faster inference.

, ,

Mixture of Nested Experts: Adaptive Processing of Visual Tokens

Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul

NeurIPS 2024

TLDR: Token-wise routing between nested experts for tackling redundancy in visual modalities.

, , ,

LookupViT: Compressing Visual Information to a smaller number of tokens

Rajat Koner, Gagan Jain, Prateek Jain, Volker Tresp, Sujoy Paul

ECCV 2024

TLDR: An asyncronous version of attention with sub-quadratic scaling and superior performance.

,