04.02.2024.md

Mobile Models

Like 👍. Comment 💬. Subscribe 🟥. 🏘 Discord: https://discord.gg/pPAFwndTJd

YouTube: https://youtube.com/live/ufnL_bKsb6g

X: https://twitter.com/i/broadcasts/1ynKOyPEoXzJR

Twitch: https://www.twitch.tv/hu_po

References

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception https://arxiv.org/pdf/2401.16158.pdf

AppAgent: Multimodal Agents as Smartphone Users https://arxiv.org/pdf/2312.13771.pdf

MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices https://arxiv.org/pdf/2311.16567.pdf

GELU vs. Swish https://miro.medium.com/v2/resize:fit:1400/format:webp/1*rKRPQHOph77ySdMzpbWiLw.jpeg

MobileSAMv2: Faster Segment Anything to Everything https://arxiv.org/pdf/2312.09579.pdf

https://github.com/ChaoningZhang/MobileSAM

Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts https://arxiv.org/pdf/2309.04354.pdf

Efficient Hybrid Zoom using Camera Fusion on Mobile Phones https://arxiv.org/pdf/2401.01461.pdf