Jonas Geiping (@jonasgeiping)'s Twitter Profile
Jonas Geiping

@jonasgeiping

Machine Learning Research at the ELLIS Institute & MPI-IS // Investigating fundamental questions in Safety, Security, Privacy & Efficiency of modern ML

ID:1443639893083758598

https://jonasgeiping.github.io/ | Joined 30-09-2021 18:12:52

328 Tweets

1.6K Followers

612 Following

ELLIS Institute Tübingen (@ELLISInst_Tue):

🎙 The second episode of the Cyber Valley Podcast with our Principal Investigator Jonas Geiping is now available 🚀 Tune in to learn about the Safety and Efficiency of AI.
👉 Check it out: institute-tue.ellis.eu/en/news/cyber-…

ELLIS Institute Tübingen (@ELLISInst_Tue):

🎙 The first episode of the @Cyber_Valley Podcast with our Principal Investigators is now out! 🚀 @Orvieto_Antonio #AIPodcast #AIResearch #AI
🔗 Learn more: institute-tue.ellis.eu/en/news/cyber-…
Cyber Valley (@Cyber_Valley):

🚀 Get ready to dive deep into the captivating world of artificial intelligence with us!
The Cyber Valley Podcast is coming soon...
🎙️ Don’t miss our unforgettable episodes, created in collaboration with the ELLIS Institute Tübingen and Antonio Orvieto

Yuxin Wen (@ywen99):

Very interesting paper!

Sharing a similar threat model but with a different focus, our recent paper (arxiv.org/abs/2404.01231), also titled Privacy Backdoors 🤗, achieves strong membership inference performance through poisoning pre-trained models.
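Context for readers new to the threat model: membership inference asks whether a specific example was part of a model's training set. Below is a minimal sketch of the classic loss-thresholding baseline; the model, example, and threshold are hypothetical stand-ins, and the poisoning-based attack in the paper is far stronger than this baseline.

```python
import torch
import torch.nn.functional as F

def loss_based_membership_score(model, x, y):
    """Per-example cross-entropy loss; unusually low loss hints that (x, y) was a training member.

    This is only the classic loss-thresholding baseline, not the poisoning
    attack from the paper; it just illustrates what "membership inference" means.
    """
    model.eval()
    with torch.no_grad():
        logits = model(x.unsqueeze(0))            # (1, num_classes)
        loss = F.cross_entropy(logits, y.unsqueeze(0))
    return loss.item()

# Hypothetical usage: declare (x, y) a member if its loss falls below a
# threshold calibrated on examples known NOT to be in the training set.
# is_member = loss_based_membership_score(model, x, y) < threshold
```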

Jonas Geiping (@jonasgeiping):

How can we define, and how can we compare, style information in generated images?
For example, when trying to figure out whether a generated image accidentally copies an existing art style?

Gowthami Somepalli led our recent investigation into new models and data for this purpose, summarized here:

AK (@_akhaliq):

Measuring Style Similarity in Diffusion Models

Generative models are now widely used by graphic designers and artists. Prior works have shown that these models remember and often replicate content from their training data during generation. Hence, as their proliferation …

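As a rough illustration of what "style similarity" can mean operationally, one common recipe is to embed two images with a pretrained vision encoder and compare the embeddings with cosine similarity. The sketch below uses open_clip purely as a stand-in encoder; the paper trains a dedicated style descriptor, so treat this as an assumption-laden approximation rather than the paper's method.

```python
import torch
import open_clip
from PIL import Image

# Stand-in encoder; the paper trains a dedicated style descriptor instead.
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
model.eval()

def style_similarity(path_a: str, path_b: str) -> float:
    """Cosine similarity between image embeddings as a crude style proxy."""
    imgs = torch.stack([preprocess(Image.open(p).convert("RGB"))
                        for p in (path_a, path_b)])
    with torch.no_grad():
        feats = model.encode_image(imgs)
        feats = feats / feats.norm(dim=-1, keepdim=True)
    return (feats[0] @ feats[1]).item()

# Hypothetical usage:
# print(style_similarity("generated.png", "reference_artwork.png"))
```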
Jonas Geiping (@jonasgeiping):

Had a very interesting chat with Sam Charrington for The TWIML AI Podcast recently, broadly about adversarial attacks on LLMs.

I'm long overdue to post a thread about this research and all the ways of coercing LLMs to do and reveal (almost) anything; I'll get to it tomorrow!

Micah Goldblum (@micahgoldblum):

We show how to make data poisoning and backdoor attacks way more potent by synthesizing them from scratch with guided diffusion. 🧵 1/8

Paper: arxiv.org/abs/2403.16365

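For intuition on the mechanism, the sketch below shows one toy classifier-guided denoising step; the paper steers the diffusion sampler with a poisoning objective rather than a plain class label, and every name here is a placeholder.

```python
import torch

def guided_denoise_step(denoiser, classifier, x_t, t, target_class, guidance_scale=1.0):
    """One toy classifier-guided denoising step (Dhariwal & Nichol-style guidance).

    denoiser(x_t, t) -> predicted noise; classifier(x_t, t) -> class logits.
    The paper steers sampling with a poisoning objective instead of a plain
    class label; everything here is a placeholder to show the mechanism.
    """
    # Gradient of log p(target_class | x_t) with respect to the noisy image.
    x_t = x_t.detach().requires_grad_(True)
    log_prob = torch.log_softmax(classifier(x_t, t), dim=-1)[:, target_class].sum()
    grad = torch.autograd.grad(log_prob, x_t)[0]

    # Shift the noise prediction along the guidance direction.
    # (The usual sqrt(1 - alpha_bar_t) factor is folded into guidance_scale here.)
    with torch.no_grad():
        eps = denoiser(x_t, t)
    return eps - guidance_scale * grad   # feed into the standard DDPM/DDIM update
```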
Hamid (@hamid_kazemi22):

Excited to share our latest paper on CLIP model inversion, uncovering surprising NSFW image occurrences and more! Heartfelt thanks to all my amazing collaborators! ♥️ Atoosa Chegini, Jonas Geiping, Soheil Feizi, Tom Goldstein
Paper: huggingface.co/papers/2403.02…
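For context, "model inversion" here means optimizing an image from scratch so that CLIP scores it highly for a given text prompt, then inspecting what shows up. A bare-bones sketch under those assumptions follows; the paper's actual procedure adds regularizers, augmentations, and proper input normalization that this omits.

```python
import torch
import open_clip

model, _, _ = open_clip.create_model_and_transforms(
    "ViT-B-32", pretrained="laion2b_s34b_b79k"
)
tokenizer = open_clip.get_tokenizer("ViT-B-32")
model.eval()

def invert_clip(prompt: str, steps: int = 500, lr: float = 0.05):
    """Optimize raw pixels to maximize CLIP image-text similarity (toy version)."""
    with torch.no_grad():
        text_feat = model.encode_text(tokenizer([prompt]))
        text_feat = text_feat / text_feat.norm(dim=-1, keepdim=True)

    # A faithful setup would also normalize with CLIP's mean/std and add augmentations.
    image = torch.rand(1, 3, 224, 224, requires_grad=True)
    opt = torch.optim.Adam([image], lr=lr)
    for _ in range(steps):
        img_feat = model.encode_image(image.clamp(0, 1))
        img_feat = img_feat / img_feat.norm(dim=-1, keepdim=True)
        loss = -(img_feat * text_feat).sum()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return image.detach().clamp(0, 1)
```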

AK (@_akhaliq):

Coercing LLMs to do and reveal (almost) anything

It has recently been shown that adversarial attacks on large language models (LLMs) can 'jailbreak' the model into making harmful statements. In this work, we argue that the spectrum of adversarial attacks on LLMs is much larger …

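The common backbone of these attacks is discrete optimization of an adversarial suffix: choose suffix tokens that minimize the loss of some target continuation. The sketch below is a stripped-down, single-swap-per-step variant in the spirit of GCG-style optimizers; all names and hyperparameters are placeholders, and the paper explores many objectives beyond harmful statements.

```python
import torch
import torch.nn.functional as F

def optimize_suffix(model, prompt_ids, suffix_ids, target_ids, steps=100, topk=64):
    """Toy single-swap-per-step adversarial suffix optimization.

    prompt_ids / suffix_ids / target_ids are 1-D LongTensors; `model` is
    assumed to be a Hugging Face causal LM. All settings are placeholders.
    """
    embed = model.get_input_embeddings()
    tgt_start = prompt_ids.numel() + suffix_ids.numel()

    def target_loss(full_ids):
        logits = model(full_ids.unsqueeze(0)).logits[0]
        return F.cross_entropy(logits[tgt_start - 1:-1], target_ids)

    for _ in range(steps):
        # Gradient of the target loss w.r.t. one-hot suffix tokens.
        one_hot = F.one_hot(suffix_ids, embed.num_embeddings).float().requires_grad_(True)
        inputs = torch.cat([embed(prompt_ids), one_hot @ embed.weight, embed(target_ids)])
        logits = model(inputs_embeds=inputs.unsqueeze(0)).logits[0]
        F.cross_entropy(logits[tgt_start - 1:-1], target_ids).backward()

        # Try the top-k most promising substitutions at one random position; keep the best.
        pos = torch.randint(suffix_ids.numel(), (1,)).item()
        with torch.no_grad():
            best_loss = target_loss(torch.cat([prompt_ids, suffix_ids, target_ids])).item()
        best_tok = suffix_ids[pos].item()
        for tok in (-one_hot.grad[pos]).topk(topk).indices.tolist():
            trial = suffix_ids.clone()
            trial[pos] = tok
            with torch.no_grad():
                loss = target_loss(torch.cat([prompt_ids, trial, target_ids])).item()
            if loss < best_loss:
                best_loss, best_tok = loss, tok
        suffix_ids[pos] = best_tok
    return suffix_ids
```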
John Kirchenbauer (@jwkirchenbauer):

Happy to share that our work studying the reliability of watermarking techniques for AI-generated text detection has been accepted! See y'all in Vienna 🇦🇹
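Background for anyone new to this line of work: the watermark pseudo-randomly splits the vocabulary into a "green" and a "red" list based on the preceding token and softly boosts green tokens during generation; detection is then a one-proportion z-test on how many green tokens appear. Below is a minimal detector sketch with an illustrative hash and parameters, not the exact implementation.

```python
import math
import torch

def greenlist_mask(prev_token: int, vocab_size: int, gamma: float = 0.25, key: int = 15485863):
    """Pseudo-randomly mark a gamma-fraction of the vocabulary 'green', seeded by the previous token."""
    gen = torch.Generator().manual_seed(key * prev_token)
    perm = torch.randperm(vocab_size, generator=gen)
    green = torch.zeros(vocab_size, dtype=torch.bool)
    green[perm[: int(gamma * vocab_size)]] = True
    return green

def watermark_zscore(token_ids: list[int], vocab_size: int, gamma: float = 0.25) -> float:
    """One-proportion z-test: how many tokens fall in their green list vs. chance."""
    hits = sum(
        greenlist_mask(prev, vocab_size, gamma)[tok].item()
        for prev, tok in zip(token_ids, token_ids[1:])
    )
    n = len(token_ids) - 1
    return (hits - gamma * n) / math.sqrt(n * gamma * (1 - gamma))

# Hypothetical usage: a z-score above roughly 4 is strong evidence the text was watermarked.
```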

Neel Jain (@neeljain1717):

Want to invert your image to text?!?!?

Come check out our PEZ optimizer that makes optimizing hard prompts easy, today at 5pm, #606. Also, we show how you can use PEZ to break content filters like those in Midjourney. #NeurIPS #NeurIPS23

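PEZ in a nutshell: keep a continuous soft prompt, project it onto the nearest real token embeddings each step, evaluate the loss on that projected hard prompt, and route the resulting gradient back to the soft prompt. A stripped-down sketch of that loop, with the model-specific loss left as a placeholder and nearest-neighbor projection used for brevity (the paper's projection metric may differ):

```python
import torch

def pez_optimize(embedding_matrix, loss_fn, prompt_len=8, steps=500, lr=0.1):
    """Toy PEZ-style loop: optimize soft embeddings but always evaluate hard (projected) tokens.

    embedding_matrix: (vocab_size, dim) token embedding table.
    loss_fn: placeholder mapping a (prompt_len, dim) embedding sequence to a scalar loss.
    """
    vocab_size, dim = embedding_matrix.shape
    soft = embedding_matrix[torch.randint(vocab_size, (prompt_len,))].clone().requires_grad_(True)
    opt = torch.optim.Adam([soft], lr=lr)

    for _ in range(steps):
        # Project each soft embedding to its nearest real token embedding.
        token_ids = torch.cdist(soft, embedding_matrix).argmin(dim=-1)
        hard = embedding_matrix[token_ids]
        # Evaluate the loss on the hard prompt, but send the gradient to the
        # soft prompt via the straight-through trick.
        hard_st = soft + (hard - soft).detach()
        loss = loss_fn(hard_st)
        opt.zero_grad()
        loss.backward()
        opt.step()

    final_ids = torch.cdist(soft, embedding_matrix).argmin(dim=-1)
    return final_ids  # decode with the tokenizer to read off the hard prompt
```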
Yangsibo Huang (@YangsiboHuang):

I am at now.

I am also on the academic job market, and humbled to be selected as a 2023 EECS Rising Star✨. I work on ML security, privacy & data transparency.

Appreciate any reposts & happy to chat in person! CV+statements: tinyurl.com/yangsibo

Find me at ⬇️
