12 6 2

Santiago Viquez

santiviquez

https://www.santiviquez.com/

santiviquez

AI & ML interests

ML @ NannyML. A bit of everything. NLP, RL, and, of course, tabular. In the GenAI era, how can you not love tabular data? Educational content and OSS.

Articles

Are your NLP models deteriorating post-deployment? Let’s use unlabelled data to find out

Nov 16, 2023

Organizations

Posts 18

Post

1535

Looking for someone with +10 years of experience training Deep Kolmogorov-Arnold Networks.

Any suggestions?

Post

2033

More open research updates 🧵

Performance estimation is currently the best way to quantify the impact of data drift on model performance. 💡

I've been benchmarking performance estimation methods (CBPE and M-CBPE) against data drift signals.

I'm using drift results as features for many regression algorithms, and then I'm taking those to estimate the model's performance. Finally, I'm measuring the Mean Absolute Error (MAE) between the regression models' predictions and actual performance.

So far, for all my experiments, performance estimation methods do better than drift signals. 👨‍🔬

Bear in mind that these are some early results, I'm running the flow on more datasets as we speak.

Hopefully, by next week, I will have more results to share 👀

View all posts