Large Scale Machine Learning and Other Animals

Danny Bickson Aug 28, 2022 Updated Aug 28, 2022


A couple of days ago, I got this phishing message on LinkedIn:

The English is pretty bad and the location of the person in Africa makes it suspicious. It immediately jumped to my mind that this is a great opportunity to test reverse image search capabilities to check the legitimacy of the sender.

Starting with Microsoft image search, I got this unhelpful answer:

Namely, similar poses but the wrong people.
Next I used Yandex image search and got the following:

The first hit is an exact match, the Instagram page of someone named Sarah Benson:

Finally, Google reverse image search returned the following:

A combination of two ads and two correctly returned results: one under the name Veronica Manesh and one on an anti-scam forum under the name Maria Zabrash (which points to a nonexistent Facebook page).
I also tried Baidu reverse image search but I wasn't able to get it to work at all.
Overall it seems super easy to find the same person's image in multiple accounts under different names. I wonder why LinkedIn has no profile checking to make sure the image is legit?
Next I tried to search for the unnatural English phrase "I see it as a nice pleasure, getting in touch with you". I found it in many places on the web, mainly on dating sites featuring African women and also on some scam warning sites. Again it took me a couple of seconds to find many sites that should have rung the alarm bells.
To conclude, it seems there is zero effort on LinkedIn's side to filter scams. In terms of the best reverse image search, I often find Yandex to work better than the others.


Large Image Datasets Are a Mess!

Danny Bickson May 24, 2022 Updated May 24, 2022


Here is my latest blog post on Medium: https://medium.com/@amiralush/large-image-datasets-today-are-a-mess-e3ea4c9e8d22


Segments.ai - Automating manual tagging work

Danny Bickson Feb 25, 2022 Updated Feb 25, 2022


I found this cool small Belgian YC21 startup called segments.ai. It seems they have a good technique for automating the manual tagging process:


Having fun with Macaw - or - Dinosaurs meat is sour!

Danny Bickson Nov 4, 2021 Updated Nov 4, 2021


Following the announcement from Prof. Oren Etzioni about their new Macaw NLP model which should be up to 10% better than OpenAI's model, I spent a fun afternoon playing with the released model.

So far three models have been released: large, 3b and 11b. The documentation says large is the lightweight model (indeed a good naming convention here) and should not be fully trusted; the other ones are 3b (medium) and 11b (the big one).

I started with the large model and my conclusion is that, besides solving the demo question about the sky's color, it is completely nonfunctional.

Who killed Jesus? The romans (correct)
Who is Carlos Guestrin? a leading authority on space and human exploration (wrong)
Who is Oren Etzioni? a leading authority on migration and conservation (wrong)
What is Belgian Malinois? a kind of dog that carries a large amount of weight (wrong)
Who came first the Romans or the Greeks? The Romans (wrong)
What is the population of Israel? 2.4 Billion (wrong)
Who is larger a bird or an elephant? the bird, the elephant (wrong)
Who is Benjamin Netanyahu? the prime minister of new york (funny and wrong)
Are you drunk? Yes (maybe right?)
Name a book by Alexander Dumas? The Comedy of Errors (again wrong).

To summarize, since the model is so often wrong, it would help to get an additional confidence estimate that tells you when the answer is most likely junk.
After getting familiar with the small and weird model, I moved to the medium model and continued to have fun there.

How many cents in one dollar? 100 (correct)
What is the main cause of global warming? greenhouse effect (correct). Some other good answers: volcanic eruptions (B) ocean currents (C) soil erosion (D) human population growth
What is the best vertical to open a startup in machine learning? healthcare, additional answers: (A) finance (B) healthcare (C) retail (D) technology. Not bad! Note that some answers repeat; there is no pruning of answers.
What should I do when my boss is upset with me at work? apologize. Additional options: (A) leave the office (B) call the police (C) go to the mall (D) stay late. I like the apologize and stay late! Call the police answer is hilarious. And go to the mall is the favorite approach of my wife!
What is the best way to defend against Covid virus? vaccinating (correct). Additional answers: (A) using a flu vaccine (B) using a tetanus shot (C) using a gamma ray to destroy the virus (D) using a gamma ray to destroy the virus.
What is the recommended number of whiskey shots to drink before driving? Two (wrong!)
If I love her does she love me back? yes (wrong) Additional answers start to look better: (A) she will love me back (B) she will hate me (C) she will leave me (D) she will never love me.
How many calories in marble ball? 0 (correct)
Who is the best venture capital firm? SBI (wrong, never heard of them)
What is the taste of dinosaur meat? sour ; Additional option (A) salty (B) sweet (C) savory (D) a little bit sour. Who knows?
Is there life on other planets? yes. Additional option (A) no life is found on other planets (B) there is life on other planets (C) there is life on Mars (D) there is life on other planets. Who knows?
Were there weapons for mass destruction in Iraq? yes (wrong). Additional options (A) no weapons of mass destruction were found in Iraq (B) there were no weapons of mass destruction in Iraq (C) there were weapons of mass destruction in Iraq but they were destroyed (D) there were weapons of mass destruction but they were not destroyed. Interestingly both Iraq and Mars are capitalized (probably identified as named entities) while some entities are in lowercase.
Who is behind the nine eleven attack? al qaeda (correct). Additional conspiracies (A) the government (B) the military (C) the intelligence community (D) the religious right
What year will the aliens attack us? 2100. Who knows?
Do ghosts exist? yes. Additional options: (A) they are just a kind of animal (B) they are made of air (C) they are made of water (D) they exist in the sky.

To conclude, some answers are totally correct and some answers are totally hilarious. Maybe one should add a "don't know" answer and return it when the confidence is very low.
The handling of names is a bit weird, for example Iraq is capitalized but new york and al qaeda are not. I wonder if there is a post-processing step that capitalizes some named entities?
In terms of running time the models are pretty slow. It takes a couple of minutes on the latest Mac M1 to get an answer. I will next try the 11b model; it is 40GB and thus slooooow to download locally.
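
The "don't know" idea and the missing pruning of repeated answers can be sketched in a few lines. This is only an illustration under assumptions: it presumes each candidate answer comes with a log-score, which is not part of the Macaw output shown above, and the function name `answer_or_abstain` and the 0.6 threshold are made up for the example.

```python
import math


def answer_or_abstain(candidates, threshold=0.6):
    """Pick an answer, or abstain when the model looks unsure.

    candidates: list of (answer_text, log_score) pairs. The log-scores are an
    assumption of this sketch; the model output above does not show them.
    Returns (answer, confidence).
    """
    if not candidates:
        return "don't know", 0.0

    # Prune repeated answers: keep the best score per normalized text.
    best = {}
    for text, score in candidates:
        key = text.strip().lower()
        if key not in best or score > best[key]:
            best[key] = score

    # Softmax over the unique answers (shifted by the max for stability).
    top_score = max(best.values())
    weights = {k: math.exp(v - top_score) for k, v in best.items()}
    total = sum(weights.values())

    top_answer = max(weights, key=weights.get)
    confidence = weights[top_answer] / total

    # Abstain when the best answer does not clearly dominate the others.
    if confidence < threshold:
        return "don't know", confidence
    return top_answer, confidence


if __name__ == "__main__":
    print(answer_or_abstain([("yes", -0.1), ("no", -3.0)]))
    print(answer_or_abstain([("sour", -1.0), ("salty", -1.0), ("Sour", -1.2)]))
```

The threshold trades coverage for precision: a higher value makes the model abstain more often but return fewer hilarious answers.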


BebopNet: Deep Neural Models for Personalized Jazz Improvisations

Danny Bickson Nov 3, 2021 Updated Nov 3, 2021


I recently found this paper: BebopNet: Deep Neural Models for Personalized Jazz Improvisations, by Shunit Haviv Hakimi, Nadav Bhonker, Ran El-Yaniv from the Technion. It uses a deep learning approach to teach a machine to improvise when playing jazz. The paper won the best paper award at ISMIR 2020.

The results are pretty cool. The level of improvisation is pretty good but I hear a little awkwardness in the timing of the notes.


Amazing Demo from SparkBeyond

Danny Bickson Oct 14, 2021 Updated Oct 14, 2021


My friend Sagie Davidovich, CEO of SparkBeyond, has shown me the following amazing demo:

SparkBeyond crawled hundreds of billions of Internet pages, papers, patents and social media sites to build one of the largest available knowledge graphs. Based on this data it is possible to ask natural language questions and get an aggregated knowledge summary. Unlike Google search, where you have to manually go over a zillion resources, here the data is summarized and aggregated visually. It is possible to understand reasons and trends, ask follow-up questions and see supporting evidence and statistics.

Unlike the typical language model, which gives you a summary without knowing where the data was obtained from, in SparkBeyond's model it is possible to get detailed references showing where the answer is coming from.

An interesting related work is ColBERT from Prof. Matei Zaharia. Instead of memorizing the full language model using hundreds of billions of parameters, a significantly smaller index is maintained that retrieves the relevant information on the fly.


Colossal - The Future of DNA Editing?

Danny Bickson Sep 27, 2021 Updated Sep 27, 2021


I found some recent news about Colossal, a new startup that wants to revive the extinct mammoth to fight global warming. Fighting global warming is one of the best things we can do, and it adds credibility that one of the co-founders is Prof. George Church from Harvard Medical School, a very credible authority on gene editing. Church is one of the pioneers of CRISPR, a gene editing tool that can cut and paste any desired segment of the DNA and thus make whatever changes we would like.

Here is my take on it:

Their website is amazing; a lot of effort was invested on that front. It backs up the pretty wild idea and thus draws a lot of attention to this work. The raised amount of 15M$ is tiny considering the amount of lab effort, equipment, materials etc.
Global warming sounds like an awkward excuse to fund the research they really like to do.
Ben Lamm, CEO of Colossal, told The Washington Post in an email that the extinction of the woolly mammoth left an ecological void in the Arctic tundra that Colossal aims to fill. The eventual goal is to return the species to the region so that they can reestablish grasslands and protect the permafrost, keeping it from releasing greenhouse gases at such a high rate.
Sending a wild mammoth to eat grass somewhere frozen, with the hope of reducing gas emissions, is likely the most complicated way to fight global warming I can imagine. But it is a sexy way of drawing news attention.
Mammoth DNA and human DNA are most likely around 90% similar. Thus the ability to revive an extinct mammoth would also enable reviving people. Recently, Israeli research has shown the possibility of raising mice embryos outside the womb. So raising a mammoth outside the womb, as they would like to do, may be doable.
Christopher Preston, a professor of environmental ethics and philosophy at the University of Montana, questioned Colossal’s focus on climate change, given that it would take decades to raise a herd of woolly mammoths large enough to have environmental impacts.
So, the real applications of this technology may be in humans. For example, what if I wanted to revive my dead grandfather? What if I wanted a baby with blond hair and blue eyes? My guess is that there is a huge market for this technology in real life.

I wonder why all the news and media attention ignores the actual use cases of this technology?


Israeli Machine Vision Conference is Coming up Soon: Oct 26, 2021

Danny Bickson Sep 14, 2021 Updated Sep 14, 2021


The Israeli Machine Vision Conference is coming up next month on October 26. One of the interesting papers presented there will be the transformer visualization work I wrote about last week.


How can we visualize attention?

Danny Bickson Sep 6, 2021 Updated Sep 6, 2021


A nice and recent paper from Lior Wolf's lab at Tel Aviv University: https://arxiv.org/pdf/2103.15679.pdf by Hila Chefer, Shir Gur and Lior Wolf. The problem is very simple: given a transformer encoder/decoder network, we would like to visualize the effect of attention on the image. While the problem is simple, the answer is pretty complicated: we need to take into account attention matrices from multiple layers at once. The paper suggests an iterative way to add up all those attention layers into one coherent image.

Figure 4 shows that the result is very compelling compared with previous work:

The top row is the new paper and the bottom row is prior work for comparison.
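
To get a feeling for what "combining attention from multiple layers" means, here is a minimal sketch of plain attention rollout, an older and simpler baseline. It is not the update rule of the paper, which also uses gradient information; the function name and the input layout (one array of shape heads x tokens x tokens per layer) are assumptions made for the example.

```python
import numpy as np


def attention_rollout(attentions):
    """Fold per-layer attention maps into a single token-to-token map.

    attentions: list of arrays shaped (heads, tokens, tokens), first layer
    first, each row summing to 1. This is plain attention rollout, a simpler
    stand-in for the gradient-based rule of the paper.
    """
    n_tokens = attentions[0].shape[-1]
    rollout = np.eye(n_tokens)
    for layer_attention in attentions:
        # Average over heads to get one (tokens, tokens) matrix per layer.
        a = layer_attention.mean(axis=0)
        # Mix in the identity to account for the residual connection.
        a = 0.5 * a + 0.5 * np.eye(n_tokens)
        # Re-normalize rows so every layer stays a proper averaging step.
        a = a / a.sum(axis=-1, keepdims=True)
        # Compose with the relevance accumulated from the earlier layers.
        rollout = a @ rollout
    return rollout


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    raw = rng.random((4, 8, 10, 10))  # 4 layers, 8 heads, 10 tokens
    layers = [layer / layer.sum(axis=-1, keepdims=True) for layer in raw]
    relevance = attention_rollout(layers)
    # Row 0 tells how much each input token contributes to output token 0.
    print(relevance[0])
```

For a vision transformer, the row of the class token (without the class token itself) can be reshaped to the patch grid and upsampled to get a heat map over the image.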


Gaussian Belief Propagation Tutorial

Danny Bickson Sep 3, 2021 Updated Sep 3, 2021


I have stumbled upon this nice tutorial, which interactively visualizes Gaussian Belief Propagation. What is nice about it is that the authors spent time to make an interactive tutorial that you can play with.

As a grad student I was totally excited about Gaussian Belief Propagation and spent a large chunk of my PhD thesis on it. In a nutshell it is an iterative algorithm for solving a set of linear equations (for a PSD square matrix). The algorithm is very similar to the Jacobi iterative method but uses second order information (namely an approximation of the Hessian) to improve convergence speed at the cost of additional memory and computation. In deep learning terminology this is related to adding Adam/Momentum/ADMM etc. From personal experience, when people get excited about speeding up the convergence of an iterative algorithm they completely neglect the fact that there is no free lunch: when you speed up convergence in terms of number of iterations you typically pay in something else (computation/communication).

The complexity of the algorithm derivation comes from the fact that it arises from probabilistic graphical models, where the notation of the problem is cumbersome, as it can be presented as either a factor graph or an undirected graphical model. A factor graph is a bipartite graph with evidence nodes (the input) on one side and functions aggregating the nodes on the other side. It is very similar to a single dense layer in deep learning where the input is coming from the left and the summation plus activation is done on the right. However, unlike deep learning, the factor graph has only a single layer and the messages propagate back and forth between the factors and the variable (input) nodes. So the factor graph is the great-grandfather of deep learning.

To make it totally confusing, the seminal paper by Prof. Weiss uses pairwise notation, which is a third way of presenting the same model. (Instead of a single linear system of equations it is a collection of multiple sets of sparse linear equations where each set has two variables only.)

Any differentiable function can be locally approximated to first order around a point by computing the gradient. That is why we often see linear modeling when modeling complex problems, including in deep learning where each dense layer is linear before its activation. This is why solving linear models is relevant in so many domains.
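
Written out, the first-order approximation looks as follows. The symbols ($f$, $x_0$, $A$, $b$) are chosen for this illustration and do not come from the tutorial; the second line is the standard quadratic example, assuming a symmetric $A$, that ties the gradient to a linear system.

```latex
f(x) \approx f(x_0) + \nabla f(x_0)^{\top} (x - x_0)

f(x) = \tfrac{1}{2} x^{\top} A x - b^{\top} x
\quad\Longrightarrow\quad
\nabla f(x) = A x - b = 0
\iff A x = b
```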

Another nice property of the algorithm is that besides the marginal means (the solution to the linear system of equations) we get an approximation to the main diagonal of the inverse matrix of the linear system. This is often useful when inverting the full matrix is too heavy computationally.
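
To make the description concrete, here is a minimal dense-matrix sketch of the message updates, assuming a symmetric and diagonally dominant matrix so that the iteration converges. The function name `gabp`, the synchronous update schedule and the stopping rule are choices made for this example and are not taken from the tutorial.

```python
import numpy as np


def gabp(A, b, max_iter=200, tol=1e-10):
    """Gaussian Belief Propagation sketch for solving A x = b.

    Assumes A is symmetric and diagonally dominant (a sufficient condition
    for convergence). Returns the solution estimate and an approximation of
    the main diagonal of inv(A).
    """
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = len(b)
    P = np.zeros((n, n))  # P[i, j]: precision message sent from i to j
    h = np.zeros((n, n))  # h[i, j]: precision-times-mean message from i to j
    precision = np.diag(A).copy()
    x = b / precision
    for _ in range(max_iter):
        P_new = np.zeros_like(P)
        h_new = np.zeros_like(h)
        for i in range(n):
            # Local evidence plus everything node i has received so far.
            P_in = A[i, i] + P[:, i].sum()
            h_in = b[i] + h[:, i].sum()
            for j in range(n):
                if i == j or A[i, j] == 0.0:
                    continue
                # Exclude what j itself told i, then pass the rest to j.
                P_cavity = P_in - P[j, i]
                h_cavity = h_in - h[j, i]
                P_new[i, j] = -A[i, j] ** 2 / P_cavity
                h_new[i, j] = -A[i, j] * h_cavity / P_cavity
        P, h = P_new, h_new
        # Marginals: combine local evidence with all incoming messages.
        precision = np.diag(A) + P.sum(axis=0)
        x_new = (b + h.sum(axis=0)) / precision
        converged = np.max(np.abs(x_new - x)) < tol
        x = x_new
        if converged:
            break
    return x, 1.0 / precision


if __name__ == "__main__":
    A = np.array([[4.0, 1.0, 1.0], [1.0, 5.0, 2.0], [1.0, 2.0, 6.0]])
    b = np.array([1.0, 2.0, 3.0])
    x, inv_diag = gabp(A, b)
    print("GaBP solution:", x)
    print("direct solve: ", np.linalg.solve(A, b))
    print("approx diag of inverse:", inv_diag)
    print("exact diag of inverse: ", np.diag(np.linalg.inv(A)))
```

On a graph without cycles both outputs are exact; with cycles the means are exact upon convergence while the diagonal of the inverse is only approximated, which matches the "no free lunch" remark above.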


Amazing: Carbon Robotics Weeder - Deep Learning for Organic Weed Control!

Danny Bickson Aug 29, 2021 Updated Aug 29, 2021


This is totally amazing:

As someone who works on manufacturing automation with robotics and vision, I can say this is a very complicated task since the robot has to distinguish from a 2D image between the right crop and weeds. Also the laser shooting of the weeds is awesome!

After one minute of digging I found out I know Nick Kirsch, who is a director at Carbon Robotics and was an executive intern in our startup Turi in 2016! This is a Seattle-based company. I can't wait to talk to Nick and learn more.


What is MBZUAI ?

Danny Bickson Aug 29, 2021 Updated Sep 3, 2021


Today I found out (slightly late) that Prof. Eric Xing from Carnegie Mellon joined MBZUAI (Mohamed bin Zayed University of Artificial Intelligence) as its President late last year. Eric is a well-known professor whom I know from my CMU days and who was the CEO of Petuum, a Parameter Server-like implementation for scaling up machine learning.
From the MBZUAI website: MBZUAI is the world’s first graduate-level, research-based artificial intelligence (AI) university. Launched in October 2019 and located in Masdar City, Abu Dhabi, the University aims to empower students, businesses and governments to advance artificial intelligence as a global force for positive progress.
When reading this news I also found that the Israeli Weizmann Institute is collaborating with MBZUAI on a joint AI program. This is a great fruit of the recent peace treaty between Israel and Abu Dhabi.
Another interesting organization is g42.ai, which is an OpenAI-like org from Abu Dhabi.


Israeli AI21 Launches the Biggest NLP Model So Far

Danny Bickson Aug 22, 2021 Updated Aug 22, 2021


AI21 is a research lab, the Israeli equivalent of OpenAI, founded by several machine learning luminaries including Prof. Amnon Shashua (Mobileye, Orcam, Digital Bank), who is a professor at the Hebrew University. (Amnon was my lecturer for the ML course, which was an amazing course, and he is an amazing person as well.)

This week AI21 announced the release of the largest NLP model, called Jurassic-1. It is comparable to GPT-3. There is no objective evaluation of the two models, but AI21 mentions that the vocabulary of its model has 250K tokens (compared to around 50K for GPT-3), which gives more flexibility in answering questions regarding common phrases, named entities etc. A great tutorial for GPT-3 is given in Yannic's YouTube channel:

Building such a large NLP model is challenging, since the model has around 170B parameters and you need weeks of training with hundreds of GPUs, a cost that typically only the biggest companies can afford. Another interesting company I recently met is LightOn, which builds photon-based hardware for training language models; they recently announced the largest French language model.

It will be interesting to see when AI21 and similar companies move to training on non-English corpora, which is where such companies can shine.

An interesting conference coming up soon is the NLP Summit (An online event Oct 5-7).


Yannic Kilcher - The Man and the Legend

Danny Bickson Aug 8, 2021 Updated Aug 8, 2021


I recently stumbled upon Yannic's YouTube channel and I was totally blown away. Yannic is a fresh PhD out of ETH Zurich and he has a few dozen recent deep learning papers explained amazingly well. Both the selection of papers and the explanation of the content are smart. In addition, for some of the papers he adds personal comments and critiques of the papers' claims, which really make sense. The tutorials target an advanced deep learning audience and cover advanced topics which Coursera courses have mostly not caught up with yet, for example great coverage of transformers for both language and image models.

According to his LinkedIn, Yannic recently started a company along with 3 other ETH PhDs called DeepJudge which deploys deep learning NLP models in the legal domain. The company is 4 months old and according to CrunchBase raised a small seed round.

Based on the brains of the DeepJudge team, I call on all the VCs, headhunters, university recruiters and everyone else to wake up! I am pretty sure we will hear a lot from those guys.


Registration is open - Deep Learning Autumn School at Bar Ilan University

Danny Bickson Sep 25, 2019 Updated Sep 25, 2019


My friend Ely Porat sent me the following registration notice for the deep learning and AR/VR Autumn courses:
https://sites.google.com/datalab.cs.biu.ac.il/biusummerschool/home?authuser=0
The registration fee is pretty low and you can get academic credit from Bar Ilan University.


Israeli AI Week - call for talks ending Aug 1st

Danny Bickson Jul 14, 2019 Updated Jul 14, 2019


My friend Assaf Araki from Intel invited me to give a hand organizing the Israeli AI Week. We are looking for talks from innovative companies and machine learning researchers.
The AI-Week in Tel Aviv is a nationwide community event organized by representatives from academia, government, industry and NGOs. AI Week will bring together ~2000 AI researchers and practitioners from Israel and around the globe. The event will take place on the Tel Aviv University campus from November 17th to November 20th and will include one day of hands-on workshops, two conference days with 100+ speakers, a two-day data hackathon and multiple additional events.
The goals of AI Week are to increase the synergy within the local AI ecosystem and with leading global innovators while exposing Israeli AI innovation to the world. This event will include cutting-edge sessions delivered by leading researchers and industry innovators from Israel and around the world.
The Israel AI Week founding members include Tel Aviv University, Startup Nation Central, Israel Innovation Authority & Intel. We are adding more and more partners to this community effort.
Among our steering committee members are Prof. Amnon Shashua (SVP Intel, President and CEO of Mobileye & Professor at the Hebrew University), Major Gen. (Res.) Professor Isaac Ben-Israel (Chairman of Israel Space Agency, Chairman of Israel National Council for R&D & Professor at Tel-Aviv University), Prof. Eugene Kandel (CEO of Start-Up Nation Central & Professor at the Hebrew University) and Aharon Aharon (CEO of Israel Innovation Authority).


Allen Institute Opens an Israeli Branch

Danny Bickson May 23, 2019 Updated May 23, 2019


This Tuesday Prof. Oren Etzioni announced in his keynote talk at the Data Science Summit that the Seattle Allen Institute is opening an Israeli branch:

Prof. Yoav Goldberg from Bar Ilan is heading the research effort, with focus on NLP.


Upcoming data science events in Israel

Danny Bickson Mar 7, 2019 Updated Mar 7, 2019


My friend Assaf Araki is organizing the AI Week, Nov 17-21, at Tel Aviv University. It is going to be the first national AI event with around 2000 participants. Dr. Ben Lorica, chief scientist of O'Reilly, will be one of the speakers along with Prof. Amnon Shashua, CEO of Mobileye.

Another interesting event is the AI Data Science Summit - May 21 in Jerusalem. It is a follow up event organized by Avner Algom of our European Data Science Summit. Prof. Oren Etzioni from the Allen Institute for AI is one of the keynote speakers.


Hacking Deep Learning (Bar Ilan) Workshop Videos

Danny Bickson Feb 19, 2019 Updated Feb 19, 2019


The Hacking Deep Learning (Bar Ilan) Workshop videos are now online. Thanks to my friend Prof. Yossi Keshet for organizing and inviting me!
One notable talk which is unfortunately missing from the videos is that of Prof. Adi Shamir, described in this paper. The work analyzes how many pixels one should change to confuse a deep learning based classifier. The result is surprising - only a few! A related work is described here.


UAI 2019 is coming to Tel Aviv

Danny Bickson Feb 17, 2019 Updated Feb 17, 2019


I got this from my friend Nick:
This year, Tel Aviv will host UAI http://auai.org/uai2019/ (July 22-25). Several students have been able to attend the conference and present their work thanks to the generosity of your institutions. We hope that you will continue to support us this year as well. You can find more information about sponsorship packages here http://auai.org/uai2019/sponsorships.php .
Regards
Nikolaos Vasiloglou

UAI 2019 Sponsorship Chair
Feel free to reach out to Nick in case you are interested in sponsoring the event.


Alibaba acquires Data Artisans?

Danny Bickson Jan 10, 2019 Updated Jan 10, 2019


Data Artisans is the company behind Apache Flink - the European answer to Apache Spark.
According to this news article, Alibaba is acquiring Data Artisans.

I wrote about the Apache Flink project back in 2014.


Apple shares Turi Create open source framework

Danny Bickson Dec 9, 2017 Updated Dec 9, 2017


It is very exciting that after many years of hard work, we have finally released our machine learning framework as open source! The announcement was made yesterday at NIPS by Prof. Carlos Guestrin:

And here is our github link: https://github.com/apple/turicreate


Prof. Joseph Keshet from BIU fools deep learning

Danny Bickson Sep 8, 2017 Updated Sep 8, 2017


My friend Joseph (Yossi) Keshet has recently released work on fooling deep learning systems. His work got a lot of attention, including from MIT Technology Review and the New Scientist. Nice work!!


Dataiku raised 28M$

Danny Bickson Sep 8, 2017 Updated Sep 8, 2017


According to VentureBeat Dataiku just raised 28M$. Dataiku has a web based platform for data science.

Here is my personal connection. Strangely last time I wed a couple I was wearing their t-shirt.

Unrelated, I just learned from my colleague Brian that Cloudera just acquired Fast Forward Labs, which is the company behind Hilary Mason. I visited Hilary in her offices a couple of years ago and learned they had an interesting consulting model of sharing periodic tech reports to help data scientists become more proficient. Congrats Hilary!


Deepgram - Audio Search with Deep Learning

Danny Bickson Sep 5, 2017 Updated Sep 9, 2017


A very interesting podcast by Sam Charrington, who is interviewing Scott Stephenson from DeepGram. DeepGram is using deep learning activations for creating indexes that allow searching for text in voice recordings.

DeepGram has released Kur, which is a high-level abstraction over deep learning frameworks that allows quickly defining network layouts. Still, the target persona is researchers with deep learning knowledge.

A related Israeli startup is AudioBurst. They claim to use AI for indexing but it is not clear what they actually do. Another Israeli startup is Verbit. They seem to transcribe audio with humans going over the preliminary result.

In addition, my friend Yishay Carmiel is working on importing parts of Kaldi to TensorFlow. A recent Google developer blog post describes this effort. Yishay is leading a spinoff of Spoken called IntelligentWire, which is also searching audio files using deep learning.

Overall it seems that search in audio files using deep learning is getting hotter!
