The Basque Association of Language Industries

RSS feeds

Nov 14, 2018 Night Sight: Seeing in the Dark on Pixel Phones
Posted by Marc Levoy, Distinguished Engineer, and Yael Pritch, Staff Software Engineer. Night Sight is a new feature of the Pixel Camera app that lets you take sharp, clean photographs in very low light, even in light so dim you can't see much with your own eyes.
Nov 12, 2018 Accurate Online Speaker Diarization with Supervised Learning
Posted by Chong Wang, Research Scientist, Google AI. Speaker diarization, the process of partitioning an audio stream with multiple people into homogeneous segments associated with each individual, is an important part of speech recognition systems. By solving the problem of “who spoke when”, speaker diarization has applications in many important scenarios, such as understanding medical conversations, video captioning and more.
Nov 02, 2018 Open Sourcing BERT: State-of-the-Art Pre-training for Natural Language Processing
Posted by Jacob Devlin and Ming-Wei Chang, Research Scientists, Google AI Language. One of the biggest challenges in natural language processing (NLP) is the shortage of training data. Because NLP is a diversified field with many distinct tasks, most task-specific datasets contain only a few thousand or a few hundred thousand human-labeled training examples.
Oct 31, 2018 Google at EMNLP 2018
Posted by Manaal Faruqui, Senior Research Scientist, and Emily Pitler, Staff Research Scientist, Google AI Language. This week, the annual conference on Empirical Methods in Natural Language Processing (EMNLP 2018) will be held in Brussels, Belgium. Google will have a strong presence at EMNLP with several of our researchers presenting their research on a diverse set of topics, including language identification, segmentation, semantic parsing and question answering, additionally serving in various levels ...
Oct 30, 2018 Introducing AdaNet: Fast and Flexible AutoML with Learning Guarantees
Posted by Charles Weill, Software Engineer, Google AI, NYC. Ensemble learning, the art of combining different machine learning (ML) model predictions, is widely used with neural networks to achieve state-of-the-art performance, benefitting from a rich history and theoretical guarantees to enable success at challenges such as the Netflix Prize and various Kaggle competitions.
Oct 30, 2018 Acoustic Detection of Humpback Whales Using a Convolutional Neural Network
Posted by Matt Harvey, Software Engineer, Google AI Perception. Over the last several years, Google AI Perception teams have developed techniques for audio event analysis that have been applied on YouTube for non-speech captions, video categorizations, and indexing. Furthermore, we have published the AudioSet evaluation set and open-sourced some model code in order to further spur research in the community.
Oct 24, 2018 Curiosity and Procrastination in Reinforcement Learning
Posted by Nikolay Savinov, Research Intern, Google Brain Team, and Timothy Lillicrap, Research Scientist, DeepMind. Reinforcement learning (RL) is one of the most actively pursued research techniques of machine learning, in which an artificial agent receives a positive reward when it does something right, and a negative reward otherwise.
Oct 23, 2018 Fluid Annotation: An Exploratory Machine Learning–Powered Interface for Faster Image Annotation
Posted by Jasper Uijlings and Vittorio Ferrari, Research Scientists, Machine Perception. The performance of modern deep learning–based computer vision models, such as those implemented by the TensorFlow Object Detection API, depends on the availability of increasingly large, labeled training datasets, such as Open Images.
Oct 22, 2018 See Better and Further with Super Res Zoom on the Pixel 3
Posted by Bartlomiej Wronski, Software Engineer, and Peyman Milanfar, Lead Scientist, Computational Imaging. Digital zoom using algorithms (rather than lenses) has long been the “ugly duckling” of mobile device cameras. As compared to the optical zoom capabilities of DSLR cameras, the quality of digitally zoomed images has not been competitive, and conventional wisdom is that the complex optics and mechanisms of larger cameras can't be replaced with much more compact mobile ...
Oct 12, 2018 Applying Deep Learning to Metastatic Breast Cancer Detection
Posted by Martin Stumpe, Technical Lead, and Craig Mermel, Product Manager, Healthcare, Google AI. A pathologist’s microscopic examination of a tumor in patients is considered the gold standard for cancer diagnosis, and has a profound impact on prognosis and treatment decisions. One important but laborious aspect of the pathologic review involves detecting cancer that has spread (metastasized) from the primary site to nearby lymph nodes.
Oct 10, 2018 Open Sourcing Active Question Reformulation with Reinforcement Learning
Posted by Michelle Chen Huebscher, Software Engineer, and Rodrigo Nogueira, New York University PhD Student and Software Engineering Intern, Google AI Language. Natural language understanding is a significant ongoing focus of Google’s AI research, with application to machine translation, syntactic and semantic parsing, and much more.
Oct 09, 2018 Highlights from the Google AI Residency Program
Posted by Phing Lee, Program Manager, Google AI Residency. In 2016, we welcomed the inaugural class of the Google Brain Residency, a select group of 27 individuals participating in a 12-month program focused on jump-starting a career in machine learning and deep learning research. Since then, the program has experienced rapid growth, leading to its evolution into the Google AI Residency, which serves to provide residents the opportunity to embed ...
Sep 28, 2018 Introducing the Kaggle “Quick, Draw!” Doodle Recognition Challenge
Posted by Thomas Deselaers, Senior Staff Software Engineer, and Jake Walker, Product Manager, Machine Perception. Online handwriting recognition consists of recognizing structured patterns in freeform handwritten input. While Google products like Translate, Keep and Handwriting Input use this technology to recognize handwritten text, it works for any predefined pattern for which enough training data is available.
Sep 26, 2018 Building Google Dataset Search and Fostering an Open Data Ecosystem
Posted by Matthew Burgess and Natasha Noy, Google AI. Earlier this month we launched Google Dataset Search, a tool designed to make it easier for researchers to discover datasets that can help with their work. What we colloquially call “Google Scholar for data,” Google Dataset Search is a search engine across metadata for millions of datasets in thousands of repositories across the Web.
Sep 16, 2018 Google’s Next Generation Music Recognition
Posted by James Lyon, Google AI, Zürich. In 2017 we launched Now Playing on the Pixel 2, using deep neural networks to bring low-power, always-on music recognition to mobile devices. In developing Now Playing, our goal was to create a small, efficient music recognizer which requires a very small fingerprint for each track in the database, allowing music recognition to be run entirely on-device without an internet connection.
Sep 13, 2018 Introducing the Unrestricted Adversarial Examples Challenge
Posted by Tom B. Brown and Catherine Olsson, Research Engineers, Google Brain Team. Machine learning is being deployed in more and more real-world applications, including medicine, chemistry and agriculture. When it comes to deploying machine learning in safety-critical contexts, significant challenges remain. In particular, all known machine learning algorithms are vulnerable to adversarial examples: inputs that an attacker has intentionally designed to cause the model to make a mistake.
Sep 11, 2018 The What-If Tool: Code-Free Probing of Machine Learning Models
Posted by James Wexler, Software Engineer, Google AI. Building effective machine learning (ML) systems means asking a lot of questions. It's not enough to train a model and walk away. Instead, good practitioners act as detectives, probing to understand their model better: How would changes to a datapoint affect my model’s prediction? Does it perform differently for various groups, for example, historically marginalized people? How diverse is the dataset I am testing my ...
Sep 07, 2018 Text-to-Speech for Low-Resource Languages (Episode 4): One Down, 299 to Go
Posted by Alexander Gutkin, Software Engineer, Google AI. This is the fourth episode in the series of posts reporting on the work we are doing to build text-to-speech (TTS) systems for low-resource languages. In the first episode, we described the crowdsourced acoustic data collection effort for Project Unison.
Sep 07, 2018 Introducing the Inclusive Images Competition
Posted by Tulsee Doshi, Product Manager, Google AI. The release of large, publicly available image datasets, such as ImageNet, Open Images and Conceptual Captions, has been one of the factors driving the tremendous progress in the field of computer vision. While these datasets are a necessary and critical part of developing useful machine learning (ML) models, some open source data sets have been found to be geographically skewed based on ...
Sep 05, 2018 Conceptual Captions: A New Dataset and Challenge for Image Captioning
Posted by Piyush Sharma, Software Engineer, and Radu Soricut, Research Scientist, Google AI. The web is filled with billions of images, helping to entertain and inform the world on a countless variety of subjects. However, much of that visual information is not accessible to those with visual impairments, or with slow internet speeds that prohibit the loading of images.