AI Shutterstock

Shutterstock

Nominated Award: Best Use of Responsible AI & Ethics

Website of Company: www.shuttesrtock.com

Shutterstock is a growing, fast-paced, entrepreneurial company operating within a disruptive industry for over 15 years. Well-positioned as the leader in the digital content space, Shutterstock has the largest crowd-sourced digital content library in the world, including leading collections of images, vectors, music, and video. We manage a library of creative building blocks for an expanding global customer base.

Shutterstock adds hundreds of thousands of images, video and audio content each week, and currently has more than 200 million images and more than 9 million video clips available. Headquartered in New York City, Shutterstock has offices around the world and customers in more than 150 countries. The company also owns: Bigstock, a value-oriented stock media agency; Shutterstock Custom, a custom content creation platform; Offset, a high-end image collection; PremiumBeat, a curated royalty-free music library; and Shutterstock Editorial, a premier source of editorial images for the world’s media.

Our Creative platform supports photographers, advertisers, film & television studios, marketers and publishers to produce and consume amazing content, while our Editorial business is a world leader in the production and distribution of real-time celebrity, sports, fashion and news photography, as well as covering events such as the Academy Awards, London Fashion Week and the Met Gala.

Reason for Nomination

Programme Overview
Shutterstock is a leading global creative platform offering full-service solutions, high-quality content, and creative workflow solutions for brands, businesses and media companies. Working with its growing community of over 2.0 million contributors, Shutterstock adds hundreds of thousands of images each week, and currently has more than 400 million images and more than 24 million video clips available along with a comprehensive collection of 3D models and music. In July 2021 Shutterstock announced Shutterstock.AI along with three start-up acquisitions: Pattern89, Datasine, and Shotzr. Shutterstock.AI powers new recommendations, predictive performance products and services that support the entire creative process. This will help customers make more informed content choices to ensure that what they produce is going to resonate with their audiences. The massive data and the renewed investment in AI at Shutterstock have called for the birth of a new Diversity and Inclusion Program.

Purpose of the Programme
At Shutterstock our goal is to build a workforce, contributor network, content library and AI products that are a reflection of the diverse global community we serve. We are proud of our rich mix of backgrounds and perspectives. We want to nurture this diversity by making sure all employees feel included and supported, our product, search and content offerings satisfy our clients’ diversity needs, and that Shutterstock is living its commitment to DEI through the way we do business, operate and build AI technology. Furthermore we aim to become a beacon in helping the AI community to establish guiding principles and common practice towards Ethical AI which starts with diverse data, an audited AI process, and inclusive teams.

Details of Specific Initiatives
Our view is that an inclusive culture is a shared responsibility of all of our employees. Led by our leaders, the focus is to ensure that all employees understand our DEI mission and know what their role is in achieving that mission.

We created a four pillars initiative:
– Creating Diversity in Data. At Shutterstock we source content with diversity via a network of over 2 million contributors from more than 150 countries. But, we don’t stop here. In 2020, we established “The Create Fund” to empower historically excluded artists, help fill content gaps, and further diversity and inclusion within our content library and contributor network. The Create Fund approach diversity in our content in two ways: (i) Artist Investment Program: Through our Artist Investment Program, Shutterstock is providing mentorships, cash grants and other support to empower historically excluded artists in the stock content industry. In 2021, we launched a webinar series in partnership with Diversify Photo and Black Women Photographers to promote the program among the artists in their networks. In 2022 we partnered with People of Color Collective in the UK to provide grants Black and Asian artists and will continue to support their members throughout the year. (ii) Grant Contest Program: Shutterstock is partnering with a variety of leading organizations to help change the conversation about important diversity and inclusion topics. We aim to help lessen the barrier for-and invest in-artists committed to highlighting diversity, inclusion, social justice, and environmental awareness through the content they create. Our partners have included It Gets Better Project and the American Society (ASA) on Aging. In 2022, we published a guide in partnership with the ASA to help our contributors add inclusive metadata in our platform.

– Audited Process. Machine-learning models are commonly trained on large amounts of real world data. This could entail the risk of inheriting human biases as it is observed in everyday life. Machine learning models can be very powerful and without proper testing not only could recycle these biases but could also enhance these phenomena. Therefore, it’s absolutely necessary to establish a rigorous testing process for biases before these models reach production. At Shutterstock, we test our AI models for a number of different biases from gender to ethnicity and behavioral. We do this by utilizing a large curated collection of images which we use to profile our AI models. We perform a number of statistical tests to ensure that there is no preference for a certain subset of characteristics in the data that could indicate the possibility of biased results. If an AI model does show some bias at our bias testing step then it is not deployed to production but goes back to the designing stage for retraining.

– Diverse and Inclusive Teams. This effort starts with building a diverse tech talent pipeline. We have established partnerships with organizations, like Black Girls Code, Black Professionals in Tech Network, Women Who Code and Women in AI that are on the frontline of increasing diversity in Tech. Through these partnerships, we are providing role models and additional support to develop those who have been historically excluded or underrepresented in the tech industry. We have already reached 40-60 gender balance in our first AI team which is an enormous improvement compared to the average 22% participation in this field. This team also brings diverse culture and backgrounds as its members are from South Africa, Spain, US, Canada, Malaysia, Ireland, Greece and Italy.

– Ethical Training: Machine-Learning models have the potential to solve the world’s biggest challenges, but they can also pose risks to individuals and groups. As we develop this technology, we must consider its ethical ramifications. Responsible design and use of AI starts with training the technical teams to question their inventions with wider social, economic and cultural perspectives. We have partnered with the World Ethical Data Foundation (https://worldethicaldata.org/ ) which is developing a specialized training for Shutterstock’s employees to raise awareness of the societal impacts of AI technology and to give our technical employees the tools and knowledge to pursue responsible AI.

Additional Information:

Impact on the Organization
Shutterstock is focused on creating the best possible experience for our global community of contributors, employers and customers. This program is core to the future vision of Shutterstock to ensure that our content is diverse and authentically representative of our global customers and community. Beside our internal impact, we aim for broad external focus. For instance, in Nov 2021 we were the only company at UNESCO presiding the press conference for the launch of the Recommendation of Ethics in AI.

Impact on the Team / Individuals
We believe that diverse and inclusive teams present the most innovative work and will better position us to meet the expectations of our customers, contributors and community. We are committed to building a diverse pipeline that is representative of the areas where we operate and through a data-driven approach we are taking the requisite steps to implement programs and establish partnerships that help us reach those who are traditionally excluded and are underrepresented in different areas of our business.

Shutterstock project 2021

Nominated Award: Best Application of AI in a Large Enterprise

Shutterstock is a growing, fast-paced, entrepreneurial company operating within a disruptive industry for over 15 years. Well-positioned as the leader in the digital content
space, Shutterstock has the largest crowd-sourced digital content library in the world, including leading collections of images, vectors, music, and video. We manage a library of  creative building blocks for an expanding global customer base.

Shutterstock adds hundreds of thousands of images, video and audio content each week, and currently has more than 400 million images and more than 20 million video clips available.

Headquartered in New York City, Shutterstock has offices around the world and customers in more than 150 countries. The company also owns: Bigstock, a value-oriented stock media agency; Shutterstock Custom, a custom content creation platform; Offset, a high-end image collection; PremiumBeat, a curated royalty-free music library; and Shutterstock Editorial, a premier source of editorial images for the world’s media.

Our Creative platform supports photographers, advertisers, film & television studios, marketers and publishers to produce and consume amazing content, while our Editorial business is  a world leader in the pro duction and distribution of real-time celebrity, sports, fashion and news photography, as well as covering events such as the Academy Awards, London Fashion Week and the Met Gala.

Shutterstock is looking for Data Science interns to join our product teams, working on the future of our creative and predictive performance platforms.

You will be working with highly motivated and extremely talented data scientists on large datasets , analytics and visualization tools and, finally, deep learning and machine learning. Our core belief is that the customers are our number one priority so we work hard to deliver value back to them with everything we do. We believe strongly in team ownership of systems, which includes defining the vision of the services to prioritization of projects.

Reason for Nomination:

Background:                                                                                                                                        High-quality labelled image datasets are vital for pushing the boundaries of what’s possible in AI today. Such collections are essential for the supervised training of computer vision models for tasks such as object detection and image segmentation. However, despite the numerous recent advances in this field, the acquisition of such datasets remains a key challenge both in academia and industry.

The problem:

The problem is twofold. First, dataset collection for a specific problem is a time-consuming and expensive process with publicly available datasets currently lacking the volume and variety required to train highly performant models. Datasets for very specialised and niche problems (which are common in industry) either do not exist, or are extremely difficult to curate. Second, keyword-based search, a common approach used today, is a rigid and somewhat limited approach for image retrieval which often fails when trying to collect very large datasets containing highly specific objects, actions or scenes. For example, the concept of “dangerous driving” can be represented by many distinct images such as an in-vehicle shot of a driver holding a phone or an image of a car performing an illegal maneuver. A keyword-based approach will likely struggle to retrieve images that represent a specific semantic concept such as this.

Solution:

We have built a dataset curation system that is able to analyze 400M images via a complex semantic embedding and retrieve specific datasets that satisfy highly granular text queries. The system facilitates the tailoring of a curated dataset to any niche or bespoke problem the user may require a model to be trained for (e.g. 100K images of people in cars wearing a seatbelt). An interactive relevance feedback mechanism has also been implemented to refi ne a given collection and further improve the overall quality.

Additionally, this system has the ability to parse complex semantic filters such as “only partially occluded objects” or “only large objects in the upper left quadrant of the image”.
Finally, the system has been designed to combat model bias by producing representative dataset s in terms of race, gender and age. Impact: This new dataset curation system enables the fast and reliable creation of specialised datasets to train supervised machine learning models. This is a key step in the rapid development of industrial AI solutions and the acceleration of research. By fully leveraging the ~400M high quality images within the Shutterstock collection this system provides a unique opportunity in dataset curation.

This approach also enables a user to semantically explore a large scale dataset (100M+) enabling a deeper understanding of the content both in terms of volume an d variety of images in a highly flexible manner. Being able to curate datasets that are tailored for specific problems enables models to specialise on tasks that would not be possible with more generic datasets obtained through the status quo techniques for curation and annotation.