Sony Pixel Power calrec Sony

Vidinet Cognitive Services - AWS Speech to Text

16/10/2020

Transcribe your content from speech to text- why?

There are many reasons to transcribe your spoken content in your media. The first reason that comes to mind is, of course, subtitling. Not only in the natively spoken language but also in translated versions. According to multiple research, subtitled videos improve reach, CTA, reactions, and share rates significantly. The second reason is, of course, to help you find the content you are looking for - do you remember the soundbite that the CEO made in that speech - but where is it?

From a business perspective, it also essential to understand how Search Engine Optimization (SEO) is affected by subtitling. Video in itself is obviously not text-based, so any information that informs Google what the video content describes benefits the ranking of the video. Subtitling your video to not just one language but many, therefore, could improve your SEO and visibility. Makes sense?

These are just some of the benefits of making subtitling in preferably more than one language available for your content.

However, for some of you, there are also new regulations to consider. An E.U. directive 2016/2102/EU now states that all member states must include subtitling on all official video information to comply with the U.N. Convention on the Rights of Persons with Disabilities (CRPD). This includes video information from government, schools, and other official organizations, including private companies that delivers information for public viewing.

Similar regulations have been present in the U.S. for many years. The most recent regulation, The 21st Century Communications, and Video Accessibility Act of 2010, states the presence of closed captions on material produced and distributed in the U.S. and can be accessed in the U.S.

Transcribe your content - but how?

Traditionally, transcribing speech to text has been a human task only. With the introduction of the new machine learning algorithms, this is now changing, and we can see how machines and humans can interact and cooperate in this area. Machine learning transcribing software proves more and more accurate, and with today's score at around 80 % or higher depending on the quality of material, the software-based services can offload a lot of initial work that would typically be done by humans only.

So, instead of spending 8 hours on manually transcribing a 1-hour video, you will be able to improve your subtitling distribution workflow by offloading the first 80 % of work to a cognitive automatic subtitling algorithm such as the VCS (Vidispine Cognitive Services) in Vidinet.

With the introduction of VCS, we now take Vidispine API and Vidinet to the next level. The Vidinet Cognitive Services is a core architecture designed to manage cognitive services from a growing number of providers on the market. In this first release of VCS, you will find cognitive services based on the AWS Transcribe libraries.

Vidinet and AWS Speech to Text - a short introduction.

Vidinet is our media supply chain platform where Vidispine customers add and configure different services for their on-premise, cloud, or hybrid environment. In here, you can now access VCS Speech to Text and add this service to your infrastructure - or just your trial account.

Let s take a quick look at a UI and how you can test the VCS Speech to Text functionality.

After uploading your content, choose Analyze to enable the AWS transcription service for your video. Vidinet will provide you with a cost estimate for the service as a basis for your calculations.

When the analysis and transcription have finished, you can easily search and navigate for the results.

The Vidispine UIIt is essential to understand that our Vidispine Development Toolkit (VDT) allows you to design any user interface (UI) that works for your environment. In these examples, we have provided a UI that provides basic functionality for testing the Vidispine API. As you can see, the VCS Speech to Text service provides you with not only a transcription and time-code but also a simple interface for manual adjustment of the auto-generated text.

The Vidispine Development Toolkit (VDT) is free and includes multiple packages

Low-level javascript SDK for front/backend

React wrappers

Prebuilt components using https://material-ui.com/ (react components using Googles material design CSS)

With this brief introduction to the VCS Speech to Text service in Vidinet, it is time for you to test this service for yourself. Remember that the functionality and accuracy of machine learning also algorithms improve over time.

If you are using a transcription service or are working manually with speech to text today, you will most likely benefit from VCS Speech to Text in Vidinet.

Amazon Transcribe Pricing - how much?

When you try out the VCS Speech to Text, you will get an automatic cost estimate based on the amazon transcribe pricing and the source duration for the job you are starting. Use this estimate as a basis for calculating the price for the automation of speech to text in your media supply chain.

Currently, we charge 0,024 USD per content minute, but remember that you only pay when you use the service. You will scale up or pause your media supply chain whenever your business model requires it.

This flexibility is just one of many advantages when building your media supply chain with Vidispine.

Related Articles

Vidinet Cognitive Services

Create intelligent workflows with Vidinet Cognitive Services.

Why we are Going Cognitive

In an interview with Ralf Jansen, you can learn more about Vidinet Cognitive Services, why it is important, and how you can use it.

Webinar: Basics of VidiNet Cognitive Services

This webinar gives insights about our AI strategy and how the integration in the VidiNet ecosystem will work. We also demonstrate the first integrations in acti
LINK: https://www.vidispine.com/resources/blog/vidinet-cognitive-services-aw...
See more stories from vidispine

Most recent headlines

20/02/2025

NAB Show To Feature New Business Of Entertainment Track

WASHINGTON The 2025 NAB Show, April 5-9, at the Las Vegas Convention Center will mark the debut of the Business of Entertainment track developed with The Ankler...

20/02/2025

Local News Veteran Adrienne Roark Joins Tegna as Chief Content Officer

TYSONS, Va. Tegna Inc. has announced that news veteran Adrienne Roark has been named chief content officer reporting to CEO Mike Steib, effective March 31....

20/02/2025

Roku to Become the Streaming Hub of Bassmaster Tournaments

BIRMINGHAM, Ala. Roku has further expanded its sports content with a new media rights deal with Bassmaster that will make Roku the streaming hub for Bassmaster ...

20/02/2025

NABs Curtis LeGeyt Calls for Modernization of Broadcast Ownership Rules

WASHINGTON National Association of Broadcasters (NAB) President and CEO Curtis LeGeyt opened The Media Institute's 2025 Communications Forum series with a s...

20/02/2025

From Storage Struggles to Streamlined Storytelling How Th...

The Belonging Co., a Nashville-based church renowned for its dynamic worship experiences and multimedia-rich conferences, tapped DigitalGlue's creative.spac...

20/02/2025

Mediahuis Radio Chooses DHD Audio Mixers for New Studios...

Mediahuis Radio continues its expansion with the completion of a new production facility in Amsterdam. DHD RX2 and DX2 audio mixers connected to XD3 Cores form ...

20/02/2025

Calrec expands its flexible production model at NAB 2025

At NAB 2025, Calrec is introducing a suite of new interconnected products and updates aiming to help broadcasters meet a variety of challenges. With increased c...

20/02/2025

Keepit achieves exceptional growth in a tough 2024 market

Keepit, the world's only independent vendor of cloud backup and recovery solutions designed to protect SaaS data, today announced a remarkable year of growt...

20/02/2025

Hitomi Broadcast Showcases Enhanced UHD Capabilities at N...

Expanded SMPTE ST 2110 support strengthens lip-sync and latency measurement solutions Hitomi Broadcast, the market leader in audio/video alignment and latency ...

20/02/2025

LiveU Transforms Tabcorp Sky Racing Coverage through Rem...

LiveU, a leader in live IP-video and remote production solutions, today announced the successful implementation of its comprehensive IP remote production soluti...

20/02/2025

MainConcept and Veset Collaborate to Enhance Live TV with...

MainConcept, a leading provider of video and audio codecs, has announced a partnership with cloud playout solutions provider, Veset, to integrate its JPEG XS SD...

20/02/2025

OOONA Launches OnStage - A Free Archive for Media Localiz...

OOONA, a leading provider of professional management and production tools for the media localization industry, announces the launch of On Stage, a free-to-acces...

20/02/2025

Feisty Feminist Murder Mystery He Had It Coming Announced

18 02 2025 - Media release Feisty Feminist Murder Mystery He Had It Coming Announced Stars of He Had It Coming, Lydia West, Natasha Liu Bordizzo and Liv Hewso...

20/02/2025

Screen Australia and Stan Announce New Comedy-Horror Series Gnomes

18 02 2025 - Media release Screen Australia and Stan Announce New Comedy-Horror Series Gnomes Gnomes writers Tegan Higginbotham and Paul Verhoeven, and creato...

20/02/2025

Do I Know You From Somewhere? Shot with URSA Mini Pro 12K

Do I Know You From Somewhere? Shot with URSA Mini Pro 12K Brie Clayton February 19, 2025 0 Comments Canadian indie film uses Blackmagic camera to crea...

20/02/2025

Berklee Ensemble for Musicians with Disabilities Is Stronger for Our Differences

Berklee Ensemble for Musicians with Disabilities Is Stronger for Our Differences Associate Professor Adrian Anantawan, who founded Berklee's Music Inclusi...

19/02/2025

Give Your Playlist Covers a Tyler, The Creator Touch With Our Exclusive Stickers

For more than a decade, Tyler, The Creator has blazed his own trail, exploring unique aesthetics across music, fashion, and art. Now he's bringing his signa...

19/02/2025

10 Billion Streams and Counting: Spotify Singles Celebrates Its Biggest Hits

Spotify Singles, our longest-running original recorded music franchise, has officially surpassed 10 billion collective streams worldwide. That's a whole lot...

19/02/2025

ST Engineering iDirect CEO Nominated for Via Satellite's Satellite Executive of the Year Award 2024

Via Satellite recognizes Don Claussen's leadership in driving innovation to ...

19/02/2025

Calrec Launches True Control 2.0 at 2025 NAB Show

At the 2025 NAB Show, April 6-9 in Las Vegas, Calrec will introduce a suite of new interconnected products and updates aiming to help broadcasters attract more ...

19/02/2025

ThinkAnalytics To Feature Newly Launched ThinkMediaAI At 2025 NAB Show

LONDON and LOS ANGELES ThinkAnalytics will feature its newly launched ThinkMediaAI, a unified artificial intelligence (AI)-powered platform that encompass conte...

19/02/2025

Fubo Launches New Multicultural Content Bundles

NEW YORK FuboTV Inc.has announced plans to launch of multicultural content bundles that will provide U.S. consumers with international programming available in ...

19/02/2025

Spectrum Business Launches New Flexible Packages, Boosts Internet Speeds

STAMFORD, Conn. Charter's Spectrum Business has announced new packages, pricing, improved internet speeds for new business customers and free internet speed...

19/02/2025

C2HR: Broadcast Engineering Among Hottest Jobs in Content Development

A new survey among media HR professionals identifies broadcast operations professionals as among the most sought after jobs in the content development sector....

19/02/2025

Behind the Scenes of The Brutalist: An Oscar-Buzzed Workflow with Signiant Media Shuttle

Behind the Scenes of The Brutalist: An Oscar-Buzzed Workflow with Signiant Media...

19/02/2025

Berklee Ensemble for Disabled Musicians Is Stronger for Our Differences

Berklee Ensemble for Disabled Musicians Is Stronger for Our Differences Associate Professor Adrian Anantawan, who founded Berklee's Music Inclusion Ensemb...

19/02/2025

DNEG makes move into AI with Metaphysic acquisition

Metaphysics AI neural performance toolset, which was used on the Tom Hanks and Robin Wright film Here, was recently honoured at the Visual Effects Society Award...

19/02/2025

SES responds as Moody's downgrades company's outlook

Responding to the ratings realignment, SES provided a market update ahead of its Full Year 2024 Results which will be published on 26th February By Matthew Co...

19/02/2025

Ikegami Electronics Announces US Market Introduction of I...

Ikegami Electronics announces a new addition to its range of broadcast quality television production, control and monitoring equipment. The IPX-100 is an IP gat...

19/02/2025

BCNEXXT Innovates Playout in the Cloud with Amazon Web Se...

BCNEXXT, a leading provider of virtualized, cloud-native systems for Linear, VoD, and OTT publishing, today announced its successful collaboration with Amazon...

19/02/2025

Experience Commerce Secures Social Media Mandate for Inve...

Experience Commerce, a leading full-service digital marketing agency within the Cheil Network, has won the social media mandate for Invecto Technologies Pvt. Lt...

19/02/2025

ThinkAnalytics launches ThinkMediaAI - the first unified...

ThinkAnalytics, the global leader in video content discovery and personalization, today launched ThinkMediaAI, the video industry's first unified AI powered...

19/02/2025

SES Market Update in Context of Moody's Ratings Press Release

Luxembourg, 18 February 2025 -- SES S.A. has taken note of Moody's Ratings Press Release today with regards to SES and is providing a market update ahead of...

19/02/2025

Ikegami Electronics to Introduce IPX-100 IP Gateway at 2025 NAB Show

MAHWAH, N.J. Ikegami Electronics has announced a new addition to its range of broadcast quality television production, control and monitoring equipment for the ...

19/02/2025

Joel Davis Named President & GM of NBCU Local Philadelphia

NEW YORK Veteran local media executive Joel Davis has been named president and general manager of NBCU Local Philadelphia's NBC10 / WCAU, Telemundo62 / WWSI...

19/02/2025

Heather Gray Named New Interim General Manager at WRAL & Fox 50

RALEIGH, N.C. Capitol Broadcasting Company, Inc. (CBC) has announced that longtime local media leader Heather Gray has been named interim general manager of WRA...

19/02/2025

Shure Forms Wireless Microphone Spectrum Alliance

NILES, Ill. Shure Incorporated has announced that it is forming the Wireless Microphone Spectrum Alliance (WMSA), a coalition that will work to ensure access to...

19/02/2025

Lipstick on the Legacy - The Pitfalls of Superficial Medi...

Lipstick on the Legacy: The Pitfalls of Superficial Media Transformation The media and entertainment (M&E) industry has been undergoing digital transformation ...

19/02/2025

Strengthened by Successful 2024 TAG Video Systems Plans t...

In 2024, TAG Video Systems dedicated itself to empowering customers to achieve their goals and deliver truly extraordinary media experiences. For 2025, TAG has ...

19/02/2025

Alfalite VP XR LED screens power Spain largest virtual pr...

Alfalite, the only European manufacturer of LED screens, is proud to announce that its Modularpix Pro VP XR solution is the core technology behind the new Coru ...

19/02/2025

The Ninth Order Completed With URSA Mini Pro 4.6K G2 and Fairlight

The Ninth Order Completed With URSA Mini Pro 4.6K G2 and Fairlight Brie Clayton February 18, 2025 0 Comments Film shown at 20 global film festivals us...

19/02/2025

People Want Ads as Intelligent as their Content

People Want Ads as Intelligent as their Content Andy Marken February 18, 2025 0 Comments In situations like this, carelessness, mistakes they will ha...

19/02/2025

Heather Gray Named Interim General Manager of WRAL-TV and FOX 50

Leadership change at Capitol Broadcasting Company's WRAL-TV/FOX 50 Capitol Broadcasting Company, Inc. (CBC) today announced that longtime local media lea...

19/02/2025

Temenos' Barb Morgan Shares How Chatbots and AI Agents Are Reshaping Customer Service in Banking

In financial services, AI has traditionally been used primarily for fraud detect...

19/02/2025

XRANGE and Mira Aerospace Partner to Advance High-Altitude Platform Station (HAPS) Flight Testing

Collaboration enables long-term testing and evaluation support for HAPS platform...

19/02/2025

Mercedes-Benz Stadium Overhauls Production Ecosystem With Upgraded Control Rooms, New Integrated Workflows

Mercedes-Benz Stadium Overhauls Production Ecosystem With Upgraded Control Rooms...

19/02/2025

Season two of the Sky Exclusive drama series The Last of Us debuts 14 April

Season two of the Sky Exclusive drama series The Last of Us debuts 14 AprilOfficial Teaser Posters ReleasedWednesday 19 February 2025 The seven-episode second ...

19/02/2025

Elevate your networks IQ: ipoques AI-driven DPI technology unveiled

Elevate your networks IQ: ipoques AI-driven DPI technology unveiled ipoque, a Rohde & Schwarz company, showcases its groundbreaking Encrypted Traffic Intellig...

19/02/2025

Kyocera and Rohde & Schwarz join forces to demonstrate OTA characterization of mmWave PAAM at MWC 2025

Kyocera and Rohde & Schwarz join forces to demonstrate OTA characterization of m...

19/02/2025

Taiwanese Rom-Com Series I am MarriedBut!' Wins Hearts with Its Realistic Take on Marriage

Back to All News Taiwanese Rom-Com Series I am Married But!' Wins Hearts w...