Commons:Timed Text
Media community: Audio and video requests · Featured media (candidates) · Media help · Media of the Day · Timed Text · Video info · Video2commons–Upload · Video cut tool
For other uses, you may be looking for Commons:File captions.
TimedText is a custom Wikimedia Commons namespace to hold closed captioning text, or subtitles, to be associated with other media, such as audio or video files. This page intends to explain the feature's concept and use.
Closed captioning (CC) and subtitling are both processes of displaying text on a television, video screen, or other visual display to provide additional or interpretive information. Both are typically used as a transcription of the audio portion of a program as it occurs (either verbatim or in edited form), sometimes including descriptions of non-speech elements. This aids hearing-impaired and deaf people and provides a way for non-native language speakers to understand the content in a multimedia file.
Using
Also see Commons:Video#Subtitles and closed captioning.
Thumbnails of videos and audio clips that have closed captioning available will show the CC icon overlayed. After opening the player, subtitles in your language are automatically enabled. You can find the icon in the controls of the player to switch between languages, toggling subtitles on and off, or to change the formatting of the subtitles.
Timed Text can be used for any media that is presented in a time sequence:
- Audio file
- Silent video
- Spoken video
- Animation demonstrating a concept or how something works
Actual examples
- Commons:Timed Text Demo Page a page highlighting a few timed-text examples.
- TimedText:Krazy Kat Bugologist 1916 silent.ogv.de.srt, German captions
- TimedText:Krazy Kat Bugologist 1916 silent.ogv.en.srt, English captions
- TimedText:Wikipedia Edit2014.ogv.pl.srt redirected to TimedText:Wikipedia Edit 2014.webm.pl.srt
Finding
TimedText:
prefix, add the text after it, e.g. TimedText:Elephants_Dream.ogv
).TimedText:Elephants_Dream.ogv.en.srt
) to create a TimedText page. see Commons:Timed Text{{Allpages|102}}
is rendered as and lists all pages in namespace 102.
Commons needs a means to find Timed Text files for specific languages; the following suffer from the Search function's limitations (such as: it does not show all matches; it includes non-matches; it needs regular expression support). Search including some Timed Text .srt files in different languages:
English • German • French • Portuguese • Russian • Swedish • Ukrainian • Polish • Indonesian
Other methods to help user find Timed Text:
- {{Closed captions}} displays links to all the closed captioning files available for a file, can be placed on a media page and its talk page.
{{special|Prefixindex/TimedText:{{PAGENAME}}.|stripprefix|1|subtitles}}
yields a link to all related Timed Text files (example).- Commons:Timed Text/search by lang displays search links for all Timed Text files for a given language, useful for Commons pages, Categories and Talk pages.
Marking and Finding videos that need subtitles
The template {{Captions requested}} can be used to mark that a video needs caption. The template add it to the category Videos needing subtitles, so one can see which videos, users or authors have requested transcripts.
This template and category is in the scope of Commons:WikiProject Deaf and its sisters meta:Deaf Wikimedians and Wikipedia:WikiProject Deaf.
Finding videos that need subtitles translation
One way to find such videos, is to open one of the subcategories of Category:Files with closed captioning depending on the preferred starting language, and then to use Help:FastCCI (on the top right of the page) to include only the videos that don't have subtitles for your preferred target language.
Example
- To find videos with subtitles in English to translate them, go to Category:Files with closed captioning in English.
- Then, click on the FastCCI arrow to open the sub-menu and select "In this category but not in..."
- In the textbox, enter the corresponding category depending of your preferred target language:
- For German, enter
Files with closed captioning in German
- For French, enter
Files with closed captioning in French
- For Russian, enter
Files with closed captioning in Russian
- For German, enter
etc..
Timed Text talk
The
namespace is for discussing the respective Timed Text pages, but it could also be used to link and categorize the Timed Text page.Maintenance tasks
- Patrol changes in the TimedText namespace: RecentChanges
- Find orphaned timed text pages that have no associated media file (any longer).
Uploading
To upload an already created subtitle file, open the file on your computer in a text editor (such as Notepad) and copy the text into a new page in the TimedText namespace that matches the filename of the video and the language code.
Creating
Commons uses the SubRip (.srt) file format for closed captioning and subtitles. You can create these files in multiple ways.
Create subtitles page for existing Commons files
Option 1: in the Commons page of the file (recommended)
You can use the "TimedText" link at the top of any suitable multimedia file on Commons.
Option 2: directly in the media player
By using the CC button in the toolbar of the Wikimedia HTML5 media player, you can select subtitles if they are available, or open the Subtitles editor to create subtitles for the video.
Option 3: creating a blank page (for advanced users)
You can always directly create the page in Commons using the template TimedText:[Common_File_Name.extension].[language].srt, where [Common_File_Name.extension] is the name of the file, and [language] is the ISO code for the language.
Example: to add subtitles to Elephants_Dream.ogg
, you can create the page TimedText:Elephants_Dream.ogg.en.srt
for english subtitles, or TimedText:Elephants_Dream.ogg.fr.srt
for french subtitles.
Extracting existing subtitles to import them
Create Subtitles from DVD
To copy existing subtitles from a DVD you can use software such as SubRip. You can then copy-paste them in the wiki Commons subtitle page.
Create Subtitles with YouTube
YouTube allows users with a YouTube account to create subtitles out of any uploaded file. Keep in mind the speech recognition is automated and produces unexpected results. It is preferable to upload a transcript of the file to YouTube. This will provide a much better result. You can then copy-paste them in the wiki Commons subtitle page.
Steps to create the subtitles (a video tutorial of the steps can be found here):
- Upload the file. (The multimedia file must also include a video track but you are free to choose a blank one or any other)
- While uploading set the Video language for your file to the appropriate language under the "Show more" menu.
- Or, after uploading, select "Subtitles" in the specific videos Details or in the YouTube Studio navigation.
- Click on "Add" or "Add language".
- You can add subtitles in one of three ways:
- Upload a transcript in the proper format.
- Copy and paste the transcript.
- Type manually while watching the video.
- The captions are then integrated into the video.
- Download the .sbv file from the Subtitles menu under the three dot menu while in the "Edit Timings" view.
- Convert the contents of the .sbv file into .srt file. There are various online tools to help with this step.
- ffmpeg is one open-source option (directions).
- Upload the .srt file to the corresponding page of the video on Wikimedia Commons.
Convert YouTube Subtitles to Timed Text format
SBV Subtitles
If you export the SBV format from YouTube subtitles you can use ffmpeg to convert the subtile file to the SRT (SubRip) format used by Commons. This feature also solves the overlap issue that is common when converting YouTube subtitles to Commons.
ffmpeg -fix_sub_duration -i input.sbv output.srt
XML Subtitles
This section describes how to convert XML YouTube subtitles to SubRip (srt) format, that is TimedText subtitles format used in Wikimedia Commons.
If
- the YouTube video has subtitles in some language (e.g. I created this YouTube video with subtitles in English, in Russian and in Livvi-Karelian languages),
- this video was uploaded to Wikimedia Commons (e.g. this file),
- you want to copy YouTube subtitles to the same video at Commons.
Then:
- Download the subtitle in XML, put the ID of the YouTube video at the end of the URL: http://video.google.com/timedtext?hl=en&lang=en&v=__youtube_video_ID__
- Install Ruby.
- Download a Ruby program to convert video subtitles from YouTube's XML format to the SubRip format.
- Run this program and convert XML file to .SRT file.
- Copy and paste the contents of the .SRT file into the corresponding page of the video on Wikimedia Commons.
General tips
Noise, etc.
Keep in mind to paraphrase unspoken sounds and surround them with rounded brackets, e.g.
1 00:00:20,000 --> 00:00:24,400 (engine sound)
Music
Music should be surrounded by the ♪ character, Unicode U+9834, or Alt+266A. You can also use ♫ Unicode U+9835, or Alt+266B, e.g.
1 00:00:20,000 --> 00:00:24,400 ♪ rock music playing in the jukebox ♪ ♫ she's singing ♫
Markup
The only supported markup of the SRT format is
- Bold – <b> ... </b>
- Italic – <i> ... </i>
- Underline – <u> ... </u>
REMINDER: Wikicode formatting is not supported.
Internationalization
After the subtitles have been transcribed in the original language of the video onto a Timed Text file, they can be translated into other languages as follows:
- Open the Timed Text file in the original language, say English for example TimedText:Elephants Dream.ogv.en.srt, in edit mode and copy the whole of the page.
- In the address bar replace "en" with the language code of your choice, say "fr", then paste the original text in the new page.
- View the original video, then translate the text into your language.
- After saving the new page, the video with the subtitles should load onto the page; you can view it to check the timing of the subtitles.
- Add a category link to the talk page [[Category:Timed Text in Language Name|Language Name]]. For example, see TimedText talk:Elephants Dream.ogv.fr.srt.
Wikipedia articles about the topics of Timed Text or subtitles
These are articles about either Q844253: Timed text, or Q204028: subtitle.
- Dansk: Undertekster
- Deutsch: Untertitelung
- Ελληνικά: Υπότιτλοι
- English: Timed Text is also termed subtitles, closed captioning and closed caption text. See also Subtitle (captioning).
- Esperanto: Subtekstoj
- Español: Subtítulo
- Français : sous-titrage
- Interlingua: Subtitulos
- Italiano: Sottotitolo
- 日本語: 字幕
- 한국어: 자막
- Македонски: Толкување
- Nederlands: Ondertiteling
- Norsk bokmål: Undertekster
- Português: Legenda
- Русский: Субтитры
- Slovenščina: podnaslovi
- Svenska: Textning
- Українська: Субтитри
- 粵語: 字幕
- 中文:字幕
- Bahasa Indonesia: Teks Berwaktu
Linking
This section needs expansion.
How to associate closed captions with multimedia files?
- Redirect to avoid duplicated content, for example TimedText:Elephants Dream (high quality).ogv.pt.srt redirects to the existing TimedText:Elephants Dream.ogv.pt.srt. This ensures the closed captions template displays the correct file name of the caption files (this could be important with movie clips).
- {{Closed captions}}'s parameter is an alternative
- more support is needed for the Timed Text function;
- Categorizing: Not possible to categorize the Timed Text page itself, but the Timed Text Talk can be.
A possible categorization scheme is:
[[:Category:File formats]] + [[:Category:Media types]] | [[:Category:Timed Text]] + [[:Category:Legend in German]] | [[:Category:Timed Text in German]] + [[:Category:Legend in French]] | [[:Category:Timed Text in French]] + [[:Category:Legend in English]] | [[:Category:Timed Text in English]]
Related categories: Category:Files with closed captioning
See also
- {{Captions requested}}
- SubRip
- Help:Namespaces lists all Commons namespaces
- Category:Video, base category for media about video
- Category:Videos, base category for video files
- National Captioning Institute (NCI).
- The W3C Timed Text homepage
- Captions For Deaf and Hard-of-Hearing Viewers, National Institute on Deafness and Other Communication Disorders (NIDCD).