Documente Academic
Documente Profesional
Documente Cultură
- WHAT IS SUBRIP?
- LICENSE AGREEMENT
- WARNING
- OPERATING INSTRUCTIONS
I
- The SubTitles ripping
II - OCR Process
III - Text correction and conversion
IV - Split output text file
V
- Time correction
VI - Post OCR correction
VII - Font Styles
VIII - Global Options
IX - Ripping SubTitles from video files
5 - OUTPUT FORMATS NOTES
6 - FAQ
7 - RELATED LiNKS
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*~~~~~~~ WHAT IS SUBRIP? ~~~~~~~
- SubRip is a program that allows you to rip DVD subtitles with
their timing as a text file.
- The goal is to allow you to translate the subtitles in
other languages, or just use them as they are.
- You can also rip subtitles as BMP in the goal of putting them
on a SVCD (for viewing on a stand-alone DVD player) or on a
homemade DVD.
- SubRip use an OCR algorithm and I (Brain) spend a lot of time and
nights (and Dub's a bit too) coding it for you :) but I learned
a lot of things with this experience.
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*~~~~~~~ LICENSE AGREEMENT ~~~~~~~
SubRip is FREEWARE (with sources released under GNU GPL). That means
it can be free distributed, copied, burned to CD etc., but only as
is in this installation package. It cannot be distributed as part
of any other software (without permission), none part of SubRip can
You can try to click the "Search for an existing Matrix that match
for this Character" button, then to case:
- A Matrix containing the same font is found:
This will avoid you to re-enter all the characters.
Accept to open, complete it if some character are missed then
resave the matrix file.
- Not any Matrix is found:
Try on the next Character after typing current one, if still
not found then do the following:
Enter the character(s) you see in the red box. This process will
not take-up too much time on most DVDs. Try to be as accurate as
possible, for this will result in a better text file.
If you make an error typing the character, click the "Pause"
button, then go to "Characters Matrix" menu and click "Edit ..."
Here you are able to correct your error, but you must restart the
OCR process at the first VOB to get all of the text recognized right.
If SubRip already has a character that it thinks is close to the
current one, it will display it on top of the window, as the "best
guess". To use it (including the font style), just press the "Use"
button. A new character will be inserted in the character matrix,
just as if you typed it manually. You can also have the best guess
filled for you by pressing Ctrl-Enter.
3) Wait until the OCR process has finished. In case some rare characters
appear in the middle of the DVD the PC will beep to warn you.
4) After the OCR process has finished save the Characters Matrix File,
it can be used for other DVD that have same subtitles font -> Select
"Save Character Matrix As" from the "Character Matrix" menu and put
it in the proposed directory ('ChMatrix' subdir) to enable Matrix
auto detection feature.
5) If you think you recognize the font that the subtitles use, you can
try to generate all the characters automatically, instead of typing
them one by one. Press the "Fill Matrix From Text" button. In the
window that appears, first type one of the lines of text in the
image, then use the buttons to adjust the text position, size and
angle so that it overlaps exactly with the text in the image. Try
changing the font size and style to improve the match (if you select
italics or bold, the styles will also show up in the character
matrix). When done, press "Done" to continue. If the match is not
perfect, or the OCR tolerance is too high, SubRip may not recognize
the newly-inserted characters, and just show you a best guess. If it
matches, press the "Use" button, and the current character will be
introduced in the character matrix.
III - Text correction and conversion
1) Go to the Subtitles text window, click the spelling correction button
or select "Post OCR Spelling Correction" in the "Corrections" menu.
If "l" is confused with "I" (They both look the same for the OCR),
check the corresponding box and click "Correct This". This action
will correct most of the errors.
2) Click the save as button or select "Save as" from the "File" menu to
save the text in SubRip's own format (this will allow you to re-open
and edit it later).
3) Click the output format button or select "Set Output Format" in the
"Output Format" menu and choose the format you want.
IV - Split output text file
1) If you want to divide the original subtitle file into smaller parts
to put the movie on multiple CDs, you can use this function:
1- Click the split text file button or choose "Split output
file" from the "Output Format" menu.
2- Check the "Split output File" box.
3- Enter the Start and Stop time for the current file.
4- Click "OK".
5- Save file As "Name_xx.???" Where xx is the number of the CD
and ??? is the extension depending on the chosen format.
6- Repeat step 1 to 5 for each file. (Use the arrow-button to
set the last Stop time as Start time).
Important Remarks:
- The unselected part of the file will be kept until you open
another file or quit SubRip. It's recommended to first save
the full file in SRT format in case you need to modify it late
r.
- The time codes will be corrected depending on the start time
(Start time will be deducted from all time codes in the file).
V - Time Correction
1) After a subtitle playback, you may want to change the start time
(time of the first subtitle) or the speed (global time).
!!! Never believe the time at DVD player displays !!! In most
cases their timing is wrong (e.g.: 8s/2h slower) especially on R1 DVD
s.
Use a chronometer to verify it yourself!
2) If there is really a time issue, launch SubRip and re-open the SRT
file. Go to time correction and do the necessary...
VI - Post OCR correction
* Try to correct punctuation (?. -> ? ; !. -> ! etc.) and
Try to correct "Capital Letters"
- do not use it for right-to-left languages
- MJQ info: " 'Correct Capital Letters' should be used together
with 'Correct Punctuation' "
- some examples to see how it works:
BEFORE
0:00:18:... test
0:00:20:Test ...
0:00:22:Test...test
0:01:22:Test...test ...
0:07:32:- ...test|-... test|-...test
0:22:33:Test?... test...
0:44:44:test?...test...
0:49:44:test ... test
0:59:44:-test.
1:01:01:Test test. -test.
AFTER
...test
Test...
Test... test
Test... test...
...test|...test|...test
Test? ...test...
test? ...test...
test... test
- Test.
Test test. - Test.
1:02:01:Test
1:03:01:Test
1:03:01:Test
2:22:22:Test
2:22:32:Test
test...-test.
test, - test
test!?Test - test
test.test.
test. 12.32.
Test
Test
Test
Test
Test
test... - Test.
test. - Test
test!? Test-test
test. Test.
test. 12.32.
STL
the VOBs next time, only the required packets are read. That will
result significantly faster subtitles processing. After processing
of all your wanted subtitle streams, you can delete the .srm files.
Line Breaks
Virtually all software which deals with text-type subtitles (including most
Windows software) can handle Unix line breaks in subtitle files, therefor
"Unix" is the recommended default because it creates slightly smaller files.
Unicode Options
These specify the default encoding for all text-type subtitle output files,
except XML (currently only .ttxt) files which are always saved in plain 7-bit
ASCII format since they already use XML encoding for all non-ASCII characters.
Earlier SubRip versions used to always ask if saving as Unicode is desired if
the text contains any non-ASCII characters. This behaviour can still be
restored by selecting "no (ask if appropriate)" under "Save as Unicode", but
it would be easier to just select "always" and "UTF-8" for a common and
unified output format for all languages.
Also, UTF-16LE is no longer the default Unicode format because most people
prefer UTF-8 these days (UTF-16LE was only added here for historic reasons).
IX - Ripping SubTitles from video files
Some video files have SubTitles "burned into them". SubRip can be used
to extract subtitles from video files as follows.
1) Open the video file by selecting "Open Hard Subbed Video Files" from
the "File" menu. SubRip will try to open any file that MediaPlayer can
play, but it can only detect the frame rate of .avi files.
2) The movie will start playing in the video window. Wait until you see
a subtitle, or use the track bar or the edit box to advance to another
frame. Press the "Pause" button to stop the movie from playing when you
see a subtitle.
3) In the video window, click INSIDE a character (the white area) so
that SubRip can detect the subtitle color and width. Look in the panel
to confirm: the detected colors should match what you see in the video
window. If not, click again. The "text line width" box should show
values between 1 and 5. Anything larger would probably mean that
detection was unsuccessful. If after several tries, the detection still
does not seem to work, try lowering the "text color tolerance".
4) In the video window, right-click and drag to define the region inside
which the detection takes place. The smaller the region, the faster it
will work. The region should completely enclose the subtitles, and have
a generous margin of at least the width of the text line and outline
combined. Be careful for cases when there are more lines in the
subtitle. You can always stop processing and redefine the region by
right-clicking and dragging.
5) Check the "View" check box. In the main window, a rectangle the size
of the selected region should show up, with the subtitles in white and
the outline in red. If the subtitles have outlines (they mostly do),
also check the "Use" check box.
7) If the subtitles do not show up properly (the lines are too thin),
try playing with the text line and outline widths and increasing the
tolerances. Ideally, even on a bright background, you should only see
the text and the outline. If large bright areas also show up, try
checking the "Try to fill open areas" and "Try to fill large areas"
check boxes. Do not use them unless you have to, they slow down the
recognition process.
8) When you are satisfied, press the "Rewind" button to go to the start
of the movie, then the "Run" button to start the OCR process. The OCR
will be similar to what you see when ripping SubTitles from DVDs. If at
any time you need to change any parameters, press the "Pause/Abort"
button, change them, then press the "Continue" button in the main
window.
9) If you want to change the last subtitle, press the "Back" button to
go the first frame of that subtitle in the movie.
See the guide at http://zuggy.wz.cz/ for more details.
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*~~~~~~~ OUTPUT FORMATS NOTES ~~~~~~~
Important notice: Set and configurate output format before processing
the DVD subtitles. It's important mainly when changing from BMP to
TXT output or vice-versa.
--- BMP Outputs --Friendly info: Almost all BMP outputs are made on request. Most
of the developers don't author DVDs ATM and don't own any authoring
software. The outputs are done according to user guides or infos
from requestors. That's why bugs appear. If you find one and
discover the reason, please, inform us. Be sure, we'll repair it.
FreQuent PrOblem #1 - NTSC sync:
- Problem: "Whenever I downsample a DVD9 movie, I have a M2V file and I
use pulldown to make another M2V with which to import subtitles
correctly. This syncs them just fine, but I cannot use the same method
for DVD5's. SubRip's subtitles just won't sync when I'm using an
un-transcoded M2V stream."
- Solution - Excerption from PULLDOWN v0.99d readme:
"The drop_frame_flag is not used during playback; however, I have
discovered that DVDMaestro and Scenarist use this flag to synchronize
subtitles in 29.97fps streams. If you set this flag to true then you
can mux your subtitle streams from MaestroSBT or ScriptConverter
without problems. This flag is required for proper subtitle sync on
progressive, hybrid or interlaced 29.97fps streams. PAL, of course,
does not have this problem."
- Example usage for a 29.97 int or hybrid source stream:
pulldown source.m2v target.m2v -nopulldown -drop_frame true
Drop Frame (NTSC only): SubRip adds NON_DROP or DROP flag to header
of output file. When checked, then in addition some timecode
corrections are done (info from Brain: For NTSC Dropframe
timecodes, the following timecodes do not exist: Frame 0 and 1 of
each minute, except for 0, 10, 20, 30, 40 and 50 minutes.).
Nothing more, so no FPS change or anything else!
\
/
submagic|at|netcourrier.com
\
/_____________________________________________________________\
(_
_)
_) ()
BRAIN
() (_
(_______________________________________________________________)
\
/
\
http://www.subrip.fr.st/
/
\_________________________________________________________/