
SOEN 6481 Software Systems Requirements Specification (Winter 2015/16)

Worksheet #6

Automatic Traceability Link Recovery based on the Vector Space Model (VSM) and Cosine Similarity

You just started a new job where you inherited a few hundred thousand lines of code, together with
thousands of pages of (possibly) relevant documents, including requirements, domain information, design
descriptions, and user guides. Now it's your job to figure out how these artifacts are connected, so that you
can understand the system and make changes while keeping all artifacts consistent.
Fortunately, you just learned how to automatically create traceability links between software artifacts using
the vector space model (VSM).

The Input Artifacts. You start with two requirements documents (r1, r2) and one source code file (s1):
r1 = The server has a database.
r2 = The client has encryption and the server has encryption.
s1 = // Server encryption.
Your goal here is to automatically create vertical traceability links: from source code to requirements.

Step 1: Tokenize each file and remove stopwords. The first step is to break up the artifacts into
individual tokens (tokenization), separating tokens by whitespace, ignoring punctuation marks, and converting
all tokens to lower case. Then, remove all stopwords (this includes words like "the", "a", "is", "be", "has",
"and", "or", ...).

The resulting lists of tokens for the artifacts are:

r1 =

r2 =

s1 =
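
If you want to check your token lists mechanically, here is a minimal Python sketch of Step 1 (the
stopword list contains just the words named above; a real pipeline would use a much larger list, and
possibly stemming):

    import re

    # Minimal stopword list for this exercise; real tools use larger lists.
    STOPWORDS = {"the", "a", "is", "be", "has", "and", "or"}

    def tokenize(text):
        # Lowercase, keep alphabetic tokens only (drops punctuation),
        # then remove stopwords.
        return [t for t in re.findall(r"[a-z]+", text.lower())
                if t not in STOPWORDS]

    r1 = tokenize("The server has a database.")
    r2 = tokenize("The client has encryption and the server has encryption.")
    s1 = tokenize("// Server encryption.")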

Step 2: Compute the document vectors and normalize them. Here, we want to find out which requirements
documents should be linked from the source code file. For this, you first need to compute the vectors
for the query (the source code file) and the two requirements documents.

Fill in the empty values in the table below, using these definitions:

tf: term frequency
df: document frequency
N: number of documents

$\mathit{idf} = \log_{10} \frac{N}{\mathit{df}}$

$\mathit{tf.idf} = \mathit{tf} \cdot \mathit{idf}$ (i.e., no log weighting for tf)

Assume N = 10,000,000. Each $q_i$ is the normalized tf.idf weight for a query word, and the $d_i$ are
the normalized tf weights in the document.¹ To normalize a vector, you have to (1) compute its length
$\|\vec{v}\| = \sqrt{x_1^2 + \dots + x_n^2}$, then (2) divide each element by the length: $x_i / \|\vec{v}\|$.
Here, you end up with 4-dimensional vectors:

             |             query (s1)             |   r1    |   r2
 token       | tf |   df    | idf | tf.idf |  qi  | tf | di | tf | di
-------------+----+---------+-----+--------+------+----+----+----+----
 server      |    |  50,000 |     |        |      |    |    |    |
 database    |    |  10,000 |     |        |      |    |    |    |
 client      |    | 100,000 |     |        |      |    |    |    |
 encryption  |    |  10,000 |     |        |      |    |    |    |
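
As a worked example for one cell: idf(server) = log10(10,000,000 / 50,000) = log10(200) ≈ 2.30.
Continuing the sketch from Step 1 (the variable and function names here are my own, not part of the
worksheet), the normalized vectors can be computed as follows:

    import math

    N = 10_000_000  # number of documents, as given above
    df = {"server": 50_000, "database": 10_000,
          "client": 100_000, "encryption": 10_000}
    vocab = ["server", "database", "client", "encryption"]  # the 4 dimensions

    def normalize(v):
        # Divide each component by the vector's Euclidean length.
        length = math.sqrt(sum(x * x for x in v))
        return [x / length for x in v] if length else v

    def query_vector(tokens):
        # tf.idf weights for the query, then length-normalized (the q_i).
        return normalize([tokens.count(t) * math.log10(N / df[t])
                          for t in vocab])

    def doc_vector(tokens):
        # Plain tf weights for a document, then length-normalized (the d_i).
        return normalize([tokens.count(t) for t in vocab])

    q  = query_vector(s1)
    d1 = doc_vector(r1)
    d2 = doc_vector(r2)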

Step 3: Compute the similarity between the query vector and the other artifact vectors. Now compute
the cosine similarity between the vector for the query (source code $s_1$) and each of the requirements
documents ($r_1$, $r_2$). Since the vectors are already normalized, this is simply their dot product:
$\cos(\vec{q}, \vec{d}\,) = \vec{q} \cdot \vec{d} = \sum_i q_i d_i$:

sim($\vec{s}_1$, $\vec{r}_1$) = cos($\vec{s}_1$, $\vec{r}_1$) =

sim($\vec{s}_1$, $\vec{r}_2$) = cos($\vec{s}_1$, $\vec{r}_2$) =
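
Continuing the sketch: because the vectors are already unit length, the cosine is just the dot product.

    def cosine(u, v):
        # Dot product; equals cosine similarity for unit-length vectors.
        return sum(a * b for a, b in zip(u, v))

    sim_r1 = cosine(q, d1)
    sim_r2 = cosine(q, d2)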

Step 4: Filter links by similarity. Now we have to filter the results. Here, we apply filtering by similarity:
only artifacts with a cosine similarity above 80% are linked. Show the resulting traceability link(s):

links =
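
In code, the filtering step might look like this (reusing sim_r1 and sim_r2 from the Step 3 sketch):

    THRESHOLD = 0.8  # "above 80%"

    candidates = {"r1": sim_r1, "r2": sim_r2}
    links = {doc for doc, sim in candidates.items() if sim > THRESHOLD}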

So now you know which of the requirements documents to consult for more information on the given
source code file, and which to update when it changes!
¹ That means you only have to do tf.idf weighting for the query, not for the documents. This is done
only to reduce the hand calculations in this exercise; in a real implementation, you do tf.idf weighting
for all terms.
