"The price of greatness is responsibility." Sir Winston Churchill


Search the IBPA



Top Menu

Menu Sidebar

IBPA Issues
About IBPA
IBPA Constitution
FAQ-s
IBPA Events
Individual Membership
Institutional Membership
IBPA Forums / Groups
Cooperation with IBPA
Links

Publications
IBPA Careers Newsletter
Past Issues
Industry Publications
Promote Yourself within the Industry
Submit Your Article

Career Center: Employers
Job Posting
Free Resume Database
Volunteers Database

Career Center: Job Seekers
Now Hiring
Submit Resume
Career Training
Nurses Careers in Biopharm
Scholarship Programs
Internship Programs
Resume Editing & Interview Coaching
Volunteer for the Industry
Download IBPA Career Info Brochure

Industry Directories and Listings
Pharmaceutical Companies
Contract Research Organizations
Professional Associations
Recruiters and Staffing Agencies
Clinical Research Centers
Consulting Companies
Education & Training Institutions
Jobs and Resume Searching Directories
Research and Development Companies
Industry Service Providers
List Your Company

Investor's Center
Offers
Calls

Contact IBPA
USAChapter
Canadian Chapter
European Chapter
Asian Chapter

Start Your Career in Biotech with IBPA Scholarship Programs
Untitled Document



Subscribe to our "Careers in the Biopharmaceutical Industry" newsletter:

Name*:

Email*:

City:

Country:

Phone:

To unsubscribe, click here

 

 

Needleman-Wunsch algorithm

From Wikipedia, the free encyclopedia.

 

The Needleman-Wunsch algorithm performs a global alignment on two sequences (called A and B here). It is commonly used in bioinformatics to align protein or nucleotide sequences. The algorithm was proposed in 1970 by Saul Needleman and Christian Wunsch in their paper A general method applicable to the search for similarities in the amino acid sequence of two proteins, J Mol Biol. 48(3):443-53.

The Needleman-Wunsch algorithm is an example a of dynamic programming, and is guaranteed to find the alignment with the maximum score. Needleman-Wunsch is the first instance of dynamic programming being applied to biological sequence comparison.

Scores for aligned characters are specified by a similarity matrix. Here, S(i,j) is the similarity of characters i and j. It uses a linear gap penalty, here called d.

For example, if the similarity matrix was

- A G C T
A 10 -1 -3 -4
G -1 7 -5 -3
C -3 -5 9 0
T -4 -3 0 8

then the alignment:

  AGACTAGTTAC

  CGA---GACGT

with a gap penalty of -5, would have the following score...

  S(A,C) + S(G,G) + S(A,A) + 3\times d + S(G,G) + S(T,A) + S(T,C) + S(A,G) + S(C,T)

  = -3 + 7 + 10 - 3\times 5 + 7 + -4 + 0 + -1 + 0 = 1

To find the alignment with the highest score, a two-dimensional array (or matrix) is allocated. This matrix is often called the F matrix, and its (i,j)th entry is often denoted Fij There is one column for each character in sequence A, and one row for each character in sequence B. Thus, if we are aligning sequences of sizes n and m, the amount of memory used by the algorithm is in O(nm). (However, there is a modified version of the algorithm which uses only O(m + n) space, at the cost of a higher running time. This modification is in fact a general technique which applies to many dynamic programming algorithms; this method was introduced in Hirschberg's algorithm for solving the longest common subsequence problem.)

As the algorithm progresses, the Fij will be assigned to be the optimal score for the alignment of the first i characters in A and the first j characters in B. The principle of optimality is then applied as follows.

  Basis:

  F11 = 0

  F1j = 0

  Fi1 = 0

  Recursion, based on the principle of optimality:

  Fij = max(Fi − 1,j − 1 + S(Ai − 1,Bj − 1),Fi,j − 1 + d,Fi − 1,j + d)

The pseudo-code for the algorithm to compute the F matrix therefore looks like this (array indexes start at 0):

  for i=0 to length(A)-1

    F(i,0) <- 0

  for j=0 to length(B)-1

    F(0,j) <- 0

  for i=1 to length(A)-1

    for j = 1 to length(B)-1

    {

      Choice1 <- F(i-1,j-1) + S(A(i-1), B(j-1))

      Choice2 <- F(i-1, j) - d

      Choice3 <- F(i, j-1) - d

      F(i,j) <- max(Choice1, Choice2, Choice3)

    }

Once the F matrix is computed, note that the bottom right hand corner of the matrix is the maximum score for any alignments. To compute which alignment actually gives this score, you can start from the bottom left cell, and compare the value with the three possible sources(Choice1, Choice2, and Choice3 above) to see which it came from. If it was Choice1, then A(i) and B(i) are aligned, if it was Choice2 then A(i) is aligned with a gap, and if it was Choice3, then B(i) is aligned with a gap.

  AlignmentA <- ""

  AlignmentB <- ""

  i <- length(A) - 1

  j <- length(B) - 1

  while (i > 0 AND j > 0)

  {

    Score <- F(i,j)

    ScoreDiag <- F(i - 1, j - 1)

    ScoreUp <- F(i, j - 1)

    ScoreLeft <- F(i - 1, j)

    if (Score - S(A(i), B(j)) == ScoreDiag)

    {

      AlignmentA <- A(i) + AlignmentA

      AlignmentB <- B(j) + AlignmentB

      i <- i - 1

      j <- j - 1

    }

    else if (Score == ScoreLeft - d)

    {

      AlignmentA <- A(i) + AlignmentA

      AlignmentB <- "-" + AlignmentB

      i <- i - 1

    }

    otherwise (Score == ScoreUp - d)

    {

      AlignmentA <- "-" + AlignmentA

      AlignmentB <- B(j) + AlignmentB

      j <- j - 1

    }

  }

  while (i >= 0)

  {

    AlignmentA <- A(i) + AlignmentA

    AlignmentB <- "-" + AlignmentB

    i <- i - 1

  }

  while (j >= 0)

  {

    AlignmentA <- "-" + AlignmentA

    AlignmentB <- B(j) + AlignmentB

    j <- j - 1

  }



External links




Learn More About the Biopharmaceutical Industry and Clinical Research:


Category:

Logo sidebar
  • Analytical Chemistry
  • Bioinformatics
  • Biology
  • Biochemistry
  • Biotechnology
  • Biotechnology Companies
  • Cell Imaging
  • Chemistry
  • Chemists
  • Crystallography
  • Ecology
  • Environmentalism
  • Genetic Engineering
  • Genetically Modified Organisms
  • Genetics
  • Health
  • Health Care
  • Health Sciences
  • Medical Specialities
  • Medicine
  • Molecular Genetics
  • Pharmaceutical Industry
  • Pharmacy
  • Pharmacology

  • Powered by Wikipedia, the free encyclopedia. Articles were developed by IBPA volunteers.

    Logo sidebar

    A

    B

    C

    D

    E

    F

    G

    I

    K

    L

    M

    N

    P

    Q

    R

    S

    T


    Logo sidebar


    IBPA Sponsors and Active Supporters

    http://www.payoneer.com/
    Access Clinical Trials

    Access Clinical Trials
    Access Clinical Trials


    Allied Research International
    Allied Research International

    Altaspera Global Services Inc.
    Altaspera Global Services

    Financial Planning and Personal Insurance
    For Canadian Pharmaceutical Industry Executives


    Biorole Scientific Solutions
    Biorole Scientific Solutions

    CEREPROTEC INC. Development of Novel Neuroprotective Drugs
    CEREPROTEC INC. Development of Novel Neuroprotective Drugs

    Recruitment Advertising Agencies
    Recruitment Advertising Agencies

    Cellular Technology Ltd.
    Cellular Technology Ltd.

    Clinical Trial Network
    Free Database of Clinical Investigators

    ClinQua Clinical Trials Inc.
    ClinQua Clinical Trials Inc.

    Coronis Clinical Research Organization
    Coronis Clinical Research Organization

    CPIC Latin America
    CPIC Latin America

    Espoir Bridge Recruiters
    Espoir Bridge Recruiters

    Genentech
    Genentech

    ILS SA
    Independent Research and Laboratory Solutions

    Inova Health Research
    Inova Health Research, Inc.

    Kriger Research Group International
    Kriger Research Group International

    LCCT
    LCCT

    Metrics Research
    Complete Research Solutions on a Single Platform

    Pharmalef Developments
    Pharmalef Developments

    PrimeHealth Clinical Research Organization
    PrimeHealth Clinical Research Organization

    Research & Development RA SA
    Research & Development RA SA

    Scios Inc.
    Scios Inc. - Manufacturer of Health Care Products

    Scios Inc.
    Southeast Regional Research Group LLC.

    UniMR
    UniMR Clinical Research

    YM BioSciences
    YM BioSciences

    Become IBPA Sponsor
    Post Your Logo Here

    ©2004 International Biopharmaceutical Association Inc., all rights reserved
    Privacy Policy - Terms of Use

    Google