CuGenDBv2

Clc02G12410 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G12410
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Dirigent protein
Genome location: ClcChr02:23995568..23996189
RNA-Seq Expression: Clc02G12410
Synteny: Clc02G12410
Gene Ontology terms: GO:0048046 - apoplast (cellular component)
InterPro domains: IPR004265 - Dirigent protein
                  IPR044859 - Allene oxide cyclase/Dirigent protein
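
The genome location above uses a `chromosome:start..end` string. As a minimal sketch, the snippet below splits such a string and computes the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(loc: str):
    """Split 'chrom:start..end' and return (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("ClcChr02:23995568..23996189")
print(chrom, start, end, length)  # ClcChr02 23995568 23996189 622
```

Under that assumption the gene spans 622 bp on ClcChr02.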


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8651893.1 hypothetical protein Csa_006446 [Cucumis sativus] (e-value: 1.4e-56; identity: 73.65%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPWIQSLN KKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNV
         ALTYEFTAG+F GSS+VVLGKNSVMHTVRELP+VGGTGVFR ARGYA ARTYWLNS+GDAIVGYNV
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNV

XP_004147026.1 dirigent protein 23 [Cucumis sativus] (e-value: 5.0e-59; identity: 74.27%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPWIQSLN KKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAG+F GSS+VVLGKNSVMHTVRELP+VGGTGVFR ARGYA ARTYWLNS+GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

XP_008457663.1 PREDICTED: dirigent protein 23-like [Cucumis melo] (e-value: 5.5e-58; identity: 73.68%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPW QSLN KKPVISRH+SQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAGEF GSSIVVLGKNSVMHTVRELPVVGGTGVFR ARGYA ARTYW N++GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

XP_022146249.1 dirigent protein 23 [Momordica charantia] (e-value: 7.7e-52; identity: 69.05%)
Query:  LPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDAL
        LPW Q+LN KK VISR  SQ QTVTNIQFYFHD VSGKTP+AV+VAEAP T KSPTLFGA+                 + RA G          GL  AL
Subjt:  LPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDAL

Query:  TYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
        TYEFTAGE+NGSSI +LGKNSVMH VRELPVVGGTGVFRLARGYALARTYW N+ GDAIVGYNVTVIH
Subjt:  TYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

XP_038902199.1 dirigent protein 23 [Benincasa hispida] (e-value: 2.7e-60; identity: 76.02%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPWIQSLN KKPVISRH SQKQTV+NIQFYFHDTVSGKTPSAVKVAEAPTTG SPTLFGAV I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAGEFNGSSIVV+GKNSVMHTVRELPVVGGTGVFR+ARGYALARTYW+N++GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LPM2 Dirigent protein (e-value: 2.4e-59; identity: 74.27%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPWIQSLN KKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAG+F GSS+VVLGKNSVMHTVRELP+VGGTGVFR ARGYA ARTYWLNS+GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

A0A1S3C5Z7 Dirigent protein (e-value: 2.7e-58; identity: 73.68%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPW QSLN KKPVISRH+SQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAGEF GSSIVVLGKNSVMHTVRELPVVGGTGVFR ARGYA ARTYW N++GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

A0A5A7TRN3 Dirigent protein (e-value: 2.7e-58; identity: 73.68%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPW QSLN KKPVISRH+SQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAGEF GSSIVVLGKNSVMHTVRELPVVGGTGVFR ARGYA ARTYW N++GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

A0A6J1CXK9 Dirigent protein (e-value: 3.7e-52; identity: 69.05%)
Query:  LPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDAL
        LPW Q+LN KK VISR  SQ QTVTNIQFYFHD VSGKTP+AV+VAEAP T KSPTLFGA+                 + RA G          GL  AL
Subjt:  LPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDAL

Query:  TYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
        TYEFTAGE+NGSSI +LGKNSVMH VRELPVVGGTGVFRLARGYALARTYW N+ GDAIVGYNVTVIH
Subjt:  TYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

E5GC06 Dirigent protein (e-value: 2.7e-58; identity: 73.68%)
Query:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN
        +ATLPW QSLN KKPVISRH+SQKQTVTNIQFYFHDTVSGKTPSA+KVAEAPT+ KSPTLFGA+ I                 RA G          GL 
Subjt:  MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLN

Query:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         ALTYEFTAGEF GSSIVVLGKNSVMHTVRELPVVGGTGVFR ARGYA ARTYW N++GDAIVGYNVTVIH
Subjt:  DALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

SwissProt top hits (e-value, % identity, alignment)
Q67YM6 Dirigent protein 11 (e-value: 6.0e-23; identity: 42.76%)
Query:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKNSVM
        +T++ FYFHD +SG  P+ ++VAEAP T  S T+FGAV I                 RA G          G      + FT GEFNGS+  + G+N ++
Subjt:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKNSVM

Query:  HTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
           RELP++GGTG FR ARGYAL +TY + +I DA+V YNV + H
Subjt:  HTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

Q84TH6 Dirigent protein 23 (e-value: 3.5e-31; identity: 50.34%)
Query:  KQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKN
        K  VTN+QFYFHDT+SGK P+AVKVA+   T KSPTLFGAV                 + RA G          GL  A+++ F  G +  S+I ++GKN
Subjt:  KQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKN

Query:  SVMHTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH
        S M+ +RE+P+VGGTG+FR+ARGYA+ART W +   GDAIVGYNVT++H
Subjt:  SVMHTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH

Q9FIG6 Dirigent protein 1 (e-value: 2.1e-23; identity: 44.44%)
Query:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH
        T++ FYFHD +SG  P+AVKVAEA  T      FG + I                 RA G   + A+          Y FTAGEFNGS+I V G+N +  
Subjt:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH

Query:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         VRELP++GGTG FR ARGYAL +TY +  + DA+V YNV + H
Subjt:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

Q9FIG7 Dirigent protein 2 (e-value: 3.9e-22; identity: 42.36%)
Query:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH
        T++ FYFHD +SG  P+AVKVAEA  T  S   FG + I                 RA G     A+            FTAGEFNGS++ + G+N +  
Subjt:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH

Query:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         VRE+P++GGTG FR ARGYA A+TY +  + DA+V YNV + H
Subjt:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

Q9SS03 Dirigent protein 21 (e-value: 6.6e-22; identity: 41.78%)
Query:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGS---------GLNDALTYEFTAGEFNGSSIVVLGKNSVM
        +T++ FYFHD VSG  P++V+VA  PTT  S T FG V +                 RA G          GL  A    FT G+F+ S++ + G+N V+
Subjt:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGS---------GLNDALTYEFTAGEFNGSSIVVLGKNSVM

Query:  HTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH
          VRE+P++GGTG FR  RGYALA+T   N + GDA+V YNV + H
Subjt:  HTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH

Arabidopsis top hits (e-value, % identity, alignment)
AT1G22900.1 Disease resistance-responsive (dirigent-like protein) family protein (e-value: 4.3e-24; identity: 42.76%)
Query:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKNSVM
        +T++ FYFHD +SG  P+ ++VAEAP T  S T+FGAV I                 RA G          G      + FT GEFNGS+  + G+N ++
Subjt:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKNSVM

Query:  HTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
           RELP++GGTG FR ARGYAL +TY + +I DA+V YNV + H
Subjt:  HTVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

AT1G65870.1 Disease resistance-responsive (dirigent-like protein) family protein (e-value: 4.7e-23; identity: 41.78%)
Query:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGS---------GLNDALTYEFTAGEFNGSSIVVLGKNSVM
        +T++ FYFHD VSG  P++V+VA  PTT  S T FG V +                 RA G          GL  A    FT G+F+ S++ + G+N V+
Subjt:  VTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGS---------GLNDALTYEFTAGEFNGSSIVVLGKNSVM

Query:  HTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH
          VRE+P++GGTG FR  RGYALA+T   N + GDA+V YNV + H
Subjt:  HTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH

AT2G21100.1 Disease resistance-responsive (dirigent-like protein) family protein (e-value: 2.5e-32; identity: 50.34%)
Query:  KQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKN
        K  VTN+QFYFHDT+SGK P+AVKVA+   T KSPTLFGAV                 + RA G          GL  A+++ F  G +  S+I ++GKN
Subjt:  KQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAV----------------RICRAAG---------SGLNDALTYEFTAGEFNGSSIVVLGKN

Query:  SVMHTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH
        S M+ +RE+P+VGGTG+FR+ARGYA+ART W +   GDAIVGYNVT++H
Subjt:  SVMHTVRELPVVGGTGVFRLARGYALARTYWLN-SIGDAIVGYNVTVIH

AT5G42500.1 Disease resistance-responsive (dirigent-like protein) family protein (e-value: 2.8e-23; identity: 42.36%)
Query:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH
        T++ FYFHD +SG  P+AVKVAEA  T  S   FG + I                 RA G     A+            FTAGEFNGS++ + G+N +  
Subjt:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH

Query:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         VRE+P++GGTG FR ARGYA A+TY +  + DA+V YNV + H
Subjt:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH

AT5G42510.1 Disease resistance-responsive (dirigent-like protein) family protein (e-value: 1.5e-24; identity: 44.44%)
Query:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH
        T++ FYFHD +SG  P+AVKVAEA  T      FG + I                 RA G   + A+          Y FTAGEFNGS+I V G+N +  
Subjt:  TNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRIC----------------RAAGSGLNDAL---------TYEFTAGEFNGSSIVVLGKNSVMH

Query:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
         VRELP++GGTG FR ARGYAL +TY +  + DA+V YNV + H
Subjt:  TVRELPVVGGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
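
In BLAST-style alignments like those above, the unlabelled line between Query and Subjt repeats the residue letter at identical positions, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts both kinds of match from such a midline. `midline_stats` is a hypothetical helper, and the input is a short fragment copied from the first GenBank hit. It does not try to reproduce the reported % identity values, which are computed over the full alignment.

```python
def midline_stats(midline: str):
    """Return (identities, positives) for a BLAST-style midline.

    Letters mark identical residues; '+' marks conservative
    substitutions, which count as positives but not as identities.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# Fragment of the midline from the first GenBank hit (20 columns).
fragment = "+ATLPWIQSLN KKPVISRH"
identities, positives = midline_stats(fragment)
print(identities, positives, len(fragment))  # 18 19 20
```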


Sequences
CDS sequence
ATGGCAACTCTCCCATGGATTCAAAGCCTCAACACAAAGAAACCCGTAATCTCAAGACACGTTTCTCAGAAACAGACCGTCACCAACATCCAATTCTACTTCCACGACAC
CGTCAGCGGGAAAACCCCCTCCGCCGTCAAGGTCGCCGAGGCCCCAACCACCGGAAAATCGCCGACCCTCTTCGGGGCTGTACGGATCTGTCGGGCAGCAGGATCTGGGC
TTAATGATGCATTAACCTACGAATTCACCGCCGGCGAGTTCAATGGCAGCTCCATTGTTGTTTTGGGGAAGAATTCGGTGATGCATACCGTTCGTGAGTTGCCGGTCGTC
GGAGGGACGGGGGTTTTCCGGTTAGCTCGTGGCTATGCTCTGGCAAGGACTTATTGGCTCAACTCCATTGGTGATGCCATTGTGGGTTATAATGTAACAGTTATACACTA
G
mRNA sequence
ATGGCAACTCTCCCATGGATTCAAAGCCTCAACACAAAGAAACCCGTAATCTCAAGACACGTTTCTCAGAAACAGACCGTCACCAACATCCAATTCTACTTCCACGACAC
CGTCAGCGGGAAAACCCCCTCCGCCGTCAAGGTCGCCGAGGCCCCAACCACCGGAAAATCGCCGACCCTCTTCGGGGCTGTACGGATCTGTCGGGCAGCAGGATCTGGGC
TTAATGATGCATTAACCTACGAATTCACCGCCGGCGAGTTCAATGGCAGCTCCATTGTTGTTTTGGGGAAGAATTCGGTGATGCATACCGTTCGTGAGTTGCCGGTCGTC
GGAGGGACGGGGGTTTTCCGGTTAGCTCGTGGCTATGCTCTGGCAAGGACTTATTGGCTCAACTCCATTGGTGATGCCATTGTGGGTTATAATGTAACAGTTATACACTA
GAAAATTCGAGAAATTATTCAGGGACATCATCACAACAAAACGGATTATTTTGTGTTGTTGTTTGGTGATTAAGAAATTGTAGAAGTATCTAATGATTTATTGGAGAG
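
The mRNA sequence above begins with the same bases as the CDS and then continues past the stop codon. A minimal sketch of how the untranslated flanks could be measured once both sequences are joined into single strings follows. `utr_lengths` is a hypothetical helper, and the demo uses toy sequences rather than the record's data.

```python
def utr_lengths(mrna: str, cds: str):
    """Locate the CDS inside the mRNA and return (5' UTR length, 3' UTR length).

    Assumption: the CDS occurs as one contiguous substring of the mRNA;
    the first occurrence is used.
    """
    mrna, cds = mrna.upper(), cds.upper()
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found in mRNA")
    return offset, len(mrna) - offset - len(cds)

# Toy sequences for illustration only.
toy_cds = "ATGAAATAG"
toy_mrna = "GG" + toy_cds + "CCC"
print(utr_lengths(toy_mrna, toy_cds))  # (2, 3)
```

To apply it to this record, strip the line breaks from the CDS and mRNA listings before passing them in.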
Protein sequence
MATLPWIQSLNTKKPVISRHVSQKQTVTNIQFYFHDTVSGKTPSAVKVAEAPTTGKSPTLFGAVRICRAAGSGLNDALTYEFTAGEFNGSSIVVLGKNSVMHTVRELPVV
GGTGVFRLARGYALARTYWLNSIGDAIVGYNVTVIH
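
The protein sequence above corresponds to the CDS read codon by codon. As a sketch, the snippet below translates a coding sequence using the standard genetic code. It assumes that the standard table applies to this nuclear gene, and `translate` is a hypothetical helper written here rather than a library call. The two inputs are the first 18 and last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCAACTCTCCCATGG"))  # MATLPW
print(translate("GTTATACACTAG"))        # VIH
```

The two results match the first six and last three residues of the protein sequence listed above.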