; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021087 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021087
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNase H domain-containing protein
Genome locationchr7:4554129..4554839
RNA-Seq ExpressionLag0021087
SyntenyLag0021087
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.2e-2232.59Show/hide
Query:  ELVVIFWWSVWNLR-KSLFWGGQSDGRDLWAYSSDYLSAFH---VGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGYVLRG
        EL++++ W +W+ R K +F G +SD R L A +   L A+      G   GA+D    Q +       W+PP    LKLN++A+V    ++ G G ++R 
Subjt:  ELVVIFWWSVWNLR-KSLFWGGQSDGRDLWAYSSDYLSAFH---VGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGYVLRG

Query:  AEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLAR
        AEG++        Q    V LAE  A++ G+Q+A Q+     +VE+D   +V++LN      +E+  ++ D+RR    +   +  F PR  N  AH LA+
Subjt:  AEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLAR

Query:  LAF--SYVDRVWLEEWPSEVSDVL
         A   S  D VW+  +P+EV +VL
Subjt:  LAF--SYVDRVWLEEWPSEVSDVL

XP_015383105.1 uncharacterized protein LOC107175839 [Citrus sinensis]2.7e-2231.86Show/hide
Query:  MRDKLTGSNFELVVIFWWSVWNLRK-SLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAG
        M + L  ++FEL+V  +WS+W+ R   LF G ++D     A +   L ++     R     SS  +S+   ++  W+PPP   +K+N+NA+ + + + AG
Subjt:  MRDKLTGSNFELVVIFWWSVWNLRK-SLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAG

Query:  GGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNK
         G VLR   G V  AA    +    +  AE  A+  G+Q+AR +     +VE DS  +  +LN +  + SEV  ++ +I+ ++  + N KV +TPR  N 
Subjt:  GGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNK

Query:  VAHVLARLAFSYVDRV-WLEEWPSEV
        +AH LARLA    + V W   +P  +
Subjt:  VAHVLARLAFSYVDRV-WLEEWPSEV

XP_022150944.1 uncharacterized protein LOC111018973 [Momordica charantia]2.3e-2936.44Show/hide
Query:  DKLTGSNFELVVIFWWSVWNLR----KSLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEER-GVWRPPPNRELKLNINASVRPDTEE
        D ++  +   +V+  W++WN R    +    GG     DL ++S +YL  +     +   R S  A    +  R  +WRPP    LK+N++A+ R ++  
Subjt:  DKLTGSNFELVVIFWWSVWNLR----KSLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEER-GVWRPPPNRELKLNINASVRPDTEE

Query:  AGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFTPRQ
        AG G ++R + G V++ A   L R   VD  EG+AVY GI LA + GF+ F +ETDSLR+  +L  +  D SEVG+L   I+  LS         FT R 
Subjt:  AGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFTPRQ

Query:  GNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVLRGD
        GN  AH+LA+LA +    ++W+EEWP E+S VL  D
Subjt:  GNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVLRGD

XP_023884925.1 uncharacterized protein LOC111997106 [Quercus suber]2.1e-2232.05Show/hide
Query:  DKLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGG
        ++L+    EL     W VWN R  L  GGQ      L   + +Y+S F     R G       Q  +Q     W+PPP  E K+N +A++  +  + G G
Subjt:  DKLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGG

Query:  YVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILS--PWVNGKVLFTPRQGNK
         ++R   GEV  A   S     + D AE  A  R I+ A   GF   +VE D++ +V+ ++  + ++S +G+++DDIR +L    WV+  +    R GN 
Subjt:  YVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILS--PWVNGKVLFTPRQGNK

Query:  VAHVLARLAFSYVDR--VWLEEWPSEVSDVLRGD
        VAHVLA+ A + +D    WLE+ P   ++ L  D
Subjt:  VAHVLARLAFSYVDR--VWLEEWPSEVSDVLRGD

XP_023928118.1 uncharacterized protein LOC112039474 [Quercus suber]8.7e-2132.03Show/hide
Query:  KLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGY
        KL+ + FEL VI  W +WN R ++ +GG+  D + L  ++ ++L  FH   G+     S    S       VWRPPP+   KLN +A V      +G G 
Subjt:  KLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGY

Query:  VLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAH
        ++R   GEV       +       +AE  A  R ++ A + GF D VVE D+L ++K L     D+S +G ++ DI+ +   +      +  R  N VA+
Subjt:  VLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAH

Query:  VLARLAFS-YVDRVWLEEWPSEVSDVLRGDF
         LAR A   + D  W+E+ P  V + L  DF
Subjt:  VLARLAFS-YVDRVWLEEWPSEVSDVLRGDF

TrEMBL top hitse value%identityAlignment
A0A2P6SAP1 Putative RNA-directed DNA polymerase6.1e-2030.3Show/hide
Query:  DKLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGG
        D +T  +  L  ++ W +W+ R +L W G   +  +   ++S +L  +         +     Q +    R  W  PP   LK+NI+ S RP++ + G G
Subjt:  DKLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGG

Query:  YVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVA
         V+R  +G    A         S    E  A   G+ LA    + +F++E+D   +V  LN  L D SEVG ++DD +  L+   + K+    R+ N VA
Subjt:  YVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVA

Query:  HVLARLAFS-YVDRVWLEEWPSEVSDVLRGD
        + LA LA S +++ VWLEE P  + DVL  D
Subjt:  HVLARLAFS-YVDRVWLEEWPSEVSDVLRGD

A0A5B7BI33 Uncharacterized protein (Fragment)3.6e-2030.74Show/hide
Query:  DKLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGG
        +KL  S  EL  +  W VW  R +++   Q  D   +   +  YL+ +H    R     S  +++       VW PPP    KLN++ S  P +   G G
Subjt:  DKLTGSNFELVVIFWWSVWNLRKSLFWGGQ-SDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGG

Query:  YVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVA
         V+R + G V       L+ C S D AE  A++ G+  A+++G VD ++E+D L LV  +     D S +G + DDIRR +    + +V    R  N+ A
Subjt:  YVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVA

Query:  HVLARLAFSYVDR-VWLEEWPSEVSDVLRGD
        H +A  A    +  +W+E  PS    VL  D
Subjt:  HVLARLAFSYVDR-VWLEEWPSEVSDVLRGD

A0A6J1C467 uncharacterized protein LOC1110077754.7e-2030.04Show/hide
Query:  DKLTGSNFELVVIFWWSVWNLR-KSLFWGGQSDGR-DLWAYSSDYLSAFHVG----------GGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINAS
        D++  +  E + +F W++WN R +S+F  G      ++  + +DYL  +              GR G R S+W             PP     K+N++A+
Subjt:  DKLTGSNFELVVIFWWSVWNLR-KSLFWGGQSDGR-DLWAYSSDYLSAFHVG----------GGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINAS

Query:  VRPDTEEAGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSP-WVNGK
         +     AG   ++R +   V ++A   +     V LAE  A   G+ LA + G + F +ETDS ++  +L  +  D SE+G+L   IR I+S   + G 
Subjt:  VRPDTEEAGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSP-WVNGK

Query:  VLFTPRQGNKVAHVLARLAF-SYVDRVWLEEWPSEVSDVLRGD
          F  R+GN  AH LAR+   S    VW+EEW S++S+V+  D
Subjt:  VLFTPRQGNKVAHVLARLAF-SYVDRVWLEEWPSEVSDVLRGD

A0A6J1DBJ7 uncharacterized protein LOC1110189731.1e-2936.44Show/hide
Query:  DKLTGSNFELVVIFWWSVWNLR----KSLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEER-GVWRPPPNRELKLNINASVRPDTEE
        D ++  +   +V+  W++WN R    +    GG     DL ++S +YL  +     +   R S  A    +  R  +WRPP    LK+N++A+ R ++  
Subjt:  DKLTGSNFELVVIFWWSVWNLR----KSLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEER-GVWRPPPNRELKLNINASVRPDTEE

Query:  AGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFTPRQ
        AG G ++R + G V++ A   L R   VD  EG+AVY GI LA + GF+ F +ETDSLR+  +L  +  D SEVG+L   I+  LS         FT R 
Subjt:  AGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNG-KVLFTPRQ

Query:  GNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVLRGD
        GN  AH+LA+LA +    ++W+EEWP E+S VL  D
Subjt:  GNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVLRGD

A0A803PE40 Uncharacterized protein2.1e-2030.87Show/hide
Query:  LTGSNFELVVIFWWSVWNLRKSLFWG-GQSDGRDLWAYSSDYLSAFHVGGGRCGA--RDSSWAQSREQEERGV-WRPPPNRELKLNINASVRPDTEEAGG
        L+    ELV +  W +W  R  +  G  + DG  L  Y+++Y+  +H    R  A     + AQ       G+ W+PP    LKLN++A+V P ++  G 
Subjt:  LTGSNFELVVIFWWSVWNLRKSLFWG-GQSDGRDLWAYSSDYLSAFHVGGGRCGA--RDSSWAQSREQEERGV-WRPPPNRELKLNINASVRPDTEEAGG

Query:  GYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKV
        G ++R  +GEV  A    +Q C+  D  E  A++  +    QL     +VETD+LR+   LN E  D+S    ++ D+R +LS +    +    R  N+ 
Subjt:  GYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKV

Query:  AHVLARLAFSY-VDRVWLEEWPSEVSDVLR
        AH LA+ A    VD  W+ E P  +  V++
Subjt:  AHVLARLAFSY-VDRVWLEEWPSEVSDVLR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G13980.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.4e-0826.67Show/hide
Query:  WRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLL
        W+ PP+  +K N +A       ++   +++R  +G       L L    +   AE  A+   +Q     G++  ++E D   L  +++G     + +  L
Subjt:  WRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEGEVFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLL

Query:  MDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS----YVDRVWLEEW
        +DDIR     + N +  F  R GNKVAH LA+L  +    Y D   L  W
Subjt:  MDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS----YVDRVWLEEW

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.3e-1126.37Show/hide
Query:  WSVWNLRKS----LFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGV---WRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEGE
        W +W L KS    +F G + D  ++   + +    +         R+     S  Q ER +   W+ PP + +K N +A+ + +    G G++LR   G 
Subjt:  WSVWNLRKS----LFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGV---WRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEGE

Query:  VFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS
        V      +L R  +V  AE  A+   +    +  +   + E+D+  LV +LN +      +   ++DI+++L  +   K  FTPR GNKVA  +AR + S
Subjt:  VFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS

Query:  Y
        +
Subjt:  Y

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-1631.1Show/hide
Query:  ELVVIFWWSVWNLRKSL-FWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEG
        +LV    W +W  R  L F G + + +++   + D L  + +   R  A          +   G WRPPP++ +K N +A+   D E  G G+VLR  +G
Subjt:  ELVVIFWWSVWNLRKSL-FWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEG

Query:  EVFMAAFLSLQRCWSVDLAE----GWAVYRGIQLAR-QLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVL
        EV      +L +  SV  AE     WAV   + L+R Q  +V F  E+DS  L++ILN +      +   + D++R+LS +   K +F PR+GN +A  +
Subjt:  EVFMAAFLSLQRCWSVDLAE----GWAVYRGIQLAR-QLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVL

Query:  ARLAFSYVD
        AR + S+++
Subjt:  ARLAFSYVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGACAAACTGACAGGGTCGAATTTTGAGCTTGTGGTCATTTTTTGGTGGTCTGTGTGGAATCTACGAAAAAGCCTGTTTTGGGGTGGGCAGTCAGACGGTCGGGA
TCTCTGGGCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGACGTTGCGGGGCAAGGGACTCCTCATGGGCTCAATCGAGAGAGCAGGAAGAGCGCGGTG
TATGGAGACCGCCCCCTAATAGGGAGCTGAAACTTAATATCAATGCTTCGGTACGGCCGGATACAGAAGAAGCGGGGGGTGGCTATGTGCTGCGAGGGGCTGAGGGTGAG
GTATTCATGGCAGCTTTTTTGAGCTTACAGAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAGAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGA
TTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTCTGAATGGGGAGCTGCATGATGTGTCGGAAGTGGGGCTGCTGATGGATGACATTCGAAGGATCCTCAGTC
CTTGGGTCAACGGTAAGGTGTTGTTTACTCCACGTCAGGGGAACAAGGTTGCGCATGTTCTGGCCCGCCTGGCCTTTTCATACGTTGATCGTGTATGGCTTGAGGAGTGG
CCTAGCGAGGTCTCAGACGTCTTGAGGGGTGATTTTGTTTCAGTTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGACAAACTGACAGGGTCGAATTTTGAGCTTGTGGTCATTTTTTGGTGGTCTGTGTGGAATCTACGAAAAAGCCTGTTTTGGGGTGGGCAGTCAGACGGTCGGGA
TCTCTGGGCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGGGACGTTGCGGGGCAAGGGACTCCTCATGGGCTCAATCGAGAGAGCAGGAAGAGCGCGGTG
TATGGAGACCGCCCCCTAATAGGGAGCTGAAACTTAATATCAATGCTTCGGTACGGCCGGATACAGAAGAAGCGGGGGGTGGCTATGTGCTGCGAGGGGCTGAGGGTGAG
GTATTCATGGCAGCTTTTTTGAGCTTACAGAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAGAGGGATCCAACTTGCTCGACAGTTGGGGTTTGTGGA
TTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTCTGAATGGGGAGCTGCATGATGTGTCGGAAGTGGGGCTGCTGATGGATGACATTCGAAGGATCCTCAGTC
CTTGGGTCAACGGTAAGGTGTTGTTTACTCCACGTCAGGGGAACAAGGTTGCGCATGTTCTGGCCCGCCTGGCCTTTTCATACGTTGATCGTGTATGGCTTGAGGAGTGG
CCTAGCGAGGTCTCAGACGTCTTGAGGGGTGATTTTGTTTCAGTTGCATGA
Protein sequenceShow/hide protein sequence
MRDKLTGSNFELVVIFWWSVWNLRKSLFWGGQSDGRDLWAYSSDYLSAFHVGGGRCGARDSSWAQSREQEERGVWRPPPNRELKLNINASVRPDTEEAGGGYVLRGAEGE
VFMAAFLSLQRCWSVDLAEGWAVYRGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRVWLEEW
PSEVSDVLRGDFVSVA