CuGenDBv2

Lag0026265 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026265
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: RNA-directed DNA polymerase (reverse transcriptase)-related family protein
Genome location: chr10:33566434..33567093
RNA-Seq Expression: Lag0026265
Synteny: Lag0026265
Gene Ontology terms: NA
InterPro domains: NA
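
The genome location string above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    m = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1

print(parse_location("chr10:33566434..33567093"))
# ('chr10', 33566434, 33567093, 660)
```

Under that assumption the locus spans 660 bp, which can be compared against the length of the CDS listed on this page.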


Homology
GenBank top hits (e-value, % identity, alignment)
CAA7061809.1 unnamed protein product [Microthlaspi erraticum]; e-value 5.2e-52; identity 44.95%
Query:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS
        I F   + TD R  +  ++ +V     GSYLGLP  F  SK +    L +RV + + GW     S GGKE LLKSV+ A+P YAM CF+LPKG+  K+ S
Subjt:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS

Query:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA
        + A+FWW S E+K  +HW  W ++  PK +GG+ FRD+E FNQA+LAKQAWR+L  P S VA++L+ RYFP    +++    R S+ W    WG +LL  
Subjt:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA

Query:  GLQKQIGDGRSIRWLQDP
        GL+K++G+G S++   DP
Subjt:  GLQKQIGDGRSIRWLQDP

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]; e-value 9.5e-54; identity 50.5%
Query:  SLIVQMVTVDHLG---SYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTES
        SLI  +++V+ +     YLGLP+   R++R  F  + DRVW  LQGWK   FS GGKE+L+K+V QAIP Y M CFRLPK L+ +   + A+FWWGS++ 
Subjt:  SLIVQMVTVDHLG---SYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTES

Query:  KNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI
           IHW  W  LY PK  GG+ FRDLE+FN+A+LAKQ WR+LN+P S +++VLKGRYF     +EA      SY W+   WG DLLK GL+ +IG+G S+
Subjt:  KNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI

XP_030477990.1 uncharacterized protein LOC115695032 [Cannabis sativa]; e-value 1.0e-52; identity 47.95%
Query:  MICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIF
        ++ FS N S  ++     I+ M   +   SYLGLP+   R K++ F  + +R+W  L  W    FS GGKE+LLK+V+Q+IPTYAM CF+LP     +I 
Subjt:  MICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIF

Query:  SLCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLK
        SL + FWWGST  K  IHWKQW+ L K K  GGL F++   FNQA+LAKQAWR+  NP S + +VLKGRYF   D L A TC  SS  W+G  WG +LLK
Subjt:  SLCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLK

Query:  AGLQKQIGDGRSIRWLQDP
         G++ Q+G+G  I    DP
Subjt:  AGLQKQIGDGRSIRWLQDP

XP_030505322.1 uncharacterized protein LOC115720309 [Cannabis sativa]; e-value 1.9e-54; identity 50.23%
Query:  FSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLC
        FS N + D +  ++ ++ +   D +  YLGLP +F RSK++ F  L DRVWS L  W    FS GGKE+LLKSVVQAIP+YAM CF++P   LSKI S+ 
Subjt:  FSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLC

Query:  AKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGL
        A FWWG TE    IHWK+W  L   K  GGL FR ++ FNQAMLAKQAWR+L  P S VA +LK  YFP T  LEA    + S  W   +WG +LLK G+
Subjt:  AKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGL

Query:  QKQIGDGRSIRWLQD
        +K +GDG +     D
Subjt:  QKQIGDGRSIRWLQD

XP_030509135.1 uncharacterized protein LOC115723805 [Cannabis sativa]; e-value 1.5e-54; identity 50.23%
Query:  FSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLC
        FS N +   +  ++ ++ +   D +  YLGLP +F RSK+  F  + DRVW+ L  W   FFS GGKE+LLKSVVQAIP+YAM CF+LP    SKI SL 
Subjt:  FSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLC

Query:  AKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGL
        A+FWWG  E+   IHWK W  L   K  GGL FR ++ FNQAMLAKQAWR+L  P S VA +LK RYFP T  L +    R S  W    WG +LL++GL
Subjt:  AKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGL

Query:  QKQIGDGRSIRWLQD
        +K IGDG +I    D
Subjt:  QKQIGDGRSIRWLQD

TrEMBL top hits (e-value, % identity, alignment)
A0A803PM68 Uncharacterized protein; e-value 2.8e-59; identity 49.08%
Query:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS
        +CF R VS   +  L+ ++ +  VD+ G YLGLPS   R+K++ F+ +  +VW+ L+GWK SFFS  GKEIL+K++VQAIPTY M CFRLPK  ++ I S
Subjt:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS

Query:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA
        + A+FWWGS+E    IHW +W  L K KE GGL FRDL +FNQA+LAKQ WR +  P S  +KVLK  YFP   +LEA +   +S+ W+   WG  +++ 
Subjt:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA

Query:  GLQKQIGDGRSIRWLQDP
        G + +IG+G S+R L+DP
Subjt:  GLQKQIGDGRSIRWLQDP

A0A803PV25 Uncharacterized protein; e-value 2.8e-59; identity 49.08%
Query:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS
        +CF R+VS   R +L+  + +  VD+ G YLGLPS   R+K++ F+  +++VW+ L+GWK SFFS  GKE+L+K++VQAIPTY M CFRLPK  ++ I S
Subjt:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS

Query:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA
        + A+FWWGS+E    IHW +W  L K KE GGL FRDL +FNQA+LAKQ WR +  P S  +KVLK  Y+P   +LEA     +S+ W+   WG  +++A
Subjt:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA

Query:  GLQKQIGDGRSIRWLQDP
        G + +IG+G S+R L DP
Subjt:  GLQKQIGDGRSIRWLQDP

A0A803PWX1 Uncharacterized protein; e-value 1.8e-58; identity 47.25%
Query:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS
        +CF R V+   R +L+ I+ +  VD+ G YLGLPS   R+K++ F+ + ++VW+ L+GWK SFFS  GKE+L+K+V+QAIPTY M CFRLPK  ++ I S
Subjt:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS

Query:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA
        + A+FWWGS+E  + IHW +W  L K KE GGL FRDL +FNQA+LAKQ WR +  P S  ++VLK  Y+P   ++EA +   +S+ W+   WG  +++ 
Subjt:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA

Query:  GLQKQIGDGRSIRWLQDP
        G + +IG+G S+R + DP
Subjt:  GLQKQIGDGRSIRWLQDP

A0A803Q0L5 Uncharacterized protein; e-value 1.9e-55; identity 46.79%
Query:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS
        +CF RNVS + R  L+L + +  VD+ G YLGLPS   R+K++    + ++VW+ ++GWK S FS  GKE+L+K+VVQAIPTYAM CFRL K  +S I  
Subjt:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS

Query:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA
        + A+FWWGS+E    IHW +W  L KPK+ GGL FRDL  FNQA+LAKQ WR +    +  ++VLK  YFP   +LEA +   +S+ W+   WG  ++  
Subjt:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA

Query:  GLQKQIGDGRSIRWLQDP
        G + ++G+G ++R L+DP
Subjt:  GLQKQIGDGRSIRWLQDP

A0A803QJV0 Uncharacterized protein; e-value 1.9e-55; identity 46.33%
Query:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS
        +CF +NV+  T+L L+  + +  VD+ G YLGL S   R+K++ F  + +RVW+SL+GWK   FS GG E+L+K++VQAIP Y M  +RL K  ++ I  
Subjt:  ICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFS

Query:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA
        + A+FWWGST  K  IHW +W+ L +PKE GGL FRDLE+FNQA+LAKQ WR L  P S   KVLK  YFP   +L A     +S+ W+   WG +++  
Subjt:  LCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKA

Query:  GLQKQIGDGRSIRWLQDP
        G + ++G+G+ +R L+DP
Subjt:  GLQKQIGDGRSIRWLQDP

SwissProt top hits (e-value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750; e-value 4.9e-21; identity 30.85%
Query:  LPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKEMGG
        +P    R  +  F  +L+RV S + GW+    S  G+  L K+V+ ++P ++M    LP+ +L+++  L   F WGST  K   H  +W ++  PK+ GG
Subjt:  LPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKEMGG

Query:  LNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSY--FWKGFAWGL-DLLKAGLQKQIGDGRSIRWLQD
        L  R  +  N+A+++K  WR+L    S    VL+ +Y            P+ S+   W+  A GL D++  G+    GDG+ IR+  D
Subjt:  LNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSY--FWKGFAWGL-DLLKAGLQKQIGDGRSIRWLQD

P93295 Uncharacterized mitochondrial protein AtMg00310; e-value 4.0e-31; identity 44.83%
Query:  AIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKE-MGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLE
        A+P YAM CFRL K L  K+ S   +FWW S E+K  I W  W++L K KE  GGL FRDL  FNQA+LAKQ++R+++ P + ++++L+ RYFP + ++E
Subjt:  AIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKE-MGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLE

Query:  APTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI-----RWLQD
             R SY W+    G +LL  GL + IGDG        RW+ D
Subjt:  APTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI-----RWLQD

Arabidopsis top hits (e-value, % identity, alignment)
AT4G29090.1 Ribonuclease H-like superfamily protein; e-value 7.1e-31; identity 42.25%
Query:  AIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEA
        A+PTY M CF LPK +  +I S+ A FWW + +    +HWK W  L   K  GG+ F+D+E FN A+L KQ WR+L+ P S +AKV K RYF  +D L A
Subjt:  AIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEA

Query:  PTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI-----RWL
        P   R S+ WK      ++L+ G +  +G+G  I     +WL
Subjt:  PTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI-----RWL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e-value 2.9e-32; identity 44.83%
Query:  AIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKE-MGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLE
        A+P YAM CFRL K L  K+ S   +FWW S E+K  I W  W++L K KE  GGL FRDL  FNQA+LAKQ++R+++ P + ++++L+ RYFP + ++E
Subjt:  AIPTYAMGCFRLPKGLLSKIFSLCAKFWWGSTESKNCIHWKQWRELYKPKE-MGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLE

Query:  APTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI-----RWLQD
             R SY W+    G +LL  GL + IGDG        RW+ D
Subjt:  APTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSI-----RWLQD
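
In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to derive identity and positive percentages from a query line and its midline. `midline_stats` is a hypothetical helper, and the denominator used here (the number of columns in the displayed query segment) is an assumption, so the result need not reproduce the % identity values reported on this page.

```python
def midline_stats(query, midline):
    """Return (percent_identity, percent_positive) for one alignment segment.

    Assumptions: `query` and `midline` are the sequence portions only
    (labels and leading indentation removed), and trailing spaces of
    the midline may have been stripped, so it is padded to the query length.
    """
    n = len(query)
    if n == 0:
        raise ValueError("empty alignment segment")
    midline = midline.ljust(n)[:n]
    identical = sum(1 for c in midline if c.isalpha())
    positive = identical + midline.count("+")
    return 100.0 * identical / n, 100.0 * positive / n

# Toy segment, not taken from the page: 2 identities and 1 positive in 5 columns.
print(midline_stats("ICFSR", "I F +"))
# (40.0, 60.0)
```

For a multi-block alignment, the query and midline segments would be concatenated across blocks before calling the function.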


Sequences
CDS sequence
ATGATTTGTTTCTCGAGGAATGTGTCTACTGATACTAGGCTATACCTAAGCTTGATTGTGCAGATGGTAACAGTGGATCACTTGGGTTCATACCTTGGTCTACCCTCGTC
GTTCCATCGAAGCAAAAGAAAGGATTTTAAAGGCCTTCTTGACAGAGTCTGGTCTTCTTTACAAGGTTGGAAGATGAGTTTTTTCTCTGGAGGAGGGAAAGAGATACTTC
TCAAGAGTGTGGTCCAAGCCATTCCGACATATGCAATGGGGTGCTTTAGACTGCCAAAGGGATTGTTGTCTAAAATCTTCTCTTTGTGTGCTAAGTTCTGGTGGGGCTCA
ACTGAGAGTAAAAATTGTATACACTGGAAACAATGGAGGGAGTTATATAAACCCAAGGAGATGGGCGGTTTAAATTTCAGAGATTTGGAGATTTTTAATCAAGCCATGCT
AGCCAAACAAGCTTGGAGAGTGTTGAATAACCCAAGATCAACGGTAGCCAAGGTCCTTAAAGGAAGATATTTCCCAACAACGGATCTACTGGAGGCTCCAACGTGCCCAC
GCTCTTCCTACTTTTGGAAGGGCTTTGCCTGGGGTTTGGATCTCTTGAAGGCGGGCTTACAGAAACAGATCGGTGACGGCAGGTCTATTAGATGGCTTCAGGATCCTTAG
mRNA sequence
ATGATTTGTTTCTCGAGGAATGTGTCTACTGATACTAGGCTATACCTAAGCTTGATTGTGCAGATGGTAACAGTGGATCACTTGGGTTCATACCTTGGTCTACCCTCGTC
GTTCCATCGAAGCAAAAGAAAGGATTTTAAAGGCCTTCTTGACAGAGTCTGGTCTTCTTTACAAGGTTGGAAGATGAGTTTTTTCTCTGGAGGAGGGAAAGAGATACTTC
TCAAGAGTGTGGTCCAAGCCATTCCGACATATGCAATGGGGTGCTTTAGACTGCCAAAGGGATTGTTGTCTAAAATCTTCTCTTTGTGTGCTAAGTTCTGGTGGGGCTCA
ACTGAGAGTAAAAATTGTATACACTGGAAACAATGGAGGGAGTTATATAAACCCAAGGAGATGGGCGGTTTAAATTTCAGAGATTTGGAGATTTTTAATCAAGCCATGCT
AGCCAAACAAGCTTGGAGAGTGTTGAATAACCCAAGATCAACGGTAGCCAAGGTCCTTAAAGGAAGATATTTCCCAACAACGGATCTACTGGAGGCTCCAACGTGCCCAC
GCTCTTCCTACTTTTGGAAGGGCTTTGCCTGGGGTTTGGATCTCTTGAAGGCGGGCTTACAGAAACAGATCGGTGACGGCAGGTCTATTAGATGGCTTCAGGATCCTTAG
Protein sequence
MICFSRNVSTDTRLYLSLIVQMVTVDHLGSYLGLPSSFHRSKRKDFKGLLDRVWSSLQGWKMSFFSGGGKEILLKSVVQAIPTYAMGCFRLPKGLLSKIFSLCAKFWWGS
TESKNCIHWKQWRELYKPKEMGGLNFRDLEIFNQAMLAKQAWRVLNNPRSTVAKVLKGRYFPTTDLLEAPTCPRSSYFWKGFAWGLDLLKAGLQKQIGDGRSIRWLQDP
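
The protein sequence above corresponds to the CDS translated in frame from its first base. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the relationship can be checked as follows; `translate` is a hypothetical helper, and only the first 18 nucleotides of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 18 nt of the CDS listed on this page.
print(translate("ATGATTTGTTTCTCGAGG"))
# MICFSR
```

Passing the full CDS (with line breaks removed) should give a sequence that can be compared directly with the protein sequence listed here.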