CuGenDBv2

Lag0025249 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0025249
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: chr10:10465586..10466641
RNA-Seq Expression: Lag0025249
Synteny: Lag0025249
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa]; e value 1.3e-35; identity 39.66%
Query:  PPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAA-------------------
        PPP+S+ L      N +PQ       P    P++  PL +KL D NY++WK QLLN +IA  +E F++G    PP++LD                     
Subjt:  PPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAA-------------------

Query:  --------ETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVT
                E+ LG+I+G ++AS+IWE L  +Y ++S A +  LR+ LQ I K+GL+   Y+ + + + +  + IGEP++Y DHL Y L GLG +YNPFVT
Subjt:  --------ETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVT

Query:  SIQNRSDRPSLADVRSLLLAYVARLEKQSSVE
        SIQ+++ RPS+ +V SLLL+Y ARLE+QS+ +
Subjt:  SIQNRSDRPSLADVRSLLLAYVARLEKQSSVE

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale]; e value 9.1e-37; identity 43.05%
Query:  PPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAAE-------------------------TQ--LGEIIGCSTAS
        PP  P    P++  P  IKL   NYL+WKNQLLN IIA  +E FI+G  P PP++ D A                          TQ  +G+I+G ++A 
Subjt:  PPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAAE-------------------------TQ--LGEIIGCSTAS

Query:  EIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYV
        EIWE L  +Y SSS A+I  LR++LQ + KDGL+  +Y+ + K+I +  + +GEP+S +DHL Y+  GL  EYN FVTSI  R D   L ++ SLLL+Y 
Subjt:  EIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYV

Query:  ARLEKQSSVEQLNMVQANLANLS
         RLE Q++  QL+ +QANLA+L+
Subjt:  ARLEKQSSVEQLNMVQANLANLS

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]; e value 3.5e-36; identity 45.08%
Query:  FPP-PSSHTLSFFQPFNTSPQYFPPPQ--QPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFIN-GMPPPPKYLDAAETQ------------
        FPP P+S++ +      T+    P PQ  Q   P P+L+  L+IKL ++N LL K+QLLN IIA  +E FI+     PPKYLDAA  Q            
Subjt:  FPP-PSSHTLSFFQPFNTSPQYFPPPQ--QPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFIN-GMPPPPKYLDAAETQ------------

Query:  ---------------LGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYN
                       +G+I+  STA +IW  L   YES S A +M L SQLQ+I K  + +S+YL+++K + D+F+ IGEPLSYRD L  ILEGL  EY+
Subjt:  ---------------LGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYN

Query:  PFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQAN
         FVTSI NRSDRPSL +V SLL  Y  RL ++S  + LN  QAN
Subjt:  PFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQAN

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]; e value 6.5e-59; identity 51.01%
Query:  QFPPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFING-MPPPPKYLD-------------------
        QFPPP+ + L+            PP     NP+PTL  PLN+KL+D+N+LLWKNQLLN +IA  + G+++G + PPP++LD                   
Subjt:  QFPPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFING-MPPPPKYLD-------------------

Query:  --------AAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPF
                 +E ++GE++   T  +IW  L  VY+S +TARIMGL+++LQ + KDG SVSQYLA+IK+I DKF+ +GEPLSYRDHL ++L+GLG+EYN F
Subjt:  --------AAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPF

Query:  VTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLS
        VTSI NR+D PSL DVRSLLLAY ARL+KQ++V+QLN+ QANL NLS
Subjt:  VTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLS

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]; e value 1.6e-44; identity 67.88%
Query:  LGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSL
        +GEI+G  +A +IWE LR VYESSS A IMG  SQLQKI KDGL+VSQYLAQIKD++D F+ IGEPLSYRDHL YILEGLG+EYNPFV+SI NR++RPS+
Subjt:  LGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSL

Query:  ADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSFPS
        ADVR+LL+ Y +RLEKQ++ + L ++QAN+A+LS  S
Subjt:  ADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSFPS

TrEMBL top hits (e value, % identity, alignment)
A0A2P5BGF8 Uncharacterized protein; e value 4.4e-37; identity 43.05%
Query:  PPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAAE-------------------------TQ--LGEIIGCSTAS
        PP  P    P++  P  IKL   NYL+WKNQLLN IIA  +E FI+G  P PP++ D A                          TQ  +G+I+G ++A 
Subjt:  PPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAAE-------------------------TQ--LGEIIGCSTAS

Query:  EIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYV
        EIWE L  +Y SSS A+I  LR++LQ + KDGL+  +Y+ + K+I +  + +GEP+S +DHL Y+  GL  EYN FVTSI  R D   L ++ SLLL+Y 
Subjt:  EIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYV

Query:  ARLEKQSSVEQLNMVQANLANLS
         RLE Q++  QL+ +QANLA+L+
Subjt:  ARLEKQSSVEQLNMVQANLANLS

A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE1; e value 1.7e-36; identity 45.08%
Query:  FPP-PSSHTLSFFQPFNTSPQYFPPPQ--QPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFIN-GMPPPPKYLDAAETQ------------
        FPP P+S++ +      T+    P PQ  Q   P P+L+  L+IKL ++N LL K+QLLN IIA  +E FI+     PPKYLDAA  Q            
Subjt:  FPP-PSSHTLSFFQPFNTSPQYFPPPQ--QPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFIN-GMPPPPKYLDAAETQ------------

Query:  ---------------LGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYN
                       +G+I+  STA +IW  L   YES S A +M L SQLQ+I K  + +S+YL+++K + D+F+ IGEPLSYRD L  ILEGL  EY+
Subjt:  ---------------LGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYN

Query:  PFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQAN
         FVTSI NRSDRPSL +V SLL  Y  RL ++S  + LN  QAN
Subjt:  PFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQAN

A0A6J1DQX7 uncharacterized protein LOC111022315; e value 3.2e-59; identity 51.01%
Query:  QFPPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFING-MPPPPKYLD-------------------
        QFPPP+ + L+            PP     NP+PTL  PLN+KL+D+N+LLWKNQLLN +IA  + G+++G + PPP++LD                   
Subjt:  QFPPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFING-MPPPPKYLD-------------------

Query:  --------AAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPF
                 +E ++GE++   T  +IW  L  VY+S +TARIMGL+++LQ + KDG SVSQYLA+IK+I DKF+ +GEPLSYRDHL ++L+GLG+EYN F
Subjt:  --------AAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPF

Query:  VTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLS
        VTSI NR+D PSL DVRSLLLAY ARL+KQ++V+QLN+ QANL NLS
Subjt:  VTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLS

A0A7J0EGI5 Uncharacterized protein; e value 6.4e-36; identity 39.66%
Query:  PPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAA-------------------
        PPP+S+ L      N +PQ       P    P++  PL +KL D NY++WK QLLN +IA  +E F++G    PP++LD                     
Subjt:  PPPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPP-PPKYLDAA-------------------

Query:  --------ETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVT
                E+ LG+I+G ++AS+IWE L  +Y ++S A +  LR+ LQ I K+GL+   Y+ + + + +  + IGEP++Y DHL Y L GLG +YNPFVT
Subjt:  --------ETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVT

Query:  SIQNRSDRPSLADVRSLLLAYVARLEKQSSVE
        SIQ+++ RPS+ +V SLLL+Y ARLE+QS+ +
Subjt:  SIQNRSDRPSLADVRSLLLAYVARLEKQSSVE

A0A803NL56 Uncharacterized protein; e value 2.8e-31; identity 35.57%
Query:  PPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTP-------PLNIKLSDSNYLLWKNQLLNHIIAFDMEGFING-------MPPPPK------------
        P  + T++  Q  NT+     PP    +  P+L P        +++KL D+NYL+W+ Q+ N IIA  +EG+I+G        P                
Subjt:  PPSSHTLSFFQPFNTSPQYFPPPQQPQNPYPTLTP-------PLNIKLSDSNYLLWKNQLLNHIIAFDMEGFING-------MPPPPK------------

Query:  ---------YLDAAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTE
                 Y   +++ LG+I+G +TA+EIW  L   Y ++S AR    R  LQ + KD L+ S YL ++K + +  + +G+P+S ++HL Y+L GLG E
Subjt:  ---------YLDAAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTE

Query:  YNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSFP
        YN FVT I  R  +P++ +V +LLL+Y ARLE+Q++    + +QAN ANLSFP
Subjt:  YNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSFP

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 3.9e-14; identity 26.18%
Query:  KLSDSNYLLWKNQLLNHIIAFDMEGFING---MPPPPKYLDAA------------------ETQLGEI--------IGCSTASEIWEHLRVVYESSSTAR
        KL+ +NYL+W  Q+      +++ GF++G   MPP     DAA                     LG I           +TA++IWE LR +Y + S   
Subjt:  KLSDSNYLLWKNQLLNHIIAFDMEGFING---MPPPPKYLDAA------------------ETQLGEI--------IGCSTASEIWEHLRVVYESSSTAR

Query:  IMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSS
        +  LR+QL++ +K   ++  Y+  +    D+ +++G+P+ + + +  +LE L  EY P +  I  +   P+L ++   LL + +++   SS
Subjt:  IMGLRSQLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSS

Arabidopsis top hits (e value, % identity, alignment)
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value 9.8e-13; identity 23.2%
Query:  PLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPPPPKYLD------------------AAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQ
        P+ + + +SNY  W+   L H ++FD+ G I+G   P    D                    +   G  +  ST+ +IW  ++  + ++  AR + L S+
Subjt:  PLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPPPPKYLD------------------AAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRSQ

Query:  LQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEK
        L+      + V+ Y  ++K + D    +  P++ R+ + Y+L GL  +++  +  I++R   PS  D  ++L     RL++
Subjt:  LQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEK

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); e value 1.7e-12; identity 25%
Query:  LNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPPPPK-----------------YLDAAETQLGEII--GCSTASEIWEHLRVVYESSSTARIMGLRSQ
        + + L+  NY +W+       ++F + G I+G   P                   Y    ++ L  II  GC TA ++W  L  ++  +  AR +   ++
Subjt:  LNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPPPPK-----------------YLDAAETQLGEII--GCSTASEIWEHLRVVYESSSTARIMGLRSQ

Query:  LQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSF
        L+  + D LSV +Y  ++K + D  + +  P+S R  + ++L GL  +Y+  +  I+++S  PS  + RS+LL   +RL  +S     +    +L+N+ F
Subjt:  LQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSF


Sequences
CDS sequence
ATGGCTTCAACCTCTAGCTCATCGTCGTCCACCATGGATATCTTTCACACCCTGCCGACGTCTTCCTCTGCTCTTACCATTCCTATTCCAACACCTGGGCTTTCGTCAAC
CTCTATCACCACCACCCCTGTGTCGACACCTATCGCTGCTCAGCGCACCTCCACCAGAGCACCAAATACCAATCCAAACACAACTGCCCGACCCTTAAACCCTAACATTC
CCCCTTTCTCTTCGACTTTTCAGCATTATCCCAATGTTTCTTCATCTTTTCCTTTTCCCAATTCTTCGACTCAGGCCGGTTTCCAGTTTCCTCCACCTTCTTCCCATACT
CTTTCCTTCTTCCAGCCCTTTAATACCTCACCTCAGTATTTTCCACCACCTCAACAACCTCAGAATCCATATCCCACTCTCACTCCTCCCCTGAATATCAAGCTCTCTGA
CTCGAACTACCTCTTGTGGAAGAATCAGCTGCTGAACCACATTATTGCCTTCGACATGGAAGGTTTCATTAACGGTATGCCTCCCCCTCCCAAGTATCTAGATGCTGCTG
AAACACAGCTAGGTGAAATTATAGGCTGCTCTACTGCTTCTGAAATCTGGGAACACCTTAGAGTGGTGTATGAGTCTTCTTCCACAGCTAGGATCATGGGGTTACGGTCT
CAATTGCAGAAAATTAGTAAGGATGGTCTATCGGTTTCACAATACTTGGCTCAAATTAAGGACATAGTAGACAAATTTTCGGTCATAGGTGAACCATTATCCTATAGAGA
TCATCTAGGGTACATACTTGAAGGTCTCGGTACCGAATATAATCCTTTCGTAACCTCCATACAAAATCGCAGTGATCGTCCATCGCTAGCTGATGTCCGCAGTCTTCTTC
TTGCATATGTAGCCAGGCTTGAAAAACAATCCTCCGTTGAGCAGCTGAATATGGTTCAGGCCAACCTCGCTAACCTATCTTTCCCCTCCTCTTAA
mRNA sequence
ATGGCTTCAACCTCTAGCTCATCGTCGTCCACCATGGATATCTTTCACACCCTGCCGACGTCTTCCTCTGCTCTTACCATTCCTATTCCAACACCTGGGCTTTCGTCAAC
CTCTATCACCACCACCCCTGTGTCGACACCTATCGCTGCTCAGCGCACCTCCACCAGAGCACCAAATACCAATCCAAACACAACTGCCCGACCCTTAAACCCTAACATTC
CCCCTTTCTCTTCGACTTTTCAGCATTATCCCAATGTTTCTTCATCTTTTCCTTTTCCCAATTCTTCGACTCAGGCCGGTTTCCAGTTTCCTCCACCTTCTTCCCATACT
CTTTCCTTCTTCCAGCCCTTTAATACCTCACCTCAGTATTTTCCACCACCTCAACAACCTCAGAATCCATATCCCACTCTCACTCCTCCCCTGAATATCAAGCTCTCTGA
CTCGAACTACCTCTTGTGGAAGAATCAGCTGCTGAACCACATTATTGCCTTCGACATGGAAGGTTTCATTAACGGTATGCCTCCCCCTCCCAAGTATCTAGATGCTGCTG
AAACACAGCTAGGTGAAATTATAGGCTGCTCTACTGCTTCTGAAATCTGGGAACACCTTAGAGTGGTGTATGAGTCTTCTTCCACAGCTAGGATCATGGGGTTACGGTCT
CAATTGCAGAAAATTAGTAAGGATGGTCTATCGGTTTCACAATACTTGGCTCAAATTAAGGACATAGTAGACAAATTTTCGGTCATAGGTGAACCATTATCCTATAGAGA
TCATCTAGGGTACATACTTGAAGGTCTCGGTACCGAATATAATCCTTTCGTAACCTCCATACAAAATCGCAGTGATCGTCCATCGCTAGCTGATGTCCGCAGTCTTCTTC
TTGCATATGTAGCCAGGCTTGAAAAACAATCCTCCGTTGAGCAGCTGAATATGGTTCAGGCCAACCTCGCTAACCTATCTTTCCCCTCCTCTTAA
Protein sequence
MASTSSSSSSTMDIFHTLPTSSSALTIPIPTPGLSSTSITTTPVSTPIAAQRTSTRAPNTNPNTTARPLNPNIPPFSSTFQHYPNVSSSFPFPNSSTQAGFQFPPPSSHT
LSFFQPFNTSPQYFPPPQQPQNPYPTLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMEGFINGMPPPPKYLDAAETQLGEIIGCSTASEIWEHLRVVYESSSTARIMGLRS
QLQKISKDGLSVSQYLAQIKDIVDKFSVIGEPLSYRDHLGYILEGLGTEYNPFVTSIQNRSDRPSLADVRSLLLAYVARLEKQSSVEQLNMVQANLANLSFPSS