; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025709 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025709
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr10:18223297..18226400
RNA-Seq ExpressionLag0025709
SyntenyLag0025709
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFY82848.1 hypothetical protein Acr_02g0010880 [Actinidia rufa]3.6e-2030.48Show/hide
Query:  FLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLG----PDVLGRP
        +    E   G+I+G + A +IWE L  +Y ++S   +  LR+ LQ I+KDGLT L Y+ + + + N  ++I EP+ Y DH  Y L  LG    P V    
Subjt:  FLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLG----PDVLGRP

Query:  QPSARP---------------------------------------PRWSSPSTNC-----------------------------PQCQICDKMGHTALVC
          + RP                                       P++ +PSTN                              P+CQIC K GHTA  C
Subjt:  QPSARP---------------------------------------PRWSSPSTNC-----------------------------PQCQICDKMGHTALVC

Query:  YNRHNPLYHASTSAPPPQALFNQFQPSSSPTP-VSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL
        Y+R N  Y      PPP   FN +    +P P +S S     S  S P+ +W+MDSGA+HH TP +N L
Subjt:  YNRHNPLYHASTSAPPPQALFNQFQPSSSPTP-VSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]9.5e-2932.51Show/hide
Query:  SSSFPFPHSSQPTGFQYPLASSTLPFYPSQPTVPQYFPTPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD-----
        SSS P P    PT    PL SS++P    +P  PQ   T        P++  PL VKL D NY++WK  LLN +IA  +E F++ + +   RFLD     
Subjt:  SSSFPFPHSSQPTGFQYPLASSTLPFYPSQPTVPQYFPTPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD-----

Query:  ----------------------AGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHC
                                E+  G+I+G + A +IWE L  +Y ++S   +  LR+ LQ I+K+GLT L Y+ + + + N  ++I EP+ Y DH 
Subjt:  ----------------------AGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHC

Query:  GYILEDLG----------------PDVLGRPQPSA--RPPRWSSPSTNC-----------------------------PQCQICDKMGHTALVCYNRHNP
         Y L  LG                P V     P++  R P++ +PSTN                              P+CQIC K GHTA  CY+  N 
Subjt:  GYILEDLG----------------PDVLGRPQPSA--RPPRWSSPSTNC-----------------------------PQCQICDKMGHTALVCYNRHNP

Query:  LYHASTSAPPPQALFNQF-QPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL
         Y      PPP   FN +  P+ S +  S+ P  ++ L S P+ +W+MDSGA+HH TP++N L
Subjt:  LYHASTSAPPPQALFNQF-QPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]5.7e-2631.99Show/hide
Query:  PTPQHT--PAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFIN-DTLASSRFLDAGETQ---------------------------FGEIIGCS
        P PQ T    P P+L+  L++KL ++N LL K+ LLN IIA  +E+FI+ D  +  ++LDA   Q                            G+I+  S
Subjt:  PTPQHT--PAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFIN-DTLASSRFLDAGETQ---------------------------FGEIIGCS

Query:  MAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPDV------------------------
         A +IW  L   YES S   +M L SQLQ+I+K  + + +YL+++K + ++F+ I EPL YRD    ILE L  +                         
Subjt:  MAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPDV------------------------

Query:  --------------LGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVS----LPSN
                      L  PQ + R P +++   + PQCQIC K GH AL  Y+R N  YH      P  A FN   P  + +P+S   T S +       +
Subjt:  --------------LGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVS----LPSN

Query:  PNEAWFMDSGATHHMTPNINNL
         + +W+MDSGATHH TP   ++
Subjt:  PNEAWFMDSGATHHMTPNINNL

RVX22862.1 hypothetical protein CK203_008411 [Vitis vinifera]2.1e-2032.94Show/hide
Query:  TPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFI-NDTLASSRFLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQK
        +P  T A +P+L+ PL +KL ++N  LWKN L+N IIA  +E+FI  +T   ++FLD  +     I+    A EIW  L  VY+S S   ++ L SQLQK
Subjt:  TPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFI-NDTLASSRFLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQK

Query:  IRKDGLTVLQYLAQIKDIANKFSAID-----EPLFYRDHCGYILEDLGPDVLGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTS
        I+K+G+T+ +YLA+IK++ +K+SA+D         YR      ++ L        QP  +PP                             NP+    T+
Subjt:  IRKDGLTVLQYLAQIKDIANKFSAID-----EPLFYRDHCGYILEDLGPDVLGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTS

Query:  APPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL
                     S+SPT  S          +N +  W+MDSGATHH TP   NL
Subjt:  APPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]4.5e-3932.79Show/hide
Query:  PTVPQYFPTPQH--TPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD---------------------------AGETQF
        P  P +   P +  +  PFPTL  PLNVKL+D+N+LLWKN LLN +IA  +  +++ T +   +FLD                             E + 
Subjt:  PTVPQYFPTPQH--TPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD---------------------------AGETQF

Query:  GEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPD------------------
        GE++     ++IW  L  VY+S +T  IMGL+++LQ +RKDG +V QYLA+IK+IA+KF+A+ EPL YRDH  ++L+ LG +                  
Subjt:  GEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPD------------------

Query:  -------------------------------------------------------------------VLGRPQPSAR-PPRWSSPSTNCPQCQICDKMGH
                                                                           +LG+PQ   + PP+   PS++  QCQIC K+GH
Subjt:  -------------------------------------------------------------------VLGRPQPSAR-PPRWSSPSTNCPQCQICDKMGH

Query:  TALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPN
        +A VCY+R N  YH ++    PQAL++  QP  SPT  S           +P+E+WFMDSGATHHMTP+
Subjt:  TALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPN

TrEMBL top hitse value%identityAlignment
A0A438GC62 Retrovirus-related Pol polyprotein from transposon RE12.8e-2631.99Show/hide
Query:  PTPQHT--PAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFIN-DTLASSRFLDAGETQ---------------------------FGEIIGCS
        P PQ T    P P+L+  L++KL ++N LL K+ LLN IIA  +E+FI+ D  +  ++LDA   Q                            G+I+  S
Subjt:  PTPQHT--PAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFIN-DTLASSRFLDAGETQ---------------------------FGEIIGCS

Query:  MAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPDV------------------------
         A +IW  L   YES S   +M L SQLQ+I+K  + + +YL+++K + ++F+ I EPL YRD    ILE L  +                         
Subjt:  MAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPDV------------------------

Query:  --------------LGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVS----LPSN
                      L  PQ + R P +++   + PQCQIC K GH AL  Y+R N  YH      P  A FN   P  + +P+S   T S +       +
Subjt:  --------------LGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVS----LPSN

Query:  PNEAWFMDSGATHHMTPNINNL
         + +W+MDSGATHH TP   ++
Subjt:  PNEAWFMDSGATHHMTPNINNL

A0A438KNU0 Uncharacterized protein1.0e-2032.94Show/hide
Query:  TPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFI-NDTLASSRFLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQK
        +P  T A +P+L+ PL +KL ++N  LWKN L+N IIA  +E+FI  +T   ++FLD  +     I+    A EIW  L  VY+S S   ++ L SQLQK
Subjt:  TPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFI-NDTLASSRFLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQK

Query:  IRKDGLTVLQYLAQIKDIANKFSAID-----EPLFYRDHCGYILEDLGPDVLGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTS
        I+K+G+T+ +YLA+IK++ +K+SA+D         YR      ++ L        QP  +PP                             NP+    T+
Subjt:  IRKDGLTVLQYLAQIKDIANKFSAID-----EPLFYRDHCGYILEDLGPDVLGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTS

Query:  APPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL
                     S+SPT  S          +N +  W+MDSGATHH TP   NL
Subjt:  APPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL

A0A6J1DQX7 uncharacterized protein LOC1110223152.2e-3932.79Show/hide
Query:  PTVPQYFPTPQH--TPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD---------------------------AGETQF
        P  P +   P +  +  PFPTL  PLNVKL+D+N+LLWKN LLN +IA  +  +++ T +   +FLD                             E + 
Subjt:  PTVPQYFPTPQH--TPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD---------------------------AGETQF

Query:  GEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPD------------------
        GE++     ++IW  L  VY+S +T  IMGL+++LQ +RKDG +V QYLA+IK+IA+KF+A+ EPL YRDH  ++L+ LG +                  
Subjt:  GEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGPD------------------

Query:  -------------------------------------------------------------------VLGRPQPSAR-PPRWSSPSTNCPQCQICDKMGH
                                                                           +LG+PQ   + PP+   PS++  QCQIC K+GH
Subjt:  -------------------------------------------------------------------VLGRPQPSAR-PPRWSSPSTNCPQCQICDKMGH

Query:  TALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPN
        +A VCY+R N  YH ++    PQAL++  QP  SPT  S           +P+E+WFMDSGATHHMTP+
Subjt:  TALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPN

A0A7J0E8R3 Uncharacterized protein1.7e-2030.48Show/hide
Query:  FLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLG----PDVLGRP
        +    E   G+I+G + A +IWE L  +Y ++S   +  LR+ LQ I+KDGLT L Y+ + + + N  ++I EP+ Y DH  Y L  LG    P V    
Subjt:  FLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLG----PDVLGRP

Query:  QPSARP---------------------------------------PRWSSPSTNC-----------------------------PQCQICDKMGHTALVC
          + RP                                       P++ +PSTN                              P+CQIC K GHTA  C
Subjt:  QPSARP---------------------------------------PRWSSPSTNC-----------------------------PQCQICDKMGHTALVC

Query:  YNRHNPLYHASTSAPPPQALFNQFQPSSSPTP-VSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL
        Y+R N  Y      PPP   FN +    +P P +S S     S  S P+ +W+MDSGA+HH TP +N L
Subjt:  YNRHNPLYHASTSAPPPQALFNQFQPSSSPTP-VSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL

A0A7J0GPN0 UBX domain-containing protein4.6e-2932.51Show/hide
Query:  SSSFPFPHSSQPTGFQYPLASSTLPFYPSQPTVPQYFPTPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD-----
        SSS P P    PT    PL SS++P    +P  PQ   T        P++  PL VKL D NY++WK  LLN +IA  +E F++ + +   RFLD     
Subjt:  SSSFPFPHSSQPTGFQYPLASSTLPFYPSQPTVPQYFPTPQHTPAPFPTLTPPLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDT-LASSRFLD-----

Query:  ----------------------AGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHC
                                E+  G+I+G + A +IWE L  +Y ++S   +  LR+ LQ I+K+GLT L Y+ + + + N  ++I EP+ Y DH 
Subjt:  ----------------------AGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHC

Query:  GYILEDLG----------------PDVLGRPQPSA--RPPRWSSPSTNC-----------------------------PQCQICDKMGHTALVCYNRHNP
         Y L  LG                P V     P++  R P++ +PSTN                              P+CQIC K GHTA  CY+  N 
Subjt:  GYILEDLG----------------PDVLGRPQPSA--RPPRWSSPSTNC-----------------------------PQCQICDKMGHTALVCYNRHNP

Query:  LYHASTSAPPPQALFNQF-QPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL
         Y      PPP   FN +  P+ S +  S+ P  ++ L S P+ +W+MDSGA+HH TP++N L
Subjt:  LYHASTSAPPPQALFNQF-QPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNINNL

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-0436.27Show/hide
Query:  QPSARPPRWSS--PSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNIN
        QPS+   R  +  P     +CQIC   GH+A  C   H             Q+  NQ Q +S  TP       +V+ P N N  W +DSGATHH+T + N
Subjt:  QPSARPPRWSS--PSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNIN

Query:  NL
        NL
Subjt:  NL

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.0e-0622.45Show/hide
Query:  PLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDTLASSRFLDAG-----------------ETQF-GEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQ
        P+ + + +SNY  W+ + L H ++FD+   I+ TL  +   D                     QF G  +  S + +IW  +   + ++     + L S+
Subjt:  PLNVKLSDSNYLLWKNMLLNHIIAFDMENFINDTLASSRFLDAG-----------------ETQF-GEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQ

Query:  LQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGP
        L+      + V  Y  ++K +A+    +D P+  R+   Y+L  L P
Subjt:  LQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGYILEDLGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCAACAGGCAATGATCAAATACCTACCATACCCACTATCTCCACCGAAACCACCATACCCGTCATCTCTTTCACTGCCAATGTGACCACTTCTGTGTCCACACT
TATCTCTTCTCAACCCTACCTTTACATTTCATCTTCTTTCCCCTTCCCTCATTCTTCTCAACCTACCGGTTTTCAATATCCACTGGCGTCTTCAACCCTGCCCTTCTATC
CTTCCCAGCCGACGGTGCCACAGTATTTCCCCACCCCTCAGCATACGCCAGCTCCCTTTCCTACTCTCACGCCCCCTCTCAATGTCAAACTTTCAGATTCCAACTATCTT
TTATGGAAGAACATGCTACTTAACCATATCATTGCCTTTGATATGGAAAACTTCATCAACGACACACTTGCTTCATCTCGTTTTCTTGATGCTGGTGAAACTCAGTTTGG
TGAGATCATTGGTTGTTCTATGGCTTATGAAATATGGGAACACCTTTGCGTGGTTTATGAATCCTCATCTACGACCATGATAATGGGGCTTCGTTCTCAGTTGCAGAAGA
TTCGCAAGGATGGTCTAACAGTTTTGCAATATCTTGCTCAGATAAAGGATATTGCCAACAAGTTCTCAGCCATTGATGAGCCTCTCTTCTATCGGGATCATTGTGGTTAC
ATACTCGAAGACCTCGGTCCTGATGTTTTAGGCCGACCACAGCCTTCCGCTCGTCCCCCTCGTTGGTCATCTCCTTCAACAAATTGTCCTCAATGCCAAATCTGTGACAA
AATGGGGCACACAGCTTTGGTGTGTTATAATCGCCACAATCCTCTATATCATGCTTCTACTTCTGCCCCTCCCCCTCAAGCTCTCTTTAATCAATTTCAGCCTTCTTCCT
CTCCCACCCCTGTTTCAGACTCTCCAACTGATTCTGTTTCTTTACCTTCGAATCCCAATGAAGCATGGTTTATGGATTCAGGGGCAACACATCATATGACCCCTAATATC
AATAACCTTCAGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCAACAGGCAATGATCAAATACCTACCATACCCACTATCTCCACCGAAACCACCATACCCGTCATCTCTTTCACTGCCAATGTGACCACTTCTGTGTCCACACT
TATCTCTTCTCAACCCTACCTTTACATTTCATCTTCTTTCCCCTTCCCTCATTCTTCTCAACCTACCGGTTTTCAATATCCACTGGCGTCTTCAACCCTGCCCTTCTATC
CTTCCCAGCCGACGGTGCCACAGTATTTCCCCACCCCTCAGCATACGCCAGCTCCCTTTCCTACTCTCACGCCCCCTCTCAATGTCAAACTTTCAGATTCCAACTATCTT
TTATGGAAGAACATGCTACTTAACCATATCATTGCCTTTGATATGGAAAACTTCATCAACGACACACTTGCTTCATCTCGTTTTCTTGATGCTGGTGAAACTCAGTTTGG
TGAGATCATTGGTTGTTCTATGGCTTATGAAATATGGGAACACCTTTGCGTGGTTTATGAATCCTCATCTACGACCATGATAATGGGGCTTCGTTCTCAGTTGCAGAAGA
TTCGCAAGGATGGTCTAACAGTTTTGCAATATCTTGCTCAGATAAAGGATATTGCCAACAAGTTCTCAGCCATTGATGAGCCTCTCTTCTATCGGGATCATTGTGGTTAC
ATACTCGAAGACCTCGGTCCTGATGTTTTAGGCCGACCACAGCCTTCCGCTCGTCCCCCTCGTTGGTCATCTCCTTCAACAAATTGTCCTCAATGCCAAATCTGTGACAA
AATGGGGCACACAGCTTTGGTGTGTTATAATCGCCACAATCCTCTATATCATGCTTCTACTTCTGCCCCTCCCCCTCAAGCTCTCTTTAATCAATTTCAGCCTTCTTCCT
CTCCCACCCCTGTTTCAGACTCTCCAACTGATTCTGTTTCTTTACCTTCGAATCCCAATGAAGCATGGTTTATGGATTCAGGGGCAACACATCATATGACCCCTAATATC
AATAACCTTCAGCAATAA
Protein sequenceShow/hide protein sequence
MSSTGNDQIPTIPTISTETTIPVISFTANVTTSVSTLISSQPYLYISSSFPFPHSSQPTGFQYPLASSTLPFYPSQPTVPQYFPTPQHTPAPFPTLTPPLNVKLSDSNYL
LWKNMLLNHIIAFDMENFINDTLASSRFLDAGETQFGEIIGCSMAYEIWEHLCVVYESSSTTMIMGLRSQLQKIRKDGLTVLQYLAQIKDIANKFSAIDEPLFYRDHCGY
ILEDLGPDVLGRPQPSARPPRWSSPSTNCPQCQICDKMGHTALVCYNRHNPLYHASTSAPPPQALFNQFQPSSSPTPVSDSPTDSVSLPSNPNEAWFMDSGATHHMTPNI
NNLQQ