; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027718 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027718
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:4016217..4017509
RNA-Seq ExpressionLag0027718
SyntenyLag0027718
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS33695.1 hypothetical protein Acr_00g0030110 [Actinidia rufa]8.0e-3243.07Show/hide
Query:  SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDH
        ++S A +  LR+ LQ I+KDGLT   Y+ + + + +  ++IGEP++Y D L Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++YD+RLE+QS+ D 
Subjt:  SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDH

Query:  LNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQ
        L+ +QANLANL+    +   P +++ P  NS+  P      PS    P           SSP+P          RP+CQIC K GHTA  CY+R N  YQ
Subjt:  LNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQ

Query:  AP
         P
Subjt:  AP

XP_022148871.1 uncharacterized protein LOC111017438 [Momordica charantia]1.4e-3949.28Show/hide
Query:  QLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS
        ++Q+++KDGL+V+QYLA+IK+I  K S+IGEP+S +D + Y++EGLG EYN FVTSIQNR+D  +L DVR+LL+AYD RLEKQ+SVD LN+VQAN+ANL 
Subjt:  QLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS

Query:  SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNR---PQCQICSKFGHTALVCYNRHNPMYQ----APSQS
         + ++     +   P Q+S PRP   F F +        PGLLG+P  + +PP WPPS    R    QCQIC K GHT   CY+R N  Y+      +Q 
Subjt:  SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNR---PQCQICSKFGHTALVCYNRHNPMYQ----APSQS

Query:  SPQAFLN
        +P A  +
Subjt:  SPQAFLN

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]7.7e-5942.05Show/hide
Query:  QYPPQTTHSLPFFPAYAPPSVFPPTQQPTSAYPSLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDT-PAPLKHLDAAQTQ---------------
        Q+PP T + L            PP     + +P+L  PLN+KL+D+N+LLWKNQLLN +IA  +  Y++ T   P + LD  Q Q               
Subjt:  QYPPQTTHSLPFFPAYAPPSVFPPTQQPTSAYPSLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDT-PAPLKHLDAAQTQ---------------

Query:  ----------------------------------SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFV
                                          S +TAR+MGL+++LQ +RKDG +V+QYLA+IK+IADKF+ +GEPLSYRD L +VL+GLG EYN FV
Subjt:  ----------------------------------SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFV

Query:  TSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS--SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRP
        TSI NR D PSL DVRSLL+AY++RL+KQ++VD LN+ QANL NLS   ++KRPP  +S     ++SFP  P S          +    +LG+PQS    
Subjt:  TSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS--SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRP

Query:  PRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP
         +WPP  + ++ QCQIC K GH+A VCY+R N  Y     +SPQA  +   P
Subjt:  PRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP

XP_038887133.1 uncharacterized protein LOC120077323 [Benincasa hispida]5.2e-3964.79Show/hide
Query:  QSSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVD
        +SSS A +MG  SQLQKI+KDGLTV+QYLAQIKD+ D F+ IGEPLSYRD L Y+LEGLG EYNPFV+SI NRT+RPS+ADVR+LLI YDSRLEKQ++ D
Subjt:  QSSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVD

Query:  HLNLVQANLANLS-SSTKRPPR-------PYSSTTPRQNSFP
        HL L+QAN+A+LS +S  R P+          S+TP   SFP
Subjt:  HLNLVQANLANLS-SSTKRPPR-------PYSSTTPRQNSFP

XP_038891713.1 uncharacterized protein LOC120081111 [Benincasa hispida]1.2e-3545.07Show/hide
Query:  MGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQAN
        M L+++LQKIRKD L+++QYL+QIKD+ADKFS +GE +SYRD L ++L+GLG EYN FVTSIQN  D  S+ DV SLL++Y+++LEKQ+++DHLN+ QA 
Subjt:  MGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQAN

Query:  LANLS---SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPR-PPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPS
        L+ LS   +S +   RP+    P  +S   P  +F    +  +PS    +  RP  + + PP  PPS   ++PQCQI  KFGH    C+   +  YQ   
Subjt:  LANLS---SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPR-PPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPS

Query:  QSSPQAFLNQFSP
          +PQA ++   P
Subjt:  QSSPQAFLNQFSP

TrEMBL top hitse value%identityAlignment
A0A6J1D6N7 uncharacterized protein LOC1110174386.6e-4049.28Show/hide
Query:  QLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS
        ++Q+++KDGL+V+QYLA+IK+I  K S+IGEP+S +D + Y++EGLG EYN FVTSIQNR+D  +L DVR+LL+AYD RLEKQ+SVD LN+VQAN+ANL 
Subjt:  QLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS

Query:  SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNR---PQCQICSKFGHTALVCYNRHNPMYQ----APSQS
         + ++     +   P Q+S PRP   F F +        PGLLG+P  + +PP WPPS    R    QCQIC K GHT   CY+R N  Y+      +Q 
Subjt:  SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNR---PQCQICSKFGHTALVCYNRHNPMYQ----APSQS

Query:  SPQAFLN
        +P A  +
Subjt:  SPQAFLN

A0A6J1DQX7 uncharacterized protein LOC1110223153.7e-5942.05Show/hide
Query:  QYPPQTTHSLPFFPAYAPPSVFPPTQQPTSAYPSLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDT-PAPLKHLDAAQTQ---------------
        Q+PP T + L            PP     + +P+L  PLN+KL+D+N+LLWKNQLLN +IA  +  Y++ T   P + LD  Q Q               
Subjt:  QYPPQTTHSLPFFPAYAPPSVFPPTQQPTSAYPSLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDT-PAPLKHLDAAQTQ---------------

Query:  ----------------------------------SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFV
                                          S +TAR+MGL+++LQ +RKDG +V+QYLA+IK+IADKF+ +GEPLSYRD L +VL+GLG EYN FV
Subjt:  ----------------------------------SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFV

Query:  TSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS--SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRP
        TSI NR D PSL DVRSLL+AY++RL+KQ++VD LN+ QANL NLS   ++KRPP  +S     ++SFP  P S          +    +LG+PQS    
Subjt:  TSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLS--SSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRP

Query:  PRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP
         +WPP  + ++ QCQIC K GH+A VCY+R N  Y     +SPQA  +   P
Subjt:  PRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP

A0A7J0DER3 Uncharacterized protein3.9e-3243.07Show/hide
Query:  SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDH
        ++S A +  LR+ LQ I+KDGLT   Y+ + + + +  ++IGEP++Y D L Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++YD+RLE+QS+ D 
Subjt:  SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDH

Query:  LNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQ
        L+ +QANLANL+    +   P +++ P  NS+  P      PS    P           SSP+P          RP+CQIC K GHTA  CY+R N  YQ
Subjt:  LNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQ

Query:  AP
         P
Subjt:  AP

A0A7J0E8R3 Uncharacterized protein3.9e-3243.07Show/hide
Query:  SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDH
        ++S A +  LR+ LQ I+KDGLT   Y+ + + + +  ++IGEP++Y D L Y L GLG +YNPFVTSIQ++  RPS+ +V SLL++YD+RLE+QS+ D 
Subjt:  SSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDH

Query:  LNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQ
        L+ +QANLANL+    +   P +++ P  NS+  P      PS    P           SSP+P          RP+CQIC K GHTA  CY+R N  YQ
Subjt:  LNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQ

Query:  AP
         P
Subjt:  AP

A5BPS3 Uncharacterized protein6.6e-3239.55Show/hide
Query:  QLLNHIIAFDMENYI-NDTPAPLKHLDAAQTQSSS------------TARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLE
        Q +N      +E++I  +TP P K LD AQ Q +              + +    +   KI+K+G+T+++YLA+IK++ DK+S +GEPLSYRD+L Y L 
Subjt:  QLLNHIIAFDMENYI-NDTPAPLKHLDAAQTQSSS------------TARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLE

Query:  GLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPP----TSFP-FPSNLSM---P
        GL  EY+ FVTSI NR+D+ SL +V SLL  Y   LE++++   L   Q NL   S   K           +Q +FP+ P    +SFP  P NL     P
Subjt:  GLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPP----TSFP-FPSNLSM---P

Query:  SVGPGLLGRPQSSPR-PPRW----PPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAP---SQSSPQ
        +  P +LG+PQ  P+   +W       N   RPQCQIC KFGH AL CY+R N  YQ     SQ  PQ
Subjt:  SVGPGLLGRPQSSPR-PPRW----PPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAP---SQSSPQ

SwissProt top hitse value%identityAlignment
Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-0422Show/hide
Query:  KLSDSNYLLWKNQLLNHIIAFDMENYIN-DTPAPLKHL--DAA------------QTQSSSTARVMGLRSQLQKIRKDGLTVTQ---------------Y
        KL+ +NYL+W  Q+      +++  +++  TP P   +  DA             Q +   +A +  +   +Q       T  Q               +
Subjt:  KLSDSNYLLWKNQLLNHIIAFDMENYIN-DTPAPLKHL--DAA------------QTQSSSTARVMGLRSQLQKIRKDGLTVTQ---------------Y

Query:  LAQIKDIA--DKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSSSTKR------PP
        + Q++ I   D+ + +G+P+ + +Q+  VLE L  +Y P +  I  +   PSL ++   LI  +S+L   +S + + +    + + +++T R        
Subjt:  LAQIKDIA--DKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSSSTKR------PP

Query:  RPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRP---QCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP
        R Y++   R NS+                        +P SS         N Q +P   +CQICS  GH+A  C   H    Q  S ++ Q   + F+P
Subjt:  RPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPPRWPPSNNQNRP---QCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.0e-0820.39Show/hide
Query:  PLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDTPAPLKHLDA----------------------------------------AQTQSSSTARVMGLRSQ
        P+ + + +SNY  W+   L H ++FD+  +I+ T  P    D                                          Q +++  AR + L S+
Subjt:  PLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDTPAPLKHLDA----------------------------------------AQTQSSSTARVMGLRSQ

Query:  LQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSS
        L+      + V  Y  ++K +AD    +  P++ R+ + YVL GL P+++  +  I++R   PS  D  ++L   + RL++    +  ++  ++ + + +
Subjt:  LQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSS

Query:  STKRPP
         ++ PP
Subjt:  STKRPP

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.5e-0726.12Show/hide
Query:  QSSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRL--EKQSS
        + +  AR +   ++L+    D L+V +Y  ++K ++D  + +  P+S R  + ++L GL  +Y+  +  I++++  PS  + RS+L+  +SRL  + +SS
Subjt:  QSSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVLEGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRL--EKQSS

Query:  VDHLNLVQANLANLSSSTKRPPRPYSSTTPRQNS
        + H N    +L+N+  +  R    Y       NS
Subjt:  VDHLNLVQANLANLSSSTKRPPRPYSSTTPRQNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTGCTTCTTCTACCTCGTCAGCTAATGATGGTATCCTACCCCCTATTACCTCTTCAGTCACCGCCCCAGTAACCACTCCGGTAGCCACACCTGTGTCGTCTCA
GAGGGCCTCTTTTACCACTTCTTCTACCACCCCCATTCCCTCCAACAGACCTCTCAACCCTAATACTTCTTCTTTCCAGCCTAATTTTCCTCAGTTCCCAAATGCTTTTC
CCCAAAACCCTTCGTCTGGTTTTCAGTACCCACCTCAGACGACTCATTCTCTTCCATTTTTTCCAGCATATGCGCCTCCATCTGTTTTTCCGCCTACGCAGCAACCAACC
TCTGCTTATCCTTCTCTTACACCTCCACTCAATATTAAGCTCTCTGATTCAAATTATCTTCTCTGGAAAAATCAGCTTCTCAACCACATAATCGCATTTGATATGGAGAA
CTACATCAATGACACTCCGGCTCCTTTGAAACACTTGGATGCAGCACAAACCCAGTCGTCTTCTACTGCTAGGGTAATGGGTCTCCGTTCTCAATTACAAAAGATTCGTA
AAGACGGCTTGACTGTCACTCAGTATCTAGCCCAAATTAAAGACATAGCTGATAAGTTCTCGACCATTGGAGAACCATTGTCTTATAGGGATCAGCTGGGTTATGTTTTG
GAAGGTCTCGGGCCGGAGTATAATCCTTTCGTCACATCCATTCAAAATAGAACAGATAGACCTTCTCTTGCGGATGTCCGCAGTCTGCTTATTGCCTATGATTCTCGGCT
AGAAAAACAGAGTTCAGTTGATCATTTGAATTTAGTTCAGGCCAACCTTGCCAATTTATCATCCTCTACCAAACGACCCCCTCGCCCTTACTCTTCCACAACACCCAGGC
AAAACTCCTTTCCTCGCCCACCTACTTCTTTTCCTTTTCCTTCCAATCTTTCGATGCCATCCGTCGGTCCTGGTCTCTTAGGCCGTCCTCAATCATCTCCTCGTCCTCCA
CGGTGGCCTCCCTCTAATAACCAAAATCGCCCCCAATGCCAAATTTGTAGCAAGTTTGGCCACACCGCCCTCGTTTGCTATAATCGCCATAACCCAATGTACCAAGCTCC
CTCACAATCCTCCCCTCAAGCCTTCCTCAATCAATTCTCGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTGCTTCTTCTACCTCGTCAGCTAATGATGGTATCCTACCCCCTATTACCTCTTCAGTCACCGCCCCAGTAACCACTCCGGTAGCCACACCTGTGTCGTCTCA
GAGGGCCTCTTTTACCACTTCTTCTACCACCCCCATTCCCTCCAACAGACCTCTCAACCCTAATACTTCTTCTTTCCAGCCTAATTTTCCTCAGTTCCCAAATGCTTTTC
CCCAAAACCCTTCGTCTGGTTTTCAGTACCCACCTCAGACGACTCATTCTCTTCCATTTTTTCCAGCATATGCGCCTCCATCTGTTTTTCCGCCTACGCAGCAACCAACC
TCTGCTTATCCTTCTCTTACACCTCCACTCAATATTAAGCTCTCTGATTCAAATTATCTTCTCTGGAAAAATCAGCTTCTCAACCACATAATCGCATTTGATATGGAGAA
CTACATCAATGACACTCCGGCTCCTTTGAAACACTTGGATGCAGCACAAACCCAGTCGTCTTCTACTGCTAGGGTAATGGGTCTCCGTTCTCAATTACAAAAGATTCGTA
AAGACGGCTTGACTGTCACTCAGTATCTAGCCCAAATTAAAGACATAGCTGATAAGTTCTCGACCATTGGAGAACCATTGTCTTATAGGGATCAGCTGGGTTATGTTTTG
GAAGGTCTCGGGCCGGAGTATAATCCTTTCGTCACATCCATTCAAAATAGAACAGATAGACCTTCTCTTGCGGATGTCCGCAGTCTGCTTATTGCCTATGATTCTCGGCT
AGAAAAACAGAGTTCAGTTGATCATTTGAATTTAGTTCAGGCCAACCTTGCCAATTTATCATCCTCTACCAAACGACCCCCTCGCCCTTACTCTTCCACAACACCCAGGC
AAAACTCCTTTCCTCGCCCACCTACTTCTTTTCCTTTTCCTTCCAATCTTTCGATGCCATCCGTCGGTCCTGGTCTCTTAGGCCGTCCTCAATCATCTCCTCGTCCTCCA
CGGTGGCCTCCCTCTAATAACCAAAATCGCCCCCAATGCCAAATTTGTAGCAAGTTTGGCCACACCGCCCTCGTTTGCTATAATCGCCATAACCCAATGTACCAAGCTCC
CTCACAATCCTCCCCTCAAGCCTTCCTCAATCAATTCTCGCCTTAA
Protein sequenceShow/hide protein sequence
MASASSTSSANDGILPPITSSVTAPVTTPVATPVSSQRASFTTSSTTPIPSNRPLNPNTSSFQPNFPQFPNAFPQNPSSGFQYPPQTTHSLPFFPAYAPPSVFPPTQQPT
SAYPSLTPPLNIKLSDSNYLLWKNQLLNHIIAFDMENYINDTPAPLKHLDAAQTQSSSTARVMGLRSQLQKIRKDGLTVTQYLAQIKDIADKFSTIGEPLSYRDQLGYVL
EGLGPEYNPFVTSIQNRTDRPSLADVRSLLIAYDSRLEKQSSVDHLNLVQANLANLSSSTKRPPRPYSSTTPRQNSFPRPPTSFPFPSNLSMPSVGPGLLGRPQSSPRPP
RWPPSNNQNRPQCQICSKFGHTALVCYNRHNPMYQAPSQSSPQAFLNQFSP