; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035027 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035027
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:13776099..13781942
RNA-Seq ExpressionLag0035027
SyntenyLag0035027
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN73380.1 hypothetical protein VITISV_032547 [Vitis vinifera]5.5e-2936.16Show/hide
Query:  PLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL--------EDVRNMLLAFEARLEKQNTVDQL--SLA
        PL++KL+   +L+WKNQLLN ++ NGL   LD T   P KFLD QQ   NPE+ +W RYN L +        E+V   ++ +    E    + Q+  S +
Subjt:  PLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL--------EDVRNMLLAFEARLEKQNTVDQL--SLA

Query:  QANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVL-PNTLSPSILGKPQ--PQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQT
         A L    +              +  ++ TP     P L  N   PSILG+PQ  P     P +  +S +  ++P+ QICGKFGHTALIC HR NL YQ 
Subjt:  QANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVL-PNTLSPSILGKPQ--PQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQT

Query:  PSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD
        PS +   TI T+        S   S ++  +   D +WF++S  THH T D++ + N   ++G EQV VG+
Subjt:  PSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD

CAN77126.1 hypothetical protein VITISV_013628 [Vitis vinifera]4.5e-2330.42Show/hide
Query:  QLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL------------------------------------------------------
        Q +N    + L  F++G  P P KFLDD Q Q NP F+ WER N L                                                      
Subjt:  QLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL------------------------------------------------------

Query:  --------------------ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQS----HFNGRHSMPR-PSLNTTVFSRTPFNPSSPVLPNTLSPSI
                            +L++V ++L  +   LE++NT  QL  +Q NL+++      H N + + P+ P  + + F   P N      PN+  PSI
Subjt:  --------------------ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQS----HFNGRHSMPR-PSLNTTVFSRTPFNPSSPVLPNTLSPSI

Query:  LGKP--QPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDT------YHLDENWFLNS
        LGKP  QPQSS    QK  + +   RPQ QICGKFGH AL C HR NL YQ   P        ++QP  QP +  A++ +  T       + D  W+++S
Subjt:  LGKP--QPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDT------YHLDENWFLNS

Query:  GATHHMTHDVASLSNPMSYLGGEQVTVGDVQV
        GATHH T +  +L++P    G EQ  VG+  V
Subjt:  GATHHMTHDVASLSNPMSYLGGEQVTVGDVQV

GFZ12741.1 UBX domain-containing protein [Actinidia rufa]2.8e-2530.53Show/hide
Query:  PAPPTANPNPL----HPNP----------FPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL-
        P PPT+NP P      PNP           P++ QPL+VKL+D  +++WK QLLN ++ANGL  FLDG+   P +FLD QQ Q NPEF  W+RYN L + 
Subjt:  PAPPTANPNPL----HPNP----------FPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL-

Query:  -------EDVRNMLLAFEARLEKQNTVDQLSLAQ--ANLSSFQS----------------------------------------HFNG------------
               E +   ++ + +  +    +++L  A   A+L+  ++                                        +F G            
Subjt:  -------EDVRNMLLAFEARLEKQNTVDQLSLAQ--ANLSSFQS----------------------------------------HFNG------------

Query:  ------RHSMPRPSLNTTVFSRTPF-NPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPN-RPQYQICGKFGHTALICRHRTNLAYQTPSP-----
              R S+  P+  T++  +  F NPS+   PN+ S S         S+SP+      SSP  RP+ QIC K GHTA  C H TNL YQ P P     
Subjt:  ------RHSMPRPSLNTTVFSRTPF-NPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPN-RPQYQICGKFGHTALICRHRTNLAYQTPSP-----

Query:  QAMLT-----IATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGDVQVSAAVFLSSVTSPNWHLRLGHPAAT
         A +T       +SNQP    LS             D +W+++SGA+HH T D+  L +   Y G +QVTVG          +  TSP     +GH +  
Subjt:  QAMLT-----IATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGDVQVSAAVFLSSVTSPNWHLRLGHPAAT

Query:  TLHSVLSKLHSQRSQS
          +S+L   H   S S
Subjt:  TLHSVLSKLHSQRSQS

RVW77188.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]4.5e-2332.3Show/hide
Query:  VWERYNSLALEDVRNMLLAFEARLEKQNTVD----QLSLAQANLSSFQS----HFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSS
        +  R +   +E + ++LL+++  LE+QN++     Q+ +A  N   ++S     FN  H  P+        S  PF P++    +    +ILGKPQPQ  
Subjt:  VWERYNSLALEDVRNMLLAFEARLEKQNTVD----QLSLAQANLSSFQS----HFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSS

Query:  FSPSQKWSSRSS--PNRPQYQICGKFGHTALICRHRTNLAYQ-TPSPQAMLTIATSNQPLTQPLSDSASVYSH--DTYHL--DENWFLNSGATHHMTHDV
          P  +W    S    +PQ QICGKFGH ALIC H TNL Y   PSP+   T+ T+N  L+ P   S    +H   T++   D NW+++SGATHH T D+
Subjt:  FSPSQKWSSRSS--PNRPQYQICGKFGHTALICRHRTNLAYQ-TPSPQAMLTIATSNQPLTQPLSDSASVYSH--DTYHL--DENWFLNSGATHHMTHDV

Query:  ASLSNPMSYLGGEQVTVGD-----------------------------VQVSAAVFLSSVTSPN-WHLRLGHPAATTLHSVLSKLHSQRSQ
          L     + G +QVTV +                              +V   VF++++  PN WH RLGHPA + ++ +L   +  R+Q
Subjt:  ASLSNPMSYLGGEQVTVGD-----------------------------VQVSAAVFLSSVTSPN-WHLRLGHPAATTLHSVLSKLHSQRSQ

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia]1.2e-5238.38Show/hide
Query:  FPAPPTAN-----PNPLHPNPFPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL-----------
        FP PPT N     PNP   NPFPTLPQPL+VKLND  FLLWKNQLLNA++ANGL  +LDGTI PP +FLD  Q QPNP +  WERYN L           
Subjt:  FPAPPTAN-----PNPLHPNPFPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPNRP
        +LEDVR++LLA+EARL+KQNTVDQL++AQANL +     N +   P+ S         P +P S       S SILGKPQ         KW  + S ++ 
Subjt:  ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPNRP

Query:  QYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD
        Q QICGK GH+A +C HRTN+AY   SPQA+      +   T P S       H+  H DE+WF++SGATHHMT D + L NP  Y GGEQVTVG+
Subjt:  QYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD

TrEMBL top hitse value%identityAlignment
A0A438GYC1 Retrovirus-related Pol polyprotein from transposon RE12.2e-2332.3Show/hide
Query:  VWERYNSLALEDVRNMLLAFEARLEKQNTVD----QLSLAQANLSSFQS----HFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSS
        +  R +   +E + ++LL+++  LE+QN++     Q+ +A  N   ++S     FN  H  P+        S  PF P++    +    +ILGKPQPQ  
Subjt:  VWERYNSLALEDVRNMLLAFEARLEKQNTVD----QLSLAQANLSSFQS----HFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSS

Query:  FSPSQKWSSRSS--PNRPQYQICGKFGHTALICRHRTNLAYQ-TPSPQAMLTIATSNQPLTQPLSDSASVYSH--DTYHL--DENWFLNSGATHHMTHDV
          P  +W    S    +PQ QICGKFGH ALIC H TNL Y   PSP+   T+ T+N  L+ P   S    +H   T++   D NW+++SGATHH T D+
Subjt:  FSPSQKWSSRSS--PNRPQYQICGKFGHTALICRHRTNLAYQ-TPSPQAMLTIATSNQPLTQPLSDSASVYSH--DTYHL--DENWFLNSGATHHMTHDV

Query:  ASLSNPMSYLGGEQVTVGD-----------------------------VQVSAAVFLSSVTSPN-WHLRLGHPAATTLHSVLSKLHSQRSQ
          L     + G +QVTV +                              +V   VF++++  PN WH RLGHPA + ++ +L   +  R+Q
Subjt:  ASLSNPMSYLGGEQVTVGD-----------------------------VQVSAAVFLSSVTSPN-WHLRLGHPAATTLHSVLSKLHSQRSQ

A0A6J1DQX7 uncharacterized protein LOC1110223155.9e-5338.38Show/hide
Query:  FPAPPTAN-----PNPLHPNPFPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL-----------
        FP PPT N     PNP   NPFPTLPQPL+VKLND  FLLWKNQLLNA++ANGL  +LDGTI PP +FLD  Q QPNP +  WERYN L           
Subjt:  FPAPPTAN-----PNPLHPNPFPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPNRP
        +LEDVR++LLA+EARL+KQNTVDQL++AQANL +     N +   P+ S         P +P S       S SILGKPQ         KW  + S ++ 
Subjt:  ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPNRP

Query:  QYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD
        Q QICGK GH+A +C HRTN+AY   SPQA+      +   T P S       H+  H DE+WF++SGATHHMT D + L NP  Y GGEQVTVG+
Subjt:  QYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD

A0A7J0GPN0 UBX domain-containing protein1.4e-2530.53Show/hide
Query:  PAPPTANPNPL----HPNP----------FPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL-
        P PPT+NP P      PNP           P++ QPL+VKL+D  +++WK QLLN ++ANGL  FLDG+   P +FLD QQ Q NPEF  W+RYN L + 
Subjt:  PAPPTANPNPL----HPNP----------FPTLPQPLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL-

Query:  -------EDVRNMLLAFEARLEKQNTVDQLSLAQ--ANLSSFQS----------------------------------------HFNG------------
               E +   ++ + +  +    +++L  A   A+L+  ++                                        +F G            
Subjt:  -------EDVRNMLLAFEARLEKQNTVDQLSLAQ--ANLSSFQS----------------------------------------HFNG------------

Query:  ------RHSMPRPSLNTTVFSRTPF-NPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPN-RPQYQICGKFGHTALICRHRTNLAYQTPSP-----
              R S+  P+  T++  +  F NPS+   PN+ S S         S+SP+      SSP  RP+ QIC K GHTA  C H TNL YQ P P     
Subjt:  ------RHSMPRPSLNTTVFSRTPF-NPSSPVLPNTLSPSILGKPQPQSSFSPSQKWSSRSSPN-RPQYQICGKFGHTALICRHRTNLAYQTPSP-----

Query:  QAMLT-----IATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGDVQVSAAVFLSSVTSPNWHLRLGHPAAT
         A +T       +SNQP    LS             D +W+++SGA+HH T D+  L +   Y G +QVTVG          +  TSP     +GH +  
Subjt:  QAMLT-----IATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGDVQVSAAVFLSSVTSPNWHLRLGHPAAT

Query:  TLHSVLSKLHSQRSQS
          +S+L   H   S S
Subjt:  TLHSVLSKLHSQRSQS

A5AG90 Uncharacterized protein2.7e-2936.16Show/hide
Query:  PLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL--------EDVRNMLLAFEARLEKQNTVDQL--SLA
        PL++KL+   +L+WKNQLLN ++ NGL   LD T   P KFLD QQ   NPE+ +W RYN L +        E+V   ++ +    E    + Q+  S +
Subjt:  PLSVKLNDTKFLLWKNQLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLAL--------EDVRNMLLAFEARLEKQNTVDQL--SLA

Query:  QANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVL-PNTLSPSILGKPQ--PQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQT
         A L    +              +  ++ TP     P L  N   PSILG+PQ  P     P +  +S +  ++P+ QICGKFGHTALIC HR NL YQ 
Subjt:  QANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVL-PNTLSPSILGKPQ--PQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQT

Query:  PSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD
        PS +   TI T+        S   S ++  +   D +WF++S  THH T D++ + N   ++G EQV VG+
Subjt:  PSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPMSYLGGEQVTVGD

A5BPS3 Uncharacterized protein2.2e-2330.42Show/hide
Query:  QLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL------------------------------------------------------
        Q +N    + L  F++G  P P KFLDD Q Q NP F+ WER N L                                                      
Subjt:  QLLNAILANGLHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSL------------------------------------------------------

Query:  --------------------ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQS----HFNGRHSMPR-PSLNTTVFSRTPFNPSSPVLPNTLSPSI
                            +L++V ++L  +   LE++NT  QL  +Q NL+++      H N + + P+ P  + + F   P N      PN+  PSI
Subjt:  --------------------ALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQS----HFNGRHSMPR-PSLNTTVFSRTPFNPSSPVLPNTLSPSI

Query:  LGKP--QPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDT------YHLDENWFLNS
        LGKP  QPQSS    QK  + +   RPQ QICGKFGH AL C HR NL YQ   P        ++QP  QP +  A++ +  T       + D  W+++S
Subjt:  LGKP--QPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDT------YHLDENWFLNS

Query:  GATHHMTHDVASLSNPMSYLGGEQVTVGDVQV
        GATHH T +  +L++P    G EQ  VG+  V
Subjt:  GATHHMTHDVASLSNPMSYLGGEQVTVGDVQV

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.5e-0535.77Show/hide
Query:  KPQPQSS--FSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLT--QPLSDSASVYSHDTYHLDENWFLNSGATHHM
        KP  QSS  F P+   +++S P   + QICG  GH+A  C             Q  L+   S QP +   P    A++     Y    NW L+SGATHH+
Subjt:  KPQPQSS--FSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLT--QPLSDSASVYSHDTYHLDENWFLNSGATHHM

Query:  THDVASLSNPMSYLGGEQVTVGD
        T D  +LS    Y GG+ V V D
Subjt:  THDVASLSNPMSYLGGEQVTVGD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.7e-0431.82Show/hide
Query:  QPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVAS
        QP SS S S   + +  P   + QIC   GH+A  C         T   Q       S  P T P    A++  +  Y+ + NW L+SGATHH+T D  +
Subjt:  QPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVAS

Query:  LSNPMSYLGGEQVTVGDVQVSAAVFLSSVTSP
        LS    Y GG+ V + D          S + P
Subjt:  LSNPMSYLGGEQVTVGDVQVSAAVFLSSVTSP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAGATGAAGAAGGGGGAGAAACTTCATCTTCTCCAAGGAGTTGCAGTGAGGAAGAGGGAACAAGAACAGGAAGAGGAACAGAAACGTACCCAAGTTCGGTGAG
CTCGTCGTCGAGGGTTGCCGTCGGTGAGGGAGGAAGCGCCGTCGATGAGGGAGGAAATGTGCTGCGAGGGAGGAAGCGGCGCCGAGGGTTTGGCTACATTTTCAGCATCC
ATTCGAATGTCAAGGAACAGACGGGGCCAACACAGCATCCATTCGGAGCAAGAGTCGGGTCATCAGGAAGTAGGGAACCGAGGAGCCCAATCCCCAGCCTCGACCCTAGG
CTAAGGCTAACCTCGGGCTTATGCTCAGGCATCAGAGTGGGTGTGGCAAGCACCAGACGATGTGCAATTTCTACTGGTTTTGCAAATCACGTCTTCCCAGTCTCTATAAA
TTCACCGTTGGTGTCATGTGAAGATCAACCAAACTTTCCTCGTCCAAATCCTCAGCGATATTATCCTCCCCAATCTTTTCCAGCTCCCCCAACTGCGAATCCCAATCCCC
TTCACCCCAATCCTTTTCCCACTTTACCCCAACCACTCTCTGTCAAGCTAAACGATACAAAATTTTTACTCTGGAAGAATCAGCTTCTCAACGCCATTCTTGCTAATGGT
TTACACAGGTTCCTCGATGGTACCATTCCTCCCCCTCAGAAGTTCCTTGACGATCAACAATCACAACCGAATCCAGAGTTCCTCGTCTGGGAAAGATACAATAGTCTTGC
CCTTGAAGATGTTCGCAACATGTTGTTAGCTTTCGAGGCTAGGTTGGAAAAGCAAAATACAGTAGATCAACTAAGCTTAGCTCAAGCAAACCTCAGTAGCTTTCAATCTC
ACTTTAATGGCCGCCATTCCATGCCTCGTCCTTCCCTTAATACCACTGTTTTTTCAAGAACTCCCTTCAATCCATCTTCCCCTGTGTTGCCAAATACCTTATCTCCAAGC
ATTTTAGGTAAGCCTCAACCACAATCTTCTTTTTCTCCTTCTCAGAAATGGTCTTCTAGATCAAGTCCTAATCGTCCTCAATACCAAATATGTGGCAAATTTGGGCACAC
CGCTCTTATTTGTCGCCATCGCACAAACTTGGCCTACCAAACTCCATCACCTCAAGCAATGTTAACCATTGCTACTTCAAACCAGCCCCTAACCCAGCCTTTATCGGATT
CTGCTTCTGTTTACTCTCATGATACATATCATCTCGATGAAAATTGGTTCCTCAACTCGGGTGCTACACACCATATGACTCATGATGTTGCCTCTTTGTCCAATCCCATG
TCTTATCTTGGTGGTGAGCAAGTCACGGTCGGTGATGTCCAAGTCTCTGCTGCCGTTTTTCTGTCCTCAGTTACTTCACCCAATTGGCACCTTCGTTTGGGTCATCCAGC
TGCCACAACTTTACATTCTGTTTTGTCTAAGCTCCATAGTCAGAGAAGCCAGAGCCCAGAACATTCTCCCAAAGTCCAGAGCCTTCAGAAGTCAGAGAGTCCAGGGAATT
CAGAAGATCCAAGATTCAGAATTCGGCCATCTCAAGACTCAGAAGATCCAACGCCTGGAAGACTCCAACGAATCAACCGTTTCTTCATCAAGATACAAGTTGAAGATTCA
ACCTTCCTCCGTCTGAGATCAAGCTCGCCAGCCCTCAGATCAGACTCTACATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAGATGAAGAAGGGGGAGAAACTTCATCTTCTCCAAGGAGTTGCAGTGAGGAAGAGGGAACAAGAACAGGAAGAGGAACAGAAACGTACCCAAGTTCGGTGAG
CTCGTCGTCGAGGGTTGCCGTCGGTGAGGGAGGAAGCGCCGTCGATGAGGGAGGAAATGTGCTGCGAGGGAGGAAGCGGCGCCGAGGGTTTGGCTACATTTTCAGCATCC
ATTCGAATGTCAAGGAACAGACGGGGCCAACACAGCATCCATTCGGAGCAAGAGTCGGGTCATCAGGAAGTAGGGAACCGAGGAGCCCAATCCCCAGCCTCGACCCTAGG
CTAAGGCTAACCTCGGGCTTATGCTCAGGCATCAGAGTGGGTGTGGCAAGCACCAGACGATGTGCAATTTCTACTGGTTTTGCAAATCACGTCTTCCCAGTCTCTATAAA
TTCACCGTTGGTGTCATGTGAAGATCAACCAAACTTTCCTCGTCCAAATCCTCAGCGATATTATCCTCCCCAATCTTTTCCAGCTCCCCCAACTGCGAATCCCAATCCCC
TTCACCCCAATCCTTTTCCCACTTTACCCCAACCACTCTCTGTCAAGCTAAACGATACAAAATTTTTACTCTGGAAGAATCAGCTTCTCAACGCCATTCTTGCTAATGGT
TTACACAGGTTCCTCGATGGTACCATTCCTCCCCCTCAGAAGTTCCTTGACGATCAACAATCACAACCGAATCCAGAGTTCCTCGTCTGGGAAAGATACAATAGTCTTGC
CCTTGAAGATGTTCGCAACATGTTGTTAGCTTTCGAGGCTAGGTTGGAAAAGCAAAATACAGTAGATCAACTAAGCTTAGCTCAAGCAAACCTCAGTAGCTTTCAATCTC
ACTTTAATGGCCGCCATTCCATGCCTCGTCCTTCCCTTAATACCACTGTTTTTTCAAGAACTCCCTTCAATCCATCTTCCCCTGTGTTGCCAAATACCTTATCTCCAAGC
ATTTTAGGTAAGCCTCAACCACAATCTTCTTTTTCTCCTTCTCAGAAATGGTCTTCTAGATCAAGTCCTAATCGTCCTCAATACCAAATATGTGGCAAATTTGGGCACAC
CGCTCTTATTTGTCGCCATCGCACAAACTTGGCCTACCAAACTCCATCACCTCAAGCAATGTTAACCATTGCTACTTCAAACCAGCCCCTAACCCAGCCTTTATCGGATT
CTGCTTCTGTTTACTCTCATGATACATATCATCTCGATGAAAATTGGTTCCTCAACTCGGGTGCTACACACCATATGACTCATGATGTTGCCTCTTTGTCCAATCCCATG
TCTTATCTTGGTGGTGAGCAAGTCACGGTCGGTGATGTCCAAGTCTCTGCTGCCGTTTTTCTGTCCTCAGTTACTTCACCCAATTGGCACCTTCGTTTGGGTCATCCAGC
TGCCACAACTTTACATTCTGTTTTGTCTAAGCTCCATAGTCAGAGAAGCCAGAGCCCAGAACATTCTCCCAAAGTCCAGAGCCTTCAGAAGTCAGAGAGTCCAGGGAATT
CAGAAGATCCAAGATTCAGAATTCGGCCATCTCAAGACTCAGAAGATCCAACGCCTGGAAGACTCCAACGAATCAACCGTTTCTTCATCAAGATACAAGTTGAAGATTCA
ACCTTCCTCCGTCTGAGATCAAGCTCGCCAGCCCTCAGATCAGACTCTACATTTTGA
Protein sequenceShow/hide protein sequence
MEEDEEGGETSSSPRSCSEEEGTRTGRGTETYPSSVSSSSRVAVGEGGSAVDEGGNVLRGRKRRRGFGYIFSIHSNVKEQTGPTQHPFGARVGSSGSREPRSPIPSLDPR
LRLTSGLCSGIRVGVASTRRCAISTGFANHVFPVSINSPLVSCEDQPNFPRPNPQRYYPPQSFPAPPTANPNPLHPNPFPTLPQPLSVKLNDTKFLLWKNQLLNAILANG
LHRFLDGTIPPPQKFLDDQQSQPNPEFLVWERYNSLALEDVRNMLLAFEARLEKQNTVDQLSLAQANLSSFQSHFNGRHSMPRPSLNTTVFSRTPFNPSSPVLPNTLSPS
ILGKPQPQSSFSPSQKWSSRSSPNRPQYQICGKFGHTALICRHRTNLAYQTPSPQAMLTIATSNQPLTQPLSDSASVYSHDTYHLDENWFLNSGATHHMTHDVASLSNPM
SYLGGEQVTVGDVQVSAAVFLSSVTSPNWHLRLGHPAATTLHSVLSKLHSQRSQSPEHSPKVQSLQKSESPGNSEDPRFRIRPSQDSEDPTPGRLQRINRFFIKIQVEDS
TFLRLRSSSPALRSDSTF