; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G017700 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G017700
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionS-adenosyl-L-methionine-dependent methyltransferase superfamily protein
Genome locationchr05:25043321..25045795
RNA-Seq ExpressionLsi05G017700
SyntenyLsi05G017700
Gene Ontology termsGO:0006886 - intracellular protein transport (biological process)
GO:0032259 - methylation (biological process)
GO:0005741 - mitochondrial outer membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR005683 - Mitochondrial import receptor subunit Tom22
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOY06180.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein, putative [Theobroma cacao]2.6e-9557.1Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GA +  VEIDP+VI+AS+QAMGFPAFSVMT SG+RA +KP  I+++MWKGIHERL+LYE DAE F+    NLYD+IFIDAYDGDDIFP+K WDP+S FL+
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEK-------NSGLGFTVAVPWVCNTSLVVCKG-----RHILR
        +LS ++HPKHGTVVVNLH+D++I N + SV    + ILP+GKYVS++ +AYKDVL+G+E+        SG+GFT++VPWVCNTSLVVC+G       + R
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEK-------NSGLGFTVAVPWVCNTSLVVCKG-----RHILR

Query:  ESKIEALLAFDPPKIKFPSASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDR
        +  +  L++     +K    ++       +L   D+ D G+++RLSR+VS+S I   +KRAAS+ A+VT KLL+STGKAAW AGTTFL+L+VPLIIEMDR
Subjt:  ESKIEALLAFDPPKIKFPSASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDR

Query:  EQQLNELELQHATLLGA
        EQQ  ++ELQ  +LLG+
Subjt:  EQQLNELELQHATLLGA

KAA0049209.1 S-adenosyl-L-methionine-dependent methyltransferase superfamily protein [Cucumis melo var. makuwa]1.4e-9384.88Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GANVDIVEIDPLVISASIQAMGFPAFSVMTASG+R+S +PRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDM+FIDAYDGDDIFPHK WDPNS+FLE
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGD---EKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKI
        AL+KRVHPKHGTVVVNLHSDSDI+  DGSVPSVLEHILPMGKYVSQIGRAY DVLVGD   EKNSGLGFTVAVPWVCNTSLVVCKG      ++ R+S +
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGD---EKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKI

Query:  EALLA
          L++
Subjt:  EALLA

KAF4347949.1 hypothetical protein G4B88_002863 [Cannabis sativa]5.0e-9961.85Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GA V IVEIDPLVISASIQAMGFPAFSVMT SG RA +KP   D++MWKG HERL+LYE DAE FI   TNLYDMIFIDAYDG+DIFP + WDP+S FL+
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVL-----VGDEKNSGLGFTVAVPWVCNTSLVVCKG---------RHI
        ALS ++HPKHGTVVVNLHSDS++L+ DGSVPSVLE ILPMGKYVSQ+GRAYKDVL        EK SGLGFTV+VPW+CNT+LVVCKG         R +
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVL-----VGDEKNSGLGFTVAVPWVCNTSLVVCKG---------RHI

Query:  LRESKIEALLAFD-----PPKIKFPS-ASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVV
        +  + I   +  +     P     PS +SL+      R    ++G  G  S ++R VS+S+I +Q+KR  S   FV+ KLL+STG+AAWIAGTTFLILVV
Subjt:  LRESKIEALLAFD-----PPKIKFPS-ASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVV

Query:  PLIIEMDREQQLNELELQHATLLGA
        PLIIEMDREQQ  +LE+Q A++LG+
Subjt:  PLIIEMDREQQLNELELQHATLLGA

XP_004134040.1 uncharacterized protein LOC101203692 [Cucumis sativus]1.7e-9485.64Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GANVDIVEIDPLVISASIQAMGFPAFSVMTASG+R S +PRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDM+FIDAYDGDDIFPHK WDPNS+FLE
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKIEAL
        AL+KRVHPKHGTVVVNLHSDSD++  DGSVPSVLEHILPMGKYVSQIGRAY DVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG      ++ R+S +  L
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKIEAL

Query:  LA
        ++
Subjt:  LA

XP_022947566.1 uncharacterized protein LOC111451388 isoform X1 [Cucurbita moschata]2.5e-9880.17Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERA A+P  IDD+MWKGIHERLFLYELDAEDF+ NTTN YDMIFIDAYDGDDIFP+K WDPNS+FLE
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKGRHI--LRESKIEALLAF
        ALSKR+HPKHGTVVVNLHSDSD+L  DGSVPSVLEHILPMGKYVS+IGRAYKDVL  DE+ SGLGFTVAVPWVCNTSLVVCKG  +   +   + A LA 
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKGRHI--LRESKIEALLAF

Query:  DPPKIKFPSASLMAVNGKNRLPL--NDDGDA-GVLSR
        DPPK  F S SLMAVN + RL L  NDD DA GV SR
Subjt:  DPPKIKFPSASLMAVNGKNRLPL--NDDGDA-GVLSR

TrEMBL top hitse value%identityAlignment
A0A061EMF8 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein, putative1.2e-9557.1Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GA +  VEIDP+VI+AS+QAMGFPAFSVMT SG+RA +KP  I+++MWKGIHERL+LYE DAE F+    NLYD+IFIDAYDGDDIFP+K WDP+S FL+
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEK-------NSGLGFTVAVPWVCNTSLVVCKG-----RHILR
        +LS ++HPKHGTVVVNLH+D++I N + SV    + ILP+GKYVS++ +AYKDVL+G+E+        SG+GFT++VPWVCNTSLVVC+G       + R
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEK-------NSGLGFTVAVPWVCNTSLVVCKG-----RHILR

Query:  ESKIEALLAFDPPKIKFPSASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDR
        +  +  L++     +K    ++       +L   D+ D G+++RLSR+VS+S I   +KRAAS+ A+VT KLL+STGKAAW AGTTFL+L+VPLIIEMDR
Subjt:  ESKIEALLAFDPPKIKFPSASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDR

Query:  EQQLNELELQHATLLGA
        EQQ  ++ELQ  +LLG+
Subjt:  EQQLNELELQHATLLGA

A0A0A0L4X6 Uncharacterized protein8.0e-9585.64Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GANVDIVEIDPLVISASIQAMGFPAFSVMTASG+R S +PRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDM+FIDAYDGDDIFPHK WDPNS+FLE
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKIEAL
        AL+KRVHPKHGTVVVNLHSDSD++  DGSVPSVLEHILPMGKYVSQIGRAY DVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG      ++ R+S +  L
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKIEAL

Query:  LA
        ++
Subjt:  LA

A0A5A7U6S4 S-adenosyl-L-methionine-dependent methyltransferase superfamily protein6.8e-9484.88Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GANVDIVEIDPLVISASIQAMGFPAFSVMTASG+R+S +PRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDM+FIDAYDGDDIFPHK WDPNS+FLE
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGD---EKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKI
        AL+KRVHPKHGTVVVNLHSDSDI+  DGSVPSVLEHILPMGKYVSQIGRAY DVLVGD   EKNSGLGFTVAVPWVCNTSLVVCKG      ++ R+S +
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGD---EKNSGLGFTVAVPWVCNTSLVVCKG-----RHILRESKI

Query:  EALLA
          L++
Subjt:  EALLA

A0A6J1G6T2 uncharacterized protein LOC111451388 isoform X11.2e-9880.17Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERA A+P  IDD+MWKGIHERLFLYELDAEDF+ NTTN YDMIFIDAYDGDDIFP+K WDPNS+FLE
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKGRHI--LRESKIEALLAF
        ALSKR+HPKHGTVVVNLHSDSD+L  DGSVPSVLEHILPMGKYVS+IGRAYKDVL  DE+ SGLGFTVAVPWVCNTSLVVCKG  +   +   + A LA 
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKGRHI--LRESKIEALLAF

Query:  DPPKIKFPSASLMAVNGKNRLPL--NDDGDA-GVLSR
        DPPK  F S SLMAVN + RL L  NDD DA GV SR
Subjt:  DPPKIKFPSASLMAVNGKNRLPL--NDDGDA-GVLSR

A0A7J6DQT5 Uncharacterized protein2.4e-9961.85Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE
        GA V IVEIDPLVISASIQAMGFPAFSVMT SG RA +KP   D++MWKG HERL+LYE DAE FI   TNLYDMIFIDAYDG+DIFP + WDP+S FL+
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFLE

Query:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVL-----VGDEKNSGLGFTVAVPWVCNTSLVVCKG---------RHI
        ALS ++HPKHGTVVVNLHSDS++L+ DGSVPSVLE ILPMGKYVSQ+GRAYKDVL        EK SGLGFTV+VPW+CNT+LVVCKG         R +
Subjt:  ALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVL-----VGDEKNSGLGFTVAVPWVCNTSLVVCKG---------RHI

Query:  LRESKIEALLAFD-----PPKIKFPS-ASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVV
        +  + I   +  +     P     PS +SL+      R    ++G  G  S ++R VS+S+I +Q+KR  S   FV+ KLL+STG+AAWIAGTTFLILVV
Subjt:  LRESKIEALLAFD-----PPKIKFPS-ASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVV

Query:  PLIIEMDREQQLNELELQHATLLGA
        PLIIEMDREQQ  +LE+Q A++LG+
Subjt:  PLIIEMDREQQLNELELQHATLLGA

SwissProt top hitse value%identityAlignment
O64497 Mitochondrial import receptor subunit TOM9-12.3e-1448.81Show/hide
Query:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAV
        GD+ +L++    +S   I  Q +RAA +  +V+ KLL+STGKAAWIAGTTFLIL VPLI+E++++ +L E++ + A+LLG   V
Subjt:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAV

Q9FNC9 Mitochondrial import receptor subunit TOM9-28.2e-2062.5Show/hide
Query:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAVSVQK
        GD  +L+R    +S S I  Q +RAA +   V+ KLLRSTGKAAWIAGTTFLILVVPLIIEMDRE Q+NE+ELQ A+LLGA    +Q+
Subjt:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAVSVQK

Arabidopsis top hitse value%identityAlignment
AT1G04070.1 translocase of outer membrane 22-I1.6e-1548.81Show/hide
Query:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAV
        GD+ +L++    +S   I  Q +RAA +  +V+ KLL+STGKAAWIAGTTFLIL VPLI+E++++ +L E++ + A+LLG   V
Subjt:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAV

AT4G13330.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein3.9e-5760.33Show/hide
Query:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFI-SNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFL
        GA VDIVE+DPLVIS S++AMGFPAFSVMTA+G+R    P  ID +MW GIHERL LYE  AEDFI  N +N YD+IF+DAYDG DIFPH  WD +S F+
Subjt:  GANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFI-SNTTNLYDMIFIDAYDGDDIFPHKFWDPNSSFL

Query:  EALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG
        +ALSK +H +HGT+VVNLHSD+DI ++D S   V       GKYV ++G+AYK  L+ +E+N GL F   VPW+CN SLVV +G
Subjt:  EALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKG

AT5G43970.1 translocase of outer membrane 22-V5.8e-2162.5Show/hide
Query:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAVSVQK
        GD  +L+R    +S S I  Q +RAA +   V+ KLLRSTGKAAWIAGTTFLILVVPLIIEMDRE Q+NE+ELQ A+LLGA    +Q+
Subjt:  GDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAVSVQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAATGATTAGTTACAAAGCTAGCTCAAGTTCTTGTGCTTTTTCAGGTGCCAATGTTGACATAGTTGAAATCGACCCTCTTGTTATCTCAGCTTCAATTCAAGCCAT
GGGCTTCCCAGCTTTCTCAGTTATGACAGCATCAGGAGAGCGTGCGTCTGCCAAACCTAGATTCATCGACGACATCATGTGGAAAGGCATCCACGAACGACTTTTCCTCT
ATGAATTAGATGCAGAGGATTTCATCAGCAACACGACCAATCTATATGACATGATCTTCATTGATGCTTATGACGGAGACGACATATTCCCTCACAAATTCTGGGATCCA
AATTCCTCATTTCTTGAAGCTCTCAGCAAGCGGGTTCATCCTAAACATGGAACTGTGGTGGTGAATCTTCACTCAGATTCCGATATCTTAAATCTAGATGGCTCTGTCCC
ATCTGTTCTTGAACATATATTGCCAATGGGAAAGTATGTATCTCAAATAGGCCGAGCATACAAGGATGTTTTGGTGGGAGATGAGAAAAATTCTGGTTTGGGTTTCACAG
TGGCGGTTCCATGGGTTTGCAATACATCTCTAGTTGTGTGCAAAGGAAGACACATTCTCCGTGAAAGCAAAATTGAAGCATTGTTAGCGTTCGATCCCCCGAAAATTAAG
TTCCCCTCTGCGTCTCTCATGGCGGTCAATGGCAAGAACCGATTGCCGCTCAATGACGACGGCGACGCCGGTGTTCTTTCCAGGTTATCACGTTCCGTATCCGAGTCCTC
GATCTATCGCCAATCCAAACGCGCCGCTTCAAATACCGCCTTCGTCACTAATAAACTGTTGCGGAGCACCGGTAAGGCCGCCTGGATCGCCGGCACCACATTCCTCATTC
TCGTTGTCCCGCTTATCATCGAGATGGATCGCGAACAGCAGCTCAATGAGCTCGAGTTGCAGCACGCCACCTTACTCGGTGCCTCCGCCGTATCGGTTCAAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACAATGATTAGTTACAAAGCTAGCTCAAGTTCTTGTGCTTTTTCAGGTGCCAATGTTGACATAGTTGAAATCGACCCTCTTGTTATCTCAGCTTCAATTCAAGCCAT
GGGCTTCCCAGCTTTCTCAGTTATGACAGCATCAGGAGAGCGTGCGTCTGCCAAACCTAGATTCATCGACGACATCATGTGGAAAGGCATCCACGAACGACTTTTCCTCT
ATGAATTAGATGCAGAGGATTTCATCAGCAACACGACCAATCTATATGACATGATCTTCATTGATGCTTATGACGGAGACGACATATTCCCTCACAAATTCTGGGATCCA
AATTCCTCATTTCTTGAAGCTCTCAGCAAGCGGGTTCATCCTAAACATGGAACTGTGGTGGTGAATCTTCACTCAGATTCCGATATCTTAAATCTAGATGGCTCTGTCCC
ATCTGTTCTTGAACATATATTGCCAATGGGAAAGTATGTATCTCAAATAGGCCGAGCATACAAGGATGTTTTGGTGGGAGATGAGAAAAATTCTGGTTTGGGTTTCACAG
TGGCGGTTCCATGGGTTTGCAATACATCTCTAGTTGTGTGCAAAGGAAGACACATTCTCCGTGAAAGCAAAATTGAAGCATTGTTAGCGTTCGATCCCCCGAAAATTAAG
TTCCCCTCTGCGTCTCTCATGGCGGTCAATGGCAAGAACCGATTGCCGCTCAATGACGACGGCGACGCCGGTGTTCTTTCCAGGTTATCACGTTCCGTATCCGAGTCCTC
GATCTATCGCCAATCCAAACGCGCCGCTTCAAATACCGCCTTCGTCACTAATAAACTGTTGCGGAGCACCGGTAAGGCCGCCTGGATCGCCGGCACCACATTCCTCATTC
TCGTTGTCCCGCTTATCATCGAGATGGATCGCGAACAGCAGCTCAATGAGCTCGAGTTGCAGCACGCCACCTTACTCGGTGCCTCCGCCGTATCGGTTCAAAAGTAA
Protein sequenceShow/hide protein sequence
MTMISYKASSSSCAFSGANVDIVEIDPLVISASIQAMGFPAFSVMTASGERASAKPRFIDDIMWKGIHERLFLYELDAEDFISNTTNLYDMIFIDAYDGDDIFPHKFWDP
NSSFLEALSKRVHPKHGTVVVNLHSDSDILNLDGSVPSVLEHILPMGKYVSQIGRAYKDVLVGDEKNSGLGFTVAVPWVCNTSLVVCKGRHILRESKIEALLAFDPPKIK
FPSASLMAVNGKNRLPLNDDGDAGVLSRLSRSVSESSIYRQSKRAASNTAFVTNKLLRSTGKAAWIAGTTFLILVVPLIIEMDREQQLNELELQHATLLGASAVSVQK