; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020464 (gene) of Snake gourd v1 genome

Gene IDTan0020464
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG11:7433834..7434967
RNA-Seq ExpressionTan0020464
SyntenyTan0020464
Gene Ontology termsGO:0005765 - lysosomal membrane (cellular component)
InterPro domainsIPR019320 - BLOC-1-related complex subunit 8


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578805.1 hypothetical protein SDJN03_23253, partial [Cucurbita argyrosperma subsp. sororia]4.1e-12485.55Show/hide
Query:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS
        M+GFSTVDGFVEIAESS EMIKYIANEPSTGLFY+QQHTKNAVPNVINLKNSVVDKSHETTLH EDSEDSITML+SMK+CGFPIADEMIRDIKK+LA+MS
Subjt:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS

Query:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM
        TKQPRRGLIRNTSG Q PGR+STWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDL QVEV+++ PQP++P VASASS+ SQPD+
Subjt:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM

Query:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDL
        +T+ELPL+  VNDELQ+DDQVDVG++T+L SVSDNFDDFRADKEAKLEEWLG SGGL+D++++
Subjt:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDL

KAG6602294.1 hypothetical protein SDJN03_07527, partial [Cucurbita argyrosperma subsp. sororia]6.3e-12582.37Show/hide
Query:  DELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRD
        DELPFTFEKMYGFSTVDGFVEI ESS EMIKYIANEPSTGLFYIQQHTKNAVPNV+N+KNSV + S E+TLHTEDSEDSITMLRSMKECGFPIADEMIRD
Subjt:  DELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRD

Query:  IKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASA
        IKK+LA+MS KQPRRGLIR+T GMQPPGR+STWRSATWGRS  +APRDDD GGYISTVFKSARE ASNFKWPQLDI EDL +VEV+K QP+P+QPSV SA
Subjt:  IKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASA

Query:  SSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH
        SSS SQPDM++DELPLS  VND LQ DDQVDVGLDT++ SVS+ FDDFRADKEAKL++WL  SG L+DIRDLS  +GH
Subjt:  SSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH

XP_022134183.1 uncharacterized protein LOC111006508 isoform X1 [Momordica charantia]4.1e-12482.52Show/hide
Query:  DKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPI
        D  EIRSDELPFTFEKMYGFSTVDGFVEI ES  EMIKYIANEPSTGLFYIQQHT+NAVPNV+ L+N VVDKSHETTLHTEDSEDSITMLRSMKE GFPI
Subjt:  DKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPI

Query:  ADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPS
        ADEMIRDIKK+LAIMSTKQPRRGLI NTSG+Q  GRISTWRSATWGRSAIVAP ++ SGGYISTVFKSAREKASNFKWPQLDIK+DL QVEV+K+ PQ +
Subjt:  ADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPS

Query:  QPSVASASSSLSQPDM-ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH
        QPSVASASSS SQPD+  T+ELPLSS VNDELQ+DDQVD  LD +L S SDNFDDFRADKEAKLEEWLG +GGL  + DLS  + H
Subjt:  QPSVASASSSLSQPDM-ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH

XP_022957415.1 uncharacterized protein LOC111458823 isoform X1 [Cucurbita moschata]1.2e-12882.17Show/hide
Query:  DDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFP
        D KEEIRSDELPFTFEKMYGFSTVDGFVEI ESS EMIKYIANEPSTGLFYIQQHTKNAVPN++N+KNSV + S E+TLHTEDSEDSITMLRSMKECGFP
Subjt:  DDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFP

Query:  IADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQP
        IADEMIRDIKK+LA+MS KQPRRGLIR+T GMQPPGR+STWRSATWGRS  +APRDDD GGYISTVFKSARE ASNFKWPQLDI EDL +VEV+K QP+P
Subjt:  IADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQP

Query:  SQPSVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH
        +QPSV SASSS SQPDM++DELPLS  VND LQ DD+VDVGLDT++ SVSD FDDFRADKEAKL++WL  SG L+DIRDLS  +GH
Subjt:  SQPSVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH

XP_038884987.1 uncharacterized protein LOC120075565 isoform X1 [Benincasa hispida]2.1e-12885.51Show/hide
Query:  SDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIR
        SDELPFTFE+MYGFSTVDGFVEIAESS EMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIR
Subjt:  SDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIR

Query:  DIKKTLAIMSTKQPRRGLIRNTSGM----QPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQP
        DI+K+LAIMSTKQPRRGLIRNTSGM    Q PGR+STWRSATWGR AIVAP DDDSGGYISTVFKSAREKASNFKWPQL+I+EDL QVEV+K+QPQP QP
Subjt:  DIKKTLAIMSTKQPRRGLIRNTSGM----QPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQP

Query:  SVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH
         VA A+SS SQPDM+T+ELPLSS VNDE Q++DQV+  L+T+L  VSDNFDDFRADKEAKLEEWLGESGGL++ RDL T +GH
Subjt:  SVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH

TrEMBL top hitse value%identityAlignment
A0A6J1C195 uncharacterized protein LOC111006508 isoform X12.0e-12482.52Show/hide
Query:  DKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPI
        D  EIRSDELPFTFEKMYGFSTVDGFVEI ES  EMIKYIANEPSTGLFYIQQHT+NAVPNV+ L+N VVDKSHETTLHTEDSEDSITMLRSMKE GFPI
Subjt:  DKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPI

Query:  ADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPS
        ADEMIRDIKK+LAIMSTKQPRRGLI NTSG+Q  GRISTWRSATWGRSAIVAP ++ SGGYISTVFKSAREKASNFKWPQLDIK+DL QVEV+K+ PQ +
Subjt:  ADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPS

Query:  QPSVASASSSLSQPDM-ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH
        QPSVASASSS SQPD+  T+ELPLSS VNDELQ+DDQVD  LD +L S SDNFDDFRADKEAKLEEWLG +GGL  + DLS  + H
Subjt:  QPSVASASSSLSQPDM-ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH

A0A6J1FML8 uncharacterized protein LOC111445573 isoform X11.3e-12385.17Show/hide
Query:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS
        M+GFSTVDGFVEIAESS EMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLH EDSEDSITML+SMK+CGFPIADEMIRDIKK+LA+MS
Subjt:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS

Query:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM
        TKQPRRGLIRNTSG Q PGR+STWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDL +VEV+++ PQP++P VASASS+ SQPD+
Subjt:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM

Query:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDL
        +T+ELPL+  VNDELQ+D+QVDVG++T+L SVSDNFDDFRADKEAKLEEWLG SGGL+D++++
Subjt:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDL

A0A6J1H0H0 uncharacterized protein LOC111458823 isoform X15.9e-12982.17Show/hide
Query:  DDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFP
        D KEEIRSDELPFTFEKMYGFSTVDGFVEI ESS EMIKYIANEPSTGLFYIQQHTKNAVPN++N+KNSV + S E+TLHTEDSEDSITMLRSMKECGFP
Subjt:  DDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFP

Query:  IADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQP
        IADEMIRDIKK+LA+MS KQPRRGLIR+T GMQPPGR+STWRSATWGRS  +APRDDD GGYISTVFKSARE ASNFKWPQLDI EDL +VEV+K QP+P
Subjt:  IADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQP

Query:  SQPSVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH
        +QPSV SASSS SQPDM++DELPLS  VND LQ DD+VDVGLDT++ SVSD FDDFRADKEAKL++WL  SG L+DIRDLS  +GH
Subjt:  SQPSVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGH

A0A6J1JS14 uncharacterized protein LOC111487293 isoform X11.0e-12082.12Show/hide
Query:  DDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFP
        D KEEIRSDELPFTFEKMYGFSTVDGFVEI ESS EMIKYIANEPSTGLFYIQQHTKN VPNV+N+KNSV + S E+TLHTEDSEDSITMLRSMKECGFP
Subjt:  DDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFP

Query:  IADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQP
        IADEM RDIKK+LA+MSTKQPRRGLIR+T GMQPPGR+STWRSATWGRSA +APRDDD G YISTVFK ARE  SNFKWPQLDI EDL +VEV+K QPQP
Subjt:  IADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQP

Query:  SQPSVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGL
        +QPSV SASSS SQPDM++DELPLS  VND LQ DD+VDVGLDT+L SVS  FDDFRADKEAKL++WL  SG +
Subjt:  SQPSVASASSSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGL

A0A6J1JWC1 uncharacterized protein LOC111489457 isoform X11.8e-12285.17Show/hide
Query:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS
        M+GFSTVDGFVEIAESS EMIKYIANEPSTGLFYIQ HTKNAVPNVINLKNSVVDKSHETTLH EDSEDSITML+SMK+CGFPIADEMIRDIKK+LA+MS
Subjt:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS

Query:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM
        TKQPRRGLIRNTSG Q PGR+STWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDL  VEV+++ PQP++P VASASS  SQPD+
Subjt:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM

Query:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDL
         T+ELPLS  VNDELQ+DDQVDV ++T+L SVSDNFDDFRADKEAKLEEWLG SGGL+D++++
Subjt:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39170.1 unknown protein8.6e-5648.08Show/hide
Query:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS
        M+ FSTVDGF EI ES  EMIKYIANEPS GL+YIQQH +NA PNVINL N+V++KS ET LHTED EDSI M++SMK+CG PIADEMI DIK +LAIMS
Subjt:  MYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGFPIADEMIRDIKKTLAIMS

Query:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM
        +KQPRRG+I N++   P  R S+  + T  R +  +  + +S  Y ++VF +A+EKASN KWPQLD KE          Q   + P+V S          
Subjt:  TKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASASSSLSQPDM

Query:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDI
                    +EL+++++ D G   ++   +  F++F+A KEA L+ WLG+  G  D+
Subjt:  ETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAAAGTGAGCAAGACAGGCAAGTATGGAGACGACAAGGAGGAGATAAGAAGTGATGAACTGCCATTCACCTTTGAAAAGATGTATGGATTCTCCACAGTTGATGG
CTTTGTGGAGATAGCTGAAAGCTCGGGGGAGATGATCAAGTATATTGCTAACGAACCTTCAACTGGGCTTTTCTACATTCAACAGCACACAAAAAATGCTGTTCCCAATG
TTATCAATCTGAAGAATAGTGTGGTGGACAAGTCTCATGAAACAACTTTGCATACTGAAGATTCGGAGGATTCGATCACCATGTTGAGGTCGATGAAAGAATGTGGATTT
CCTATTGCTGATGAGATGATCAGAGACATAAAGAAGACTCTTGCTATAATGTCAACCAAACAGCCAAGAAGAGGCTTGATTCGTAATACTTCTGGTATGCAGCCACCGGG
GAGAATAAGCACTTGGAGATCGGCCACCTGGGGGCGAAGCGCAATTGTTGCCCCACGTGACGACGACAGTGGCGGTTATATTTCAACAGTTTTCAAGTCAGCTAGAGAAA
AGGCAAGCAACTTTAAGTGGCCACAGCTTGACATCAAGGAAGATCTTGTGCAGGTTGAAGTCAATAAGGTACAGCCACAACCTAGCCAACCATCAGTTGCATCAGCTAGT
TCTAGTTTATCACAGCCAGATATGGAAACCGACGAGTTGCCTCTGTCTAGTCATGTTAATGATGAGTTGCAACAAGACGACCAGGTTGATGTCGGTTTGGACACCAATTT
ATTTTCGGTGTCTGATAACTTTGACGATTTCAGGGCCGATAAAGAAGCAAAATTGGAGGAGTGGTTGGGAGAGTCTGGCGGCTTGAGTGATATAAGAGATCTTAGCACAC
GGAGAGGCCATCAACATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAAAGTGAGCAAGACAGGCAAGTATGGAGACGACAAGGAGGAGATAAGAAGTGATGAACTGCCATTCACCTTTGAAAAGATGTATGGATTCTCCACAGTTGATGG
CTTTGTGGAGATAGCTGAAAGCTCGGGGGAGATGATCAAGTATATTGCTAACGAACCTTCAACTGGGCTTTTCTACATTCAACAGCACACAAAAAATGCTGTTCCCAATG
TTATCAATCTGAAGAATAGTGTGGTGGACAAGTCTCATGAAACAACTTTGCATACTGAAGATTCGGAGGATTCGATCACCATGTTGAGGTCGATGAAAGAATGTGGATTT
CCTATTGCTGATGAGATGATCAGAGACATAAAGAAGACTCTTGCTATAATGTCAACCAAACAGCCAAGAAGAGGCTTGATTCGTAATACTTCTGGTATGCAGCCACCGGG
GAGAATAAGCACTTGGAGATCGGCCACCTGGGGGCGAAGCGCAATTGTTGCCCCACGTGACGACGACAGTGGCGGTTATATTTCAACAGTTTTCAAGTCAGCTAGAGAAA
AGGCAAGCAACTTTAAGTGGCCACAGCTTGACATCAAGGAAGATCTTGTGCAGGTTGAAGTCAATAAGGTACAGCCACAACCTAGCCAACCATCAGTTGCATCAGCTAGT
TCTAGTTTATCACAGCCAGATATGGAAACCGACGAGTTGCCTCTGTCTAGTCATGTTAATGATGAGTTGCAACAAGACGACCAGGTTGATGTCGGTTTGGACACCAATTT
ATTTTCGGTGTCTGATAACTTTGACGATTTCAGGGCCGATAAAGAAGCAAAATTGGAGGAGTGGTTGGGAGAGTCTGGCGGCTTGAGTGATATAAGAGATCTTAGCACAC
GGAGAGGCCATCAACATTGA
Protein sequenceShow/hide protein sequence
MPKVSKTGKYGDDKEEIRSDELPFTFEKMYGFSTVDGFVEIAESSGEMIKYIANEPSTGLFYIQQHTKNAVPNVINLKNSVVDKSHETTLHTEDSEDSITMLRSMKECGF
PIADEMIRDIKKTLAIMSTKQPRRGLIRNTSGMQPPGRISTWRSATWGRSAIVAPRDDDSGGYISTVFKSAREKASNFKWPQLDIKEDLVQVEVNKVQPQPSQPSVASAS
SSLSQPDMETDELPLSSHVNDELQQDDQVDVGLDTNLFSVSDNFDDFRADKEAKLEEWLGESGGLSDIRDLSTRRGHQH