; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016137 (gene) of Snake gourd v1 genome

Gene IDTan0016137
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSWIM-type domain-containing protein
Genome locationLG02:94083393..94085250
RNA-Seq ExpressionTan0016137
SyntenyTan0016137
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0016567 - protein ubiquitination (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0061630 - ubiquitin protein ligase activity (molecular function)
InterPro domainsIPR001841 - Zinc finger, RING-type
IPR007527 - Zinc finger, SWIM-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR039903 - E3 ubiquitin-protein ligase Zswim2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065044.1 hypothetical protein E6C27_scaffold82G003720 [Cucumis melo var. makuwa]3.4e-11185.23Show/hide
Query:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR
        MESVASTSSP  RP   VPRFKPSQ VADRIVRALHHHL LLHRSGSNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIY++ALG+SLDDVCLRR
Subjt:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR

Query:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV
        RTLRPCQLNRLLAAP+ML SLAEIGLR++FHQQFFQV +RA       VTV DMEDGT CPVCLDDMKKNDRVVAC+TCRNLVHEDCF+RWKRSKGRR V
Subjt:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV

Query:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        SCVVCRARWKDT D+QKYLNLSAY+NEHD ID++LYN
Subjt:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

XP_016899980.1 PREDICTED: uncharacterized protein LOC103488177 [Cucumis melo]3.4e-11185.23Show/hide
Query:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR
        MESVASTSSP  RP   VPRFKPSQ VADRIVRALHHHL LLHRSGSNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIY++ALG+SLDDVCLRR
Subjt:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR

Query:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV
        RTLRPCQLNRLLAAP+ML SLAEIGLR++FHQQFFQV +RA       VTV DMEDGT CPVCLDDMKKNDRVVAC+TCRNLVHEDCF+RWKRSKGRR V
Subjt:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV

Query:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        SCVVCRARWKDT D+QKYLNLSAY+NEHD ID++LYN
Subjt:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

XP_023521856.1 uncharacterized protein LOC111785706 [Cucurbita pepo subsp. pepo]3.4e-11187.39Show/hide
Query:  MESVAST-SSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLR
        MESVAST SSPPD PRL + RFKPSQAVADRIVRALHHHL LL+RS SNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIYIRALG+SLDDVCLR
Subjt:  MESVAST-SSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLR

Query:  RRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVSCVVC
        RRTLRPCQLNRLLAAP++L+SLAEIG+RRIFHQQFFQVKER S   V +E+GT CPVCLDDMKKNDRVVAC+TCRNLVHEDCFSRWKRSKGRR VSCVVC
Subjt:  RRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVSCVVC

Query:  RARWKDTTDQQKYLNLSAYVNEHDAIDNDL
        RARWK+TTDQQKYLNLSAYVN+HD +DN+L
Subjt:  RARWKDTTDQQKYLNLSAYVNEHDAIDNDL

XP_023546870.1 uncharacterized protein LOC111805851 [Cucurbita pepo subsp. pepo]1.2e-11187.83Show/hide
Query:  MESVAST-SSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLR
        MESVAST SSPPD PRL + RFKPSQAVADRIVRALHHHL LL+RS SNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIYIRALG+SLDDVCLR
Subjt:  MESVAST-SSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLR

Query:  RRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVSCVVC
        RRTLRPCQLNRLLAAP++L+SLAEIG+RRIFHQQFFQVKER S   V +E+GT CPVCLDDMKKNDRVVAC+TCRNLVHEDCFSRWKRSKGRR VSCVVC
Subjt:  RRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVSCVVC

Query:  RARWKDTTDQQKYLNLSAYVNEHDAIDNDL
        RARWK+TTDQQKYLNLSAYVNEHD +DN+L
Subjt:  RARWKDTTDQQKYLNLSAYVNEHDAIDNDL

XP_038885291.1 uncharacterized protein LOC120075728 [Benincasa hispida]2.9e-11883.01Show/hide
Query:  MYVYMYMDV---SKNCLTHLNCFSFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCP
        M+V +Y+ V    +    HL+CFSFQILMESVASTSS        VPRFKPS AVADRIVRALHHHL LLHRSGSNFFVLGATGNVY+VSLSSTPSCTCP
Subjt:  MYVYMYMDV---SKNCLTHLNCFSFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCP

Query:  DRITPCKHILFIYIRALGVSLDDVCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTT
        DRITPCKHILFIY++ALG+SLDD CLRRRTLRPCQLNRLLAAP+ML SLAE+GLR +FHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKK DRVVAC+T
Subjt:  DRITPCKHILFIYIRALGVSLDDVCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTT

Query:  CRNLVHEDCFSRWKRSKGRRTVSCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        CRNLVHEDCFSRWKRSKGRR+VSCVVCRARWKD  DQQKYLNLSAY+NEHDAID++LY+
Subjt:  CRNLVHEDCFSRWKRSKGRRTVSCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

TrEMBL top hitse value%identityAlignment
A0A0A0LS10 SWIM-type domain-containing protein1.4e-10783.33Show/hide
Query:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR
        MESVASTSSP      +VPRFKPSQ VADRIVRALHHHL LLHRSGSNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIY++ALG+SLDDVCLRR
Subjt:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR

Query:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA-------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRT
        RTLRPCQLNRLLAAP+ML SLAEIGLR++FHQQFFQV +RA       SVTV DMEDG+ CPVCLDDMKK DRVVAC+TCRNLVHEDCF+RWKRSKGRR 
Subjt:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA-------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRT

Query:  VSCVVCRARWKDTTD-QQKYLNLSAY-VNEHDAIDNDLYN
        VSCVVCRARWKDT D QQKYLNLSAY +NEHD ID  LYN
Subjt:  VSCVVCRARWKDTTD-QQKYLNLSAY-VNEHDAIDNDLYN

A0A1S4DW96 uncharacterized protein LOC1034881771.6e-11185.23Show/hide
Query:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR
        MESVASTSSP  RP   VPRFKPSQ VADRIVRALHHHL LLHRSGSNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIY++ALG+SLDDVCLRR
Subjt:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR

Query:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV
        RTLRPCQLNRLLAAP+ML SLAEIGLR++FHQQFFQV +RA       VTV DMEDGT CPVCLDDMKKNDRVVAC+TCRNLVHEDCF+RWKRSKGRR V
Subjt:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV

Query:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        SCVVCRARWKDT D+QKYLNLSAY+NEHD ID++LYN
Subjt:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

A0A5A7VCN6 SWIM-type domain-containing protein1.6e-11185.23Show/hide
Query:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR
        MESVASTSSP  RP   VPRFKPSQ VADRIVRALHHHL LLHRSGSNFFVLGATGNVY+VSLSSTPSCTCPDRITPCKHILFIY++ALG+SLDDVCLRR
Subjt:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRR

Query:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV
        RTLRPCQLNRLLAAP+ML SLAEIGLR++FHQQFFQV +RA       VTV DMEDGT CPVCLDDMKKNDRVVAC+TCRNLVHEDCF+RWKRSKGRR V
Subjt:  RTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERA------SVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTV

Query:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        SCVVCRARWKDT D+QKYLNLSAY+NEHD ID++LYN
Subjt:  SCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

A0A6J1BPZ4 uncharacterized protein LOC1110047032.5e-10776.74Show/hide
Query:  MYVYMYMDVSKNCLTHLNCFSFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRI
        +Y+Y+   V +     L  F  +  MESVASTSSPP RPRL + RFKPSQAVADRIVRALHHHL LLHRS SNFFVLGATGNVY+VSLS+TPSC+CPDRI
Subjt:  MYVYMYMDVSKNCLTHLNCFSFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRI

Query:  TPCKHILFIYIRALGVSLDDVCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKK--NDRVVACTTC
        TPCKHILFIYIRALGVSLDD CLRRRTLRPC LNRLL AP+ML+S+A IG+RR+FHQ+FFQVK RAS +VVD+EDGTTCPVCLD+MKK  +DRVVAC TC
Subjt:  TPCKHILFIYIRALGVSLDDVCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKK--NDRVVACTTC

Query:  RNLVHEDCFSRWKRSKGRRTVSCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        RN+VHEDCF RWKRSKGRR+VSCVVCRARW+DT+ Q+KYLNLSAY+NE   I+NDLYN
Subjt:  RNLVHEDCFSRWKRSKGRRTVSCVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

A0A6J1GJ16 mitogen-activated protein kinase kinase kinase 19.0e-11081.78Show/hide
Query:  SFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDD
        +FQILMESVASTSS PDRPR  +P F+PSQA+  RIVRALHHHL LLHRS S+FFVLGATGNVY+VSLSSTPSCTCPDRITPCKH+LF+YIRALGVSLDD
Subjt:  SFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDD

Query:  VCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVS
         CL RRTLRPCQLNRLL APVM +SLAEI LRR+FHQQFFQVKER  V +VD+EDG  CPVCLDD+ K+DRVVACTTCRNLVH+DCFSRWKRSKGRR +S
Subjt:  VCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVS

Query:  CVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN
        CVVCRARWKDTT++QKYLNLSAY++EHDA+DN+ YN
Subjt:  CVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYN

SwissProt top hitse value%identityAlignment
Q13233 Mitogen-activated protein kinase kinase kinase 16.0e-1025.93Show/hide
Query:  RLTVPRFKPS---------QAVADRIVRALHHHLCLLHRSGSNFFVLG--ATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRRRTL
        R+T PR  PS         +    R+ + +   L LL + G N F++G  +  N Y V +    +C+C  R T C H+LF+ +R   +   D  L R+TL
Subjt:  RLTVPRFKPS---------QAVADRIVRALHHHLCLLHRSGSNFFVLG--ATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRRRTL

Query:  RPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFF------------QVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTT-CRNLVHEDCFSRWKRS--
        +  ++  L        S       R   Q+F                  +S   +  E+   CP+CL  M   + +  C   CRN +H  C S W     
Subjt:  RPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFF------------QVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTT-CRNLVHEDCFSRWKRS--

Query:  KGRRTVSCVVCRARWK
        + R  + C +CR++W+
Subjt:  KGRRTVSCVVCRARWK

Q62925 Mitogen-activated protein kinase kinase kinase 12.7e-1025.93Show/hide
Query:  RLTVPRFKPS---------QAVADRIVRALHHHLCLLHRSGSNFFVLG--ATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRRRTL
        R+T PR  PS         +  + R+ + +   L LL + G N F++G  +  N Y V +    +C+C  R T C H+LF+ +R   +   D  L R+TL
Subjt:  RLTVPRFKPS---------QAVADRIVRALHHHLCLLHRSGSNFFVLG--ATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLRRRTL

Query:  RPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFF------------QVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTT-CRNLVHEDCFSRWKRS--
        +  ++  L        S       R   Q+F                  +S   +  E+   CP+CL  M   + +  C   CRN +H  C S W     
Subjt:  RPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFF------------QVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTT-CRNLVHEDCFSRWKRS--

Query:  KGRRTVSCVVCRARWK
        + R  + C +CR++W+
Subjt:  KGRRTVSCVVCRARWK

Arabidopsis top hitse value%identityAlignment
AT5G11620.1 SWIM zinc finger family protein / mitogen-activated protein kinase kinase kinase (MAPKKK)-related1.3e-4742.75Show/hide
Query:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHR-SGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLR
        MESV S    P         F  +Q VADRI+RAL H + LLHR +   F VLGAT NVY V+L +TP+CTCPDR  PCKHILF+ IR LG+ LDD CLR
Subjt:  MESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHR-SGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIYIRALGVSLDDVCLR

Query:  RRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTV------------VDMEDGTTCPVCLDDM--------------KKNDRVVACTTC
        +R LR C L  L +AP   D LA   L++ F Q F     +   T             VD E+  TCP+CLDD+              +K   VV C  C
Subjt:  RRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTV------------VDMEDGTTCPVCLDDM--------------KKNDRVVACTTC

Query:  RNLVHEDCFSRWKRSKGRRTVSCVVCRARWKDTTDQQK--------------YLNLSAYVNE
        +N VH++C   W++S+GRR   CVVCRARW      +               YLNL+ YV+E
Subjt:  RNLVHEDCFSRWKRSKGRRTVSCVVCRARWKDTTDQQK--------------YLNLSAYVNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGTATATATGTATATGGATGTATCTAAGAACTGTTTAACCCATCTTAATTGCTTTTCCTTCCAAATTCTCATGGAGTCTGTTGCCTCTACTTCATCCCCACCGGA
CCGCCCCCGTCTAACCGTTCCTCGATTCAAGCCCTCCCAAGCTGTGGCTGACCGGATTGTTCGAGCTCTCCACCATCACCTTTGTCTCCTTCACCGCTCCGGTTCTAATT
TCTTTGTGTTGGGAGCCACAGGCAATGTCTACGTCGTGTCTCTATCCTCCACCCCCTCATGCACTTGCCCCGATCGTATTACCCCGTGCAAACATATATTGTTTATTTAC
ATCCGAGCCCTGGGTGTGTCGCTGGATGATGTATGTCTTCGGCGGAGAACACTTCGACCTTGCCAATTGAATCGTTTGCTTGCTGCACCTGTTATGTTAGATTCACTTGC
TGAAATTGGCCTACGTAGAATATTTCATCAACAATTCTTTCAGGTAAAGGAAAGAGCTTCTGTAACTGTTGTAGATATGGAAGATGGGACTACATGTCCAGTTTGTTTGG
ATGATATGAAGAAGAACGATCGGGTTGTGGCTTGTACGACATGTCGAAATCTTGTTCACGAAGATTGTTTCTCGAGGTGGAAACGAAGCAAAGGAAGGAGAACGGTTAGT
TGCGTAGTTTGTAGGGCAAGGTGGAAAGATACGACAGACCAACAAAAGTATTTGAATTTGTCTGCCTATGTCAATGAGCACGACGCGATCGACAACGATCTCTATAATGG
TTAA
mRNA sequenceShow/hide mRNA sequence
CTTCACTTTCCTGCTTAGTTTGTCATCTTGGTGATGGGTTCTTTCACATTTGGATTGTCGGGTTCTTTGATAGTTGCGATACATTATTAACATCTTATAAACTCAACGCA
TGACTCTTTTTTCATTTATCGTGGGATTTAATTTCTGATGGATGAGAATTTTAGATTGGTGAAACATTTGATGGTTGCTGCGTAATATCATGATTATCGCTGTCTGTAGC
ATAGAGTTTGTTGTCATAGCAAGCTGTTTAGACATTGCAATTTTTTCCTGGTAAAATAAGTAATGCCTGCAAGATCGGTCTTCCAAAGTTTCAATACGAAGATAAAAGAC
CTTTAAAAACAAGCATGAAAGTTGTTAGATCAATTCGAGATCTGAGCTAACCATAGACATATAATACCCATGTGAGCCTTTTCGTTTTACATGAAAATTAAGTACCGTTT
CTGATGTTTGACGATGGATATGAATACAATCGTAATCGTTAAGATTCATTTGATCCTCCCCCAGTTTCCAAGTTTTGATCTTTATATTGTTTGTGGTTGATGATTATAAA
GTGTTTGAGATCTGAACTGCATGTAACAGGTTCTTAAGTGGGTAGAATTGAATGAAGAGGAAAAGGGTGTCCTTTTTTAGAAGGTGGAGAATGTATTGTGGGTGTTTTTT
TGCAGCCGAAGGCGCAGGTAATTTGGCCATTACGTGCCTTGTGGCCTACCACCATTACAAAACCAAACAAGTTAAGCCTACGAATTCTCCGCCTTACACTTTTCCTCTCT
TCTCCATTACCCACCAAAACTATCCTTCTCTTTTCCCTCAAAACTACACAAAGAATCGTGTGGCCACGTTCCCCAATCCATCTTCCTACCTAAAATGTATGTATATATGT
ATATGGATGTATCTAAGAACTGTTTAACCCATCTTAATTGCTTTTCCTTCCAAATTCTCATGGAGTCTGTTGCCTCTACTTCATCCCCACCGGACCGCCCCCGTCTAACC
GTTCCTCGATTCAAGCCCTCCCAAGCTGTGGCTGACCGGATTGTTCGAGCTCTCCACCATCACCTTTGTCTCCTTCACCGCTCCGGTTCTAATTTCTTTGTGTTGGGAGC
CACAGGCAATGTCTACGTCGTGTCTCTATCCTCCACCCCCTCATGCACTTGCCCCGATCGTATTACCCCGTGCAAACATATATTGTTTATTTACATCCGAGCCCTGGGTG
TGTCGCTGGATGATGTATGTCTTCGGCGGAGAACACTTCGACCTTGCCAATTGAATCGTTTGCTTGCTGCACCTGTTATGTTAGATTCACTTGCTGAAATTGGCCTACGT
AGAATATTTCATCAACAATTCTTTCAGGTAAAGGAAAGAGCTTCTGTAACTGTTGTAGATATGGAAGATGGGACTACATGTCCAGTTTGTTTGGATGATATGAAGAAGAA
CGATCGGGTTGTGGCTTGTACGACATGTCGAAATCTTGTTCACGAAGATTGTTTCTCGAGGTGGAAACGAAGCAAAGGAAGGAGAACGGTTAGTTGCGTAGTTTGTAGGG
CAAGGTGGAAAGATACGACAGACCAACAAAAGTATTTGAATTTGTCTGCCTATGTCAATGAGCACGACGCGATCGACAACGATCTCTATAATGGTTAAATGAACATACAT
GTAACATGTTAAGATGTCTCCTAATAAAATGATAGTGTAAAATCATCCTCAAGTATTTCAACTTCTAGATTAGTGTTGAATTAAATCATCCTGTTTATATAGTACTATAA
AGGATTGATTTAAAAAAAAAATGATAGGGAAATGTATATCTAATTGAGATGAAGATGGGTGGTGTGGAAAATGCAATTACTACCCATGAATGGATGAG
Protein sequenceShow/hide protein sequence
MYVYMYMDVSKNCLTHLNCFSFQILMESVASTSSPPDRPRLTVPRFKPSQAVADRIVRALHHHLCLLHRSGSNFFVLGATGNVYVVSLSSTPSCTCPDRITPCKHILFIY
IRALGVSLDDVCLRRRTLRPCQLNRLLAAPVMLDSLAEIGLRRIFHQQFFQVKERASVTVVDMEDGTTCPVCLDDMKKNDRVVACTTCRNLVHEDCFSRWKRSKGRRTVS
CVVCRARWKDTTDQQKYLNLSAYVNEHDAIDNDLYNG