; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009319 (gene) of Snake gourd v1 genome

Gene IDTan0009319
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG10:15949499..15950332
RNA-Seq ExpressionTan0009319
SyntenyTan0009319
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-9377.37Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC
        MEIKHKGK+HPS     P++SSSVFKLLPAAILALVSVLS DEREVLAYMIARSIQSSA TS+H SRKKS KK+SI GGN NVST++HK P+FSCDCFYC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE+PKK+  +GKRRDRI RQ+T+ S PVVQRP    DECVV+P SPER+SAAAS  E   GSPVK+ G+S
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS

Query:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        G  +EV G      HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]2.0e-9477.78Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC
        MEIKHKGK+HPS     P++SSSVFKLLPAAILALVSVLS DEREVLAYMIARSIQSSA TS+H SRKKS KK+SINGGN NVST++HK P+FSCDCFYC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE+PKK+  +GKRRDRI RQ+T+ S PVVQRP    DECVV+P SPER+SAAAS  E   GSPVK+ G+S
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS

Query:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        G  +EV G      HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]2.4e-9578.6Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC
        MEIKHKGK+HPS     P++SSSVFKLLPAAILALVSVLS DEREVLAYMIARSIQSSA TS+H SRKKS KK+SINGGN NVST +HK PMFSCDCFYC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE+PKK+  +GKRRDRIGRQ+T+ S PVVQRPP  A ECVV+P SPER+SA AS  E   GSPVK+ G+S
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS

Query:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        G  +EV G      HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_023528171.1 uncharacterized protein LOC111791162 [Cucurbita pepo subsp. pepo]3.5e-9477.64Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC
        MEIKHKGK+HPS     P++SSSVFKLLPAAILALV+VLS DEREVLAYMIARSIQSSA TS+H SRKKS KK+SI+GGN NVST +HK PMFSCDCFYC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE+PKK+  +GKRRDRIGRQ+T+ S PVVQRPP  ADECVV+P SPER+SAAAS  E   GSPVK+ G+S
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS

Query:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSR---LWSLWSPNL
        G  +EV G      HQKGLATKVLPDVLGFFNSR   LWSLWSPNL
Subjt:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSR---LWSLWSPNL

XP_038904588.1 uncharacterized protein LOC120090946 [Benincasa hispida]1.8e-8773.06Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNV-----STAHHKIPMFSC
        MEIKHKGKIHPS     P++SSSVFKLLP AILALVS+LS D+REVLAYMIARSIQSSAFTS+  SRKKST+K+SINGGNGNV     +T +HK PMFSC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNV-----STAHHKIPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVKDDG
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GE+PKK+  RGKRRDRIGRQ T  + PV+Q PP  ADECV +P   ER     +  EGSPVK+  
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVKDDG

Query:  DSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        +SG VEEV    GG  H+KGL TKVLPDVLGF NSRLWSLWSPNL
Subjt:  DSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

TrEMBL top hitse value%identityAlignment
A0A0A0KTW2 Uncharacterized protein4.0e-8070.97Show/hide
Query:  MEIKHKGKIHPS--PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTS-SHASRKKSTKKSSINGGNGNV--------STAHHKIPMFS
        MEIKHK KIHPS  P++SSSVFKLLPAAILAL S+LS D+REVLAYMIARSIQSSAFTS +  SRKKSTKK  IN GN NV        +T +HK P+FS
Subjt:  MEIKHKGKIHPS--PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTS-SHASRKKSTKKSSINGGNGNV--------STAHHKIPMFS

Query:  CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQST--EISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVK
        CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GE+PKK+  RGKRRDRIGRQ +    + PVV  P    DECV +P SP  E       EGS VK
Subjt:  CDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQST--EISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVK

Query:  DDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        +  +SG V E VG  GGE HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  DDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A1S3BXS0 uncharacterized protein LOC1034945679.9e-7969.6Show/hide
Query:  MEIKHKGKIHPS--PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSA-FTSSHASRKKSTKKSSINGGNGNV-------STAHHKIPMFSC
        MEIKHK KIHPS  P++SSSVFKLLPAAILAL S+LS D+REVLAYMIARSIQSSA  TS+  SRKKSTKK SIN GN NV       +T +HK P+FSC
Subjt:  MEIKHKGKIHPS--PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSA-FTSSHASRKKSTKKSSINGGNGNV-------STAHHKIPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQST-----EISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSP
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GE+PKK+  RGKRRDRIGRQ +       + PV+  P   ADECV +  SP  E       EGS 
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQST-----EISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSP

Query:  VKDDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        VK+  ++G V E VG  GGE HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  VKDDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A5A7TRT9 Uncharacterized protein4.4e-7969.6Show/hide
Query:  MEIKHKGKIHPS--PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSA-FTSSHASRKKSTKKSSINGGNGNV-------STAHHKIPMFSC
        MEIKHK KIHPS  P++SSSVFKLLPAAILAL S+LS D+REVLAYMIARSIQSSA  TS+  SRKKSTKK SIN GN NV       +T +HK P+FSC
Subjt:  MEIKHKGKIHPS--PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSA-FTSSHASRKKSTKKSSINGGNGNV-------STAHHKIPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQST-----EISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSP
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GE+PKK+  RGKRRDRIGRQ +       + PV+  P   ADECV +  SP  E       EGS 
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQST-----EISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSP

Query:  VKDDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        VK+  ++G V E VG  GGE HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  VKDDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1GV32 uncharacterized protein LOC1114577379.8e-9577.78Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC
        MEIKHKGK+HPS     P++SSSVFKLLPAAILALVSVLS DEREVLAYMIARSIQSSA TS+H SRKKS KK+SINGGN NVST++HK P+FSCDCFYC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE+PKK+  +GKRRDRI RQ+T+ S PVVQRP    DECVV+P SPER+SAAAS  E   GSPVK+ G+S
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS

Query:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        G  +EV G      HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1IQ09 uncharacterized protein LOC1114794671.2e-9578.6Show/hide
Query:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC
        MEIKHKGK+HPS     P++SSSVFKLLPAAILALVSVLS DEREVLAYMIARSIQSSA TS+H SRKKS KK+SINGGN NVST +HK PMFSCDCFYC
Subjt:  MEIKHKGKIHPS-----PATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE+PKK+  +GKRRDRIGRQ+T+ S PVVQRPP  A ECVV+P SPER+SA AS  E   GSPVK+ G+S
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGE---GSPVKDDGDS

Query:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        G  +EV G      HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GAVEEVVGGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein1.2e-2340.67Show/hide
Query:  EIKHKGKIHPSP---ATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYCYTA
        ++  KG +HPSP    ++  +  LLP AI +L +VLS ++REVLAY+I+        T+S++  +  T + +    +      +H  P+F CDCF CYT+
Subjt:  EIKHKGKIHPSP---ATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYCYTA

Query:  YWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGK-RRDRIGRQST
        YW RWDSSP+R+LIH+ I+AFED L   +  KK+ +  K RR R G+ S+
Subjt:  YWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGK-RRDRIGRQST

AT1G24270.1 unknown protein3.3e-2645Show/hide
Query:  MEIKHKGKIHPSPATSS----------SVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSC
        M++  KGK+HPSP   S          SVFKLL +AIL LVSVLS ++ EVLAY+I RS+ ++   S    +K+S                 HK P+  C
Subjt:  MEIKHKGKIHPSPATSS----------SVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEIS
         CF CYT+YW +WDSS NRELI+Q IEAFEDHLT   R + SAS   ++++   +  EIS
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEIS

AT1G62422.1 unknown protein1.4e-2134.35Show/hide
Query:  KGKIHPSP----ATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYCYTAYWC
        KG +HPSP     T      LLP AIL+LV+ LS ++REVLAY+I     S++  S+  SR K  K+             +H  P+F CDCF CYT+YW 
Subjt:  KGKIHPSP----ATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYCYTAYWC

Query:  RWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVKDDGDSGAVEEVVGG
        RWD+SP R+LIH+ I+A+ED L   E  KK   R KR  +   +   I                         S  +  G  S     GDS    E  G 
Subjt:  RWDSSPNRELIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVKDDGDSGAVEEVVGG

Query:  TGGEHHQKGLATKVLPDVLGFFNSRLWSLW
         GGE  +K   +  +  V+ F   R   +W
Subjt:  TGGEHHQKGLATKVLPDVLGFFNSRLWSLW

AT5G13090.1 unknown protein2.2e-3842.24Show/hide
Query:  MEIKHKGKIHPSPA--------------------TSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVST
        M++K KGK++PSP                      S SV KLLPA IL LVSVLS +EREVLAY+I R    S     ++S K  TKK S      N S+
Subjt:  MEIKHKGKIHPSPA--------------------TSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVST

Query:  AHHKIPMFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSAS-RGKRRDRIGRQSTEI---------------SFPVVQRPPQAADEC
         +HK P+F C+CF CYT YW RWDSSPNRELIH+ IEAFE+H        +S S RGK++++ GR+ T+                S PVV+   +     
Subjt:  AHHKIPMFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGERPKKSAS-RGKRRDRIGRQSTEI---------------SFPVVQRPPQAADEC

Query:  VVLPPSPERESAAASAGEGSP-----VKDDGDSGAVEEVV--GGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPN
             SP R S A  A EG P     V++DG       VV           KGLA KVLPDVLG F+S  W LW+PN
Subjt:  VVLPPSPERESAAASAGEGSP-----VKDDGDSGAVEEVV--GGTGGEHHQKGLATKVLPDVLGFFNSRLWSLWSPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGCACAAAGGTAAAATACACCCATCGCCGGCAACTTCCTCGTCCGTCTTCAAGCTCCTTCCGGCCGCTATTTTAGCCCTAGTTTCAGTCCTCTCTCACGA
CGAACGTGAGGTCTTGGCCTACATGATCGCCAGGTCCATCCAATCCTCCGCATTCACTTCCTCTCACGCTTCGAGGAAGAAATCGACGAAAAAATCTTCGATCAATGGCG
GAAACGGTAATGTTAGTACTGCTCATCACAAAATTCCGATGTTTAGTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAATCGCGAA
CTAATCCACCAGGCGATTGAGGCGTTTGAAGATCACTTGACGAGCGGCGAGAGGCCGAAGAAGAGCGCCAGCCGAGGAAAGAGGAGAGACAGAATCGGCCGTCAAAGTAC
TGAGATATCTTTTCCTGTTGTTCAGCGTCCGCCGCAGGCGGCCGATGAGTGCGTCGTTCTTCCGCCGTCTCCGGAGCGTGAATCGGCGGCGGCTTCTGCGGGGGAAGGAA
GTCCGGTGAAGGACGACGGAGATAGCGGTGCGGTGGAGGAAGTGGTGGGCGGCACCGGCGGCGAGCATCATCAGAAGGGTTTAGCGACGAAGGTACTACCGGACGTGTTG
GGGTTTTTCAATTCCCGTTTGTGGAGTCTGTGGAGTCCGAATCTTTAA
mRNA sequenceShow/hide mRNA sequence
CAAAAATAAATAAAATTTTCTTTTTAAGCAATGAAGCAGAAAATAAAAACACAGTCCAAACCAAAAATCCTCTGTTCGTTTTCCCCTAAAACTTCCAATGGAGATTAAGC
ACAAAGGTAAAATACACCCATCGCCGGCAACTTCCTCGTCCGTCTTCAAGCTCCTTCCGGCCGCTATTTTAGCCCTAGTTTCAGTCCTCTCTCACGACGAACGTGAGGTC
TTGGCCTACATGATCGCCAGGTCCATCCAATCCTCCGCATTCACTTCCTCTCACGCTTCGAGGAAGAAATCGACGAAAAAATCTTCGATCAATGGCGGAAACGGTAATGT
TAGTACTGCTCATCACAAAATTCCGATGTTTAGTTGCGATTGCTTCTACTGCTACACCGCCTACTGGTGCCGCTGGGACTCCTCTCCCAATCGCGAACTAATCCACCAGG
CGATTGAGGCGTTTGAAGATCACTTGACGAGCGGCGAGAGGCCGAAGAAGAGCGCCAGCCGAGGAAAGAGGAGAGACAGAATCGGCCGTCAAAGTACTGAGATATCTTTT
CCTGTTGTTCAGCGTCCGCCGCAGGCGGCCGATGAGTGCGTCGTTCTTCCGCCGTCTCCGGAGCGTGAATCGGCGGCGGCTTCTGCGGGGGAAGGAAGTCCGGTGAAGGA
CGACGGAGATAGCGGTGCGGTGGAGGAAGTGGTGGGCGGCACCGGCGGCGAGCATCATCAGAAGGGTTTAGCGACGAAGGTACTACCGGACGTGTTGGGGTTTTTCAATT
CCCGTTTGTGGAGTCTGTGGAGTCCGAATCTTTAAATTATTCTAATTGAATTTCAAAATTTTCC
Protein sequenceShow/hide protein sequence
MEIKHKGKIHPSPATSSSVFKLLPAAILALVSVLSHDEREVLAYMIARSIQSSAFTSSHASRKKSTKKSSINGGNGNVSTAHHKIPMFSCDCFYCYTAYWCRWDSSPNRE
LIHQAIEAFEDHLTSGERPKKSASRGKRRDRIGRQSTEISFPVVQRPPQAADECVVLPPSPERESAAASAGEGSPVKDDGDSGAVEEVVGGTGGEHHQKGLATKVLPDVL
GFFNSRLWSLWSPNL