CuGenDBv2

CmUC01G024100 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC01G024100
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Unknown protein
Genome location: CmU531Chr01:35594378..35596721
RNA-Seq Expression: CmUC01G024100
Synteny: CmUC01G024100
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
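
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not say so) and using a hypothetical helper named `parse_location`, the string can be split into its parts and the span length computed:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


# Matches strings such as "CmU531Chr01:35594378..35596721".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end' (hypothetical helper, not part of CuGenDB)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("CmU531Chr01:35594378..35596721")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that coordinate assumption, the gene spans 2344 bp on CmU531Chr01.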


Homology
GenBank top hits (e value, % identity, alignment)
KAG6594466.1 hypothetical protein SDJN03_11019, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.2e-80; identity: 62.93%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M +  L  ARD F +MKNF  +F +QT+Y TTAG  ASS IISS+GL+LIY  TQ  KEK   RVFTRSMS+GALHGG+ AMKR+LQYHK+RA  + +  
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL
        YL+KLE  +N+  P+F  IQ ++AK+EMIGQEDKAIE+LK+A K+AKE SL + EYEYQMLLVEALIYKG+  EA    CLN+D+ SDVRR LYK II+L
Subjt:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL

Query:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK
        LL N Q+AEEEWE+F+ MR Q+  PPD+KDS FY L+N F+ FK+VV+LLKQEI +KKK
Subjt:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK

XP_004134647.2 uncharacterized protein LOC101211314 [Cucumis sativus] (e value: 5.4e-80; identity: 60.82%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M H       DFF  + NF+  F VQT+Y   AG  AS+FI+S VGLVL+Y +T+++K+KN  RVFTRS+S+GALHGGK AMKRLLQ+ KMRA P+NKDK
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEE------KINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLY
        ++KKL+       KI S  PNF KIQ IV KLEM+GQEDKAIE LK A ++AK+ SLP  E+EYQMLLVE  IYKG+  +AE +PCL ND TSDVRRPLY
Subjt:  YLKKLEE------KINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLY

Query:  KAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK
        KAII++L   TQ+A +EWEEF+ MR+ +L PPDVKDS FYALL DF+ FK+VV +L+++IF KK +AK
Subjt:  KAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK

XP_022146501.1 uncharacterized protein LOC111015701 [Momordica charantia] (e value: 4.3e-85; identity: 66.54%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M H  L +ARDF LIM+NF  +F +QT+Y T+A  AASS IIS +GLVLIYV TQ  +EKN  RVF RSMS+GALHGGK AMKRLLQY KMRAT KN+D+
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL
         L+KLE+ I    P+F K+Q IVAKLEM GQEDKAIE+LK+A K+AKENSL H EYEYQ+LLVE LIYKGN  EAE   CLN ++TSDVRR LYKAIIQ+
Subjt:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL

Query:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKK
        LL N +KA+E+WEEFK MR+++L PPDVKDSQFY L+ +FE FKQVV LL ++I E+ K+
Subjt:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKK

XP_022926668.1 uncharacterized protein LOC111433729 [Cucurbita moschata] (e value: 4.1e-80; identity: 62.93%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M +  L  ARD F++MKNF  +F +QT+Y TTAG  ASS IIS +GL+LIY  TQ  KEK   RVFTRSMS+GALHGG+ AMKR+LQYHK+RA  + +  
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL
        YL+KLE   N+  P+F  IQ ++AK+EMIGQEDKAIE+LK+A K+AKE SL + EYEYQMLLVEALIYKG+  EA    CLN+D+ SDVRR LYK II+L
Subjt:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL

Query:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK
        LL N QKAEEEWE+F+ MR Q+  PPD+KDS FY L+N FE FK+VV LLKQ+I +KKK
Subjt:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK

XP_038882985.1 uncharacterized protein LOC120074072 [Benincasa hispida] (e value: 2.0e-98; identity: 74.71%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M HS L  ARDFFL M NFF+NF VQT+Y T AG AAS FIIS +GLVLIYV TQ+IKEKN  RVF RS+SMGALH GK AMKRLLQYHKMRATPK K+ 
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEEKIN--SVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAII
        YL+K E+ IN  + RPNF K+Q I+AKLEMIGQEDKAIE+LKRA ++A+ENS P+ EYEYQMLLVEALIYKGNFA AE VPCLNN+D SDVRR LYKAII
Subjt:  YLKKLEEKIN--SVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAII

Query:  QLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK
        QLLL NTQKAEEEWEEFKNMR+ +L PPDVKDSQF+ LL DF+ FKQVV++LK++IFEK+K
Subjt:  QLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLZ3 Uncharacterized protein (e value: 2.6e-80; identity: 60.82%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M H       DFF  + NF+  F VQT+Y   AG  AS+FI+S VGLVL+Y +T+++K+KN  RVFTRS+S+GALHGGK AMKRLLQ+ KMRA P+NKDK
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEE------KINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLY
        ++KKL+       KI S  PNF KIQ IV KLEM+GQEDKAIE LK A ++AK+ SLP  E+EYQMLLVE  IYKG+  +AE +PCL ND TSDVRRPLY
Subjt:  YLKKLEE------KINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLY

Query:  KAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK
        KAII++L   TQ+A +EWEEF+ MR+ +L PPDVKDS FYALL DF+ FK+VV +L+++IF KK +AK
Subjt:  KAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK

A0A5D3CMZ4 Uncharacterized protein (e value: 1.4e-52; identity: 66.28%)
Query:  MRATPKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVR
        MRA P+ KDKYLKKL+ KI S  P+FLK+Q IVAKLEM+GQEDK IE LK A +KA E S P  EYEYQMLLVE  IYKG FA+AE +PCLNN+D SDVR
Subjt:  MRATPKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVR

Query:  RPLYKAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK
        RPL+KAII++LL  TQ+A +EWEEF+ +R+ YL PPDVKDSQFY LL DF+ F++VV +L+++IF KK +AK
Subjt:  RPLYKAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK

A0A6J1CZJ5 uncharacterized protein LOC111015701 (e value: 2.1e-85; identity: 66.54%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M H  L +ARDF LIM+NF  +F +QT+Y T+A  AASS IIS +GLVLIYV TQ  +EKN  RVF RSMS+GALHGGK AMKRLLQY KMRAT KN+D+
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL
         L+KLE+ I    P+F K+Q IVAKLEM GQEDKAIE+LK+A K+AKENSL H EYEYQ+LLVE LIYKGN  EAE   CLN ++TSDVRR LYKAIIQ+
Subjt:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL

Query:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKK
        LL N +KA+E+WEEFK MR+++L PPDVKDSQFY L+ +FE FKQVV LL ++I E+ K+
Subjt:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKK

A0A6J1EIU2 uncharacterized protein LOC111433729 (e value: 2.0e-80; identity: 62.93%)
Query:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK
        M +  L  ARD F++MKNF  +F +QT+Y TTAG  ASS IIS +GL+LIY  TQ  KEK   RVFTRSMS+GALHGG+ AMKR+LQYHK+RA  + +  
Subjt:  MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDK

Query:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL
        YL+KLE   N+  P+F  IQ ++AK+EMIGQEDKAIE+LK+A K+AKE SL + EYEYQMLLVEALIYKG+  EA    CLN+D+ SDVRR LYK II+L
Subjt:  YLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQL

Query:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK
        LL N QKAEEEWE+F+ MR Q+  PPD+KDS FY L+N FE FK+VV LLKQ+I +KKK
Subjt:  LLRNTQKAEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK

A0A6J1KTB7 uncharacterized protein LOC111497439 (e value: 2.1e-77; identity: 62.55%)
Query:  ARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEK
        ARD FL+MKNF  +F +QT+Y TTAG  ASS IIS +GL+LIY  TQ  KEK   RVFTRSMS+GALHGG+ AMKR+LQY KMRA  + +  YL+KLE  
Subjt:  ARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEK

Query:  INSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQLLLRNTQKA
         ++  P+F  IQ ++ K+EM GQEDKAIE+LK+A K+AKE SL + EYEYQMLLVEALIYKG+  EA    CLN+D+ SDVRR LYK II+LLL N QKA
Subjt:  INSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQLLLRNTQKA

Query:  EEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK
        EEEWE+F+ MR  +  PPD++DS FY L+N FE FK+VV LLKQ+I +KKK
Subjt:  EEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G34530.1 unknown protein (e value: 1.6e-26; identity: 36.36%)
Query:  RVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLV
        R  ++S+SMGA+ GGK A++RLL  H  R    +      + E  ++  +P+F  +Q  + K+EM G+E K  ELLK+A +KA++    H+ YE +MLLV
Subjt:  RVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLV

Query:  EALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQLLLRNTQK-AEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKK
        E LIY GN  EA    CL ++  +D RRPLY+ II  L  +  K  EE +  F+ ++    +P   ++ +   +   F++FK+V+  LK EI +  K+
Subjt:  EALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQLLLRNTQK-AEEEWEEFKNMRNQYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKK

AT2G34530.2 unknown protein (e value: 1.9e-22; identity: 42.11%)
Query:  RVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLV
        R  ++S+SMGA+ GGK A++RLL  H  R    +      + E  ++  +P+F  +Q  + K+EM G+E K  ELLK+A +KA++    H+ YE +MLLV
Subjt:  RVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLV

Query:  EALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKA
        E LIY GN  EA    CL ++  +D RRPLY+A
Subjt:  EALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKA

AT2G34540.2 unknown protein (e value: 2.0e-08; identity: 30.38%)
Query:  KPAMKRLLQYHKMRAT----PKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAE
        K A++ L +   M A+    P  K   L KL    +    + +K++ +    E  G+ ++A++LL+ A  + +    P   +  QM LVE LI    + E
Subjt:  KPAMKRLLQYHKMRAT----PKNKDKYLKKLEEKINSVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAE

Query:  AEMVPCLNNDDT--SDVRRPLYKAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKD
        A    CLN+++   SDVR PLYKAII  +L    +A++ W+EF+    +   P   +D
Subjt:  AEMVPCLNNDDT--SDVRRPLYKAIIQLLLRNTQKAEEEWEEFKNMRNQYLFPPDVKD


Sequences
CDS sequence
ATGTTTCATTCCTTATTATTAGAGGCAAGAGACTTTTTCCTGATAATGAAAAACTTCTTCAACAACTTTACGGTCCAAACCAGATATGCAACAACAGCAGGGACCGCAGC
GTCGAGTTTCATTATCTCGAGCGTTGGTCTCGTGTTGATTTATGTCCTTACTCAGGTTATAAAGGAGAAAAATTACCACCGTGTTTTTACAAGGTCAATGTCCATGGGAG
CTCTACATGGTGGCAAACCAGCCATGAAAAGATTGCTTCAATACCACAAAATGAGAGCAACCCCAAAAAACAAAGATAAATATCTGAAGAAGTTAGAGGAAAAGATCAAC
TCAGTCCGTCCTAATTTCCTGAAGATTCAGGGCATTGTGGCAAAGCTGGAAATGATAGGACAAGAAGATAAAGCTATTGAATTATTAAAAAGAGCAGAAAAAAAAGCTAA
GGAAAATTCACTTCCACACCAAGAATATGAATATCAAATGCTTCTCGTGGAAGCACTTATTTACAAGGGAAACTTTGCGGAAGCAGAAATGGTTCCATGCTTGAATAACG
ATGACACTTCAGATGTTCGACGCCCGTTATATAAGGCTATAATTCAACTACTGCTAAGGAACACCCAGAAAGCAGAAGAAGAATGGGAAGAGTTTAAGAATATGAGAAAC
CAGTACCTGTTTCCACCTGACGTTAAAGACTCTCAATTTTACGCTCTCCTGAACGATTTCGAGCAGTTCAAACAAGTGGTCCACCTGCTCAAACAAGAAATTTTTGAGAA
GAAGAAGAAAGCAAAATAG
mRNA sequence
TCTACATACAAATTATATTTAAAATTATTGCTGTTCGCCCTCATTCGGCACCATAATCTGGACATCTCTGTTCTTGGAACATCCATTGAAGTTTTGATTCCTGTTGACAC
AATCTAGAATGTTTCATTCCTTATTATTAGAGGCAAGAGACTTTTTCCTGATAATGAAAAACTTCTTCAACAACTTTACGGTCCAAACCAGATATGCAACAACAGCAGGG
ACCGCAGCGTCGAGTTTCATTATCTCGAGCGTTGGTCTCGTGTTGATTTATGTCCTTACTCAGGTTATAAAGGAGAAAAATTACCACCGTGTTTTTACAAGGTCAATGTC
CATGGGAGCTCTACATGGTGGCAAACCAGCCATGAAAAGATTGCTTCAATACCACAAAATGAGAGCAACCCCAAAAAACAAAGATAAATATCTGAAGAAGTTAGAGGAAA
AGATCAACTCAGTCCGTCCTAATTTCCTGAAGATTCAGGGCATTGTGGCAAAGCTGGAAATGATAGGACAAGAAGATAAAGCTATTGAATTATTAAAAAGAGCAGAAAAA
AAAGCTAAGGAAAATTCACTTCCACACCAAGAATATGAATATCAAATGCTTCTCGTGGAAGCACTTATTTACAAGGGAAACTTTGCGGAAGCAGAAATGGTTCCATGCTT
GAATAACGATGACACTTCAGATGTTCGACGCCCGTTATATAAGGCTATAATTCAACTACTGCTAAGGAACACCCAGAAAGCAGAAGAAGAATGGGAAGAGTTTAAGAATA
TGAGAAACCAGTACCTGTTTCCACCTGACGTTAAAGACTCTCAATTTTACGCTCTCCTGAACGATTTCGAGCAGTTCAAACAAGTGGTCCACCTGCTCAAACAAGAAATT
TTTGAGAAGAAGAAGAAAGCAAAATAGTGGGGAAAAATGAAAATGGGCGTTGAAAGCCAAAGCAAACAACCCCTCTAATCATCACCAATTATGTGTGCTATGAGTAGAAT
ATGAATAATTGAGGAAATTCCAAATAACAAAAATAAAATGGAGGGTGAAATAGTATATAATAAAGTAATAACGTAATTGAATAATAGTATATATATGTACTTTTTTTTCC
TCAAACATTACTGGTTTTTCAGTGTAGTCTTAATTCTTTCAAGTTTAAATTAATTATGGATGTCTTAGACATTTTTCTATGTTCACTGATAGTAACTTGTATCACAAAAA
TTTTTACTCTATTCATACAAATTTGGAGGAGAAAAAAGATTATTTGAACAAAGAAAAATTGAAGTTAC
Protein sequence
MFHSLLLEARDFFLIMKNFFNNFTVQTRYATTAGTAASSFIISSVGLVLIYVLTQVIKEKNYHRVFTRSMSMGALHGGKPAMKRLLQYHKMRATPKNKDKYLKKLEEKIN
SVRPNFLKIQGIVAKLEMIGQEDKAIELLKRAEKKAKENSLPHQEYEYQMLLVEALIYKGNFAEAEMVPCLNNDDTSDVRRPLYKAIIQLLLRNTQKAEEEWEEFKNMRN
QYLFPPDVKDSQFYALLNDFEQFKQVVHLLKQEIFEKKKKAK
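
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, and only the first and last codons of the CDS are pasted in to keep the example short:

```python
from itertools import product

# Standard genetic code (ASSUMED to apply here), built in the conventional
# T, C, A, G ordering of first, second and third codon positions.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper: whitespace and line breaks are ignored, and any
    codon containing a character other than A, C, G or T raises KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons and last four codons of the CDS sequence in this record.
print(translate("ATGTTTCATTCCTTATTATTAGAG"))  # MFHSLLLE
print(translate("AAAGCAAAATAG"))              # KAK
```

Both fragments agree with the start (`MFHSLLLE`) and end (`KAK`) of the protein sequence shown. Passing the full CDS, with its line breaks, to the same function would allow a comparison over the whole length.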