; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G007550 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G007550
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionTransmembrane protein
Genome locationCmo_Chr16:3786312..3791752
RNA-Seq ExpressionCmoCh16G007550
SyntenyCmoCh16G007550
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577290.1 hypothetical protein SDJN03_24864, partial [Cucurbita argyrosperma subsp. sororia]1.2e-14198.21Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSSRLKSCCFPNSFPTNNS FFSLPISPRSFNQLHQFHLHAHNNSTSRFRS CQYGIGAFESE+VAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICG KRSGGLKVDLLDRIEKLEEDFRSLTTVIR LSRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

KAG7015376.1 hypothetical protein SDJN02_23011, partial [Cucurbita argyrosperma subsp. argyrosperma]4.5e-14197.85Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSSRLKSCCFPNSFPTNNS FFSLPISPRSFNQLHQFHLHAHNNSTSRFRS CQYGIGAFESE+VAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICG KRSGGLKVDLLDRIEKLEEDFRSLTTVIR LSRKLEKLG 
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

XP_022929426.1 uncharacterized protein LOC111436002 isoform X1 [Cucurbita moschata]1.2e-146100Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

XP_022929427.1 uncharacterized protein LOC111436002 isoform X2 [Cucurbita moschata]4.3e-14499.28Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAM  QQQKQLELIIAIGEKGKLMESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

XP_022985520.1 uncharacterized protein LOC111483510 isoform X1 [Cucurbita maxima]1.6e-13896.06Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSS LK CCFPNSFPTNNS FFSLPISPRS NQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDR LIWGF LLVATAVLNSWIRRRQWRRICG KRSGGLKVDLLDRIEKLEEDFRSLTTVIR +SRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELI+AIGEKGKL ESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

TrEMBL top hitse value%identityAlignment
A0A6J1EMR8 uncharacterized protein LOC111436002 isoform X22.1e-14499.28Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAM  QQQKQLELIIAIGEKGKLMESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

A0A6J1EN38 uncharacterized protein LOC111436002 isoform X15.9e-147100Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

A0A6J1FNU5 uncharacterized protein LOC1114468911.7e-10174.91Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHN----NSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSE
        MSL SQ LFRCS+RLK C F NS P  N+  FSLPI+ R    L+QFH+H H     N+ S  RSYC YGIG  ESED+ QS+D+ GGDF+LESVLLFSE
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHN----NSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSE

Query:  LFSLFSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLE
        LFSLF+SAVFLV FVVNFVGSSSK+AL VL+GDRGL+WGF LLVAT VLN+WIRRRQWRR+CG K SGGLKV+LLDRIEKLEED RS TTVIRALSRKLE
Subjt:  LFSLFSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLE

Query:  KLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        KLGIRF VT+KT+ D I E+A LAQRN +DT+T AVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEK KL++SKQ  D
Subjt:  KLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

A0A6J1J8G1 uncharacterized protein LOC111483510 isoform X22.1e-13695.34Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSS LK CCFPNSFPTNNS FFSLPISPRS NQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDR LIWGF LLVATAVLNSWIRRRQWRRICG KRSGGLKVDLLDRIEKLEEDFRSLTTVIR +SRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAM  QQQKQLELI+AIGEKGKL ESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

A0A6J1JBJ5 uncharacterized protein LOC111483510 isoform X17.7e-13996.06Show/hide
Query:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
        MSLPSQILFRCSS LK CCFPNSFPTNNS FFSLPISPRS NQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL
Subjt:  MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSL

Query:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI
        FSSAVFLVVFVVNFVGSSSKRALRVLMGDR LIWGF LLVATAVLNSWIRRRQWRRICG KRSGGLKVDLLDRIEKLEEDFRSLTTVIR +SRKLEKLGI
Subjt:  FSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGI

Query:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD
        RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELI+AIGEKGKL ESKQALD
Subjt:  RFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G65250.1 unknown protein1.1e-4144.6Show/hide
Query:  MSLPSQ-ILFRCS-----SRLKSCCFPNS---FPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSN---DKDGGDFDL
        +SLPS+  LF  S     SR  + CF  S       +S  F  PI  R     H     ++      +R++    IG+F  ED + SN   D     FDL
Subjt:  MSLPSQ-ILFRCS-----SRLKSCCFPNS---FPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSN---DKDGGDFDL

Query:  ESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRIC-GRKRSGGLKVDLLDRIEKLEEDFRSLTTV
         S + F+E   + SSAV  VV  VN+V           +G + L  GF  LV +    SW+RRRQW RIC G + S G   +L+ R+EKLE+D +S T++
Subjt:  ESVLLFSELFSLFSSAVFLVVFVVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRIC-GRKRSGGLKVDLLDRIEKLEEDFRSLTTV

Query:  IRALSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMES
        +R LSR LEKLGIRFRVT+K L +PI ETA LAQ+N E T+    Q+++LEKEL EIQKVLLAMQEQQ+KQLELI+ I +  KL ES
Subjt:  IRALSRKLEKLGIRFRVTQKTLMDPIVETAGLAQRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTGCCTTCTCAGATTCTCTTCAGATGTTCGAGCCGTTTGAAGTCTTGTTGCTTCCCCAACTCCTTTCCTACAAACAACAGTTTCTTCTTCTCTCTCCCAATTTC
TCCTCGATCCTTCAATCAACTTCATCAATTCCATCTTCATGCACATAACAATTCAACCTCCCGCTTTCGGAGTTATTGTCAATACGGCATTGGAGCTTTCGAATCTGAAG
ACGTTGCGCAGAGTAACGACAAGGATGGTGGCGATTTCGATTTAGAATCGGTTCTTTTGTTTTCTGAATTGTTTTCTCTCTTTTCTTCGGCTGTTTTCTTGGTTGTTTTT
GTTGTGAATTTCGTGGGTTCGAGTTCGAAGAGAGCGCTTAGGGTATTGATGGGTGATAGGGGTTTGATTTGGGGGTTTTCGCTGCTAGTGGCTACCGCTGTTCTTAACTC
GTGGATTCGAAGACGGCAATGGAGACGAATTTGTGGGCGAAAACGGAGCGGTGGGTTGAAGGTGGATTTGTTGGATAGGATTGAGAAATTAGAGGAGGATTTTAGGAGTT
TGACGACTGTGATTCGGGCCTTGTCTAGGAAGCTTGAGAAGTTGGGCATAAGGTTTAGGGTAACACAAAAAACTCTGATGGATCCAATTGTTGAGACTGCAGGTTTAGCT
CAAAGAAATTATGAGGACACTCAAACTTCGGCTGTGCAAGAAGATGTTCTTGAGAAAGAACTCCTTGAAATACAAAAGGTCTTACTAGCCATGCAGGAGCAGCAGCAAAA
GCAACTTGAGCTGATTATTGCAATAGGAGAAAAAGGGAAGCTGATGGAAAGCAAACAGGCACTTGATTAA
mRNA sequenceShow/hide mRNA sequence
CTGGCACGCCACTCTCAGGTCTTCTCAGACGCCTGTGTTGCTCATGTCCACTGTTATCACCCCGAGTGACCAACAATCTCCATTAATGGTAACTCTGCTTCTACCCATTT
TCGTTCCTGAGAAGAAGCATTAAACATCTTCAACAAGCGACCTCAAATTCTCCTGTATTTCTCTGAAAATGTCATTGCCTTCTCAGATTCTCTTCAGATGTTCGAGCCGT
TTGAAGTCTTGTTGCTTCCCCAACTCCTTTCCTACAAACAACAGTTTCTTCTTCTCTCTCCCAATTTCTCCTCGATCCTTCAATCAACTTCATCAATTCCATCTTCATGC
ACATAACAATTCAACCTCCCGCTTTCGGAGTTATTGTCAATACGGCATTGGAGCTTTCGAATCTGAAGACGTTGCGCAGAGTAACGACAAGGATGGTGGCGATTTCGATT
TAGAATCGGTTCTTTTGTTTTCTGAATTGTTTTCTCTCTTTTCTTCGGCTGTTTTCTTGGTTGTTTTTGTTGTGAATTTCGTGGGTTCGAGTTCGAAGAGAGCGCTTAGG
GTATTGATGGGTGATAGGGGTTTGATTTGGGGGTTTTCGCTGCTAGTGGCTACCGCTGTTCTTAACTCGTGGATTCGAAGACGGCAATGGAGACGAATTTGTGGGCGAAA
ACGGAGCGGTGGGTTGAAGGTGGATTTGTTGGATAGGATTGAGAAATTAGAGGAGGATTTTAGGAGTTTGACGACTGTGATTCGGGCCTTGTCTAGGAAGCTTGAGAAGT
TGGGCATAAGGTTTAGGGTAACACAAAAAACTCTGATGGATCCAATTGTTGAGACTGCAGGTTTAGCTCAAAGAAATTATGAGGACACTCAAACTTCGGCTGTGCAAGAA
GATGTTCTTGAGAAAGAACTCCTTGAAATACAAAAGGTCTTACTAGCCATGCAGGAGCAGCAGCAAAAGCAACTTGAGCTGATTATTGCAATAGGAGAAAAAGGGAAGCT
GATGGAAAGCAAACAGGCACTTGATTAAGAACGAACAAGAATGGAAAGACGCAATTCTGCCAATGAGGGTTCAAAAGAACGGGAAGCTTATGAAATCTGAGGGATGATAT
CAATTATATAGGTTAGATTCATCATACAGATACACTCAAAGAACCGCGTCCCATCTCGAAGGGCCTCATTCTGTCAAACATCAATTGCGATCAGGTGATTCTTGAAAACA
GGTATGTGACAGAAAAGAAACTGATAAATACAGAATGGTAAGCTCTACTTGGTATGGTATGAATTTACAGTTTTACAGAAAAAGATGAGAAAAGAATTCACTCAGGTGAT
TGTTCCTTCTTGGGCGGAACGGTCCTCCTCCTCAACATCAAAAGATCCCCAAATCTCCTCATCAATGCCTTCTTCTTCCTTGAATTCCTATCACCATCCATCTCATCCAG
GTTCCCCTCTTCCAACTGATCAAGATCCTCATCATTCATCATCTCCCCATCTTCACCACACTGAGAAAATGAGTGCTTCCTTTCATGTTTCCTGGGGGAACCAGGCGAGT
CGACATCGTCAAAAATCGACCCTTTAAGCGTCTCCTCCATTTCAGGCTCTTCCTCCTTCTCTTTCTCTTTCTCTTCTTCCACCTCCTTCTTCTTCTCTGGCATGATCCTC
ATGTCCAAGAAACTGAAACTGAAAGCCTTCCCTAGCCTTCTCCCAAATGTATCTTCCTTCTCCACTGGAGTTGGACTCGGATTCGGATTCGGAGTCGGACATGGACTCAA
CGGTGGCTTCGATCTTGTAATCTCCACTTGCTCCTTGCCTTCTTCTTTGCTCTTGCCTTCTTCTTTTCTCTTGCCTTCTTCTTTGCTCTTGCTTTCCTCTTGGCTCTTGC
TTTCATCTTTACTTTTGTTTTCTTCTTTCTTATTTGCCTCTTCTAGCAACTGCTTCAGCTCCTTAATCTCCTCTAGAGATGCAGCTTGGCTGACTTTGAGAGTTTCATTC
TCACTGCTGACAAAATCCAATGCATTTTCCTTCTCAGCCAAACAGTCCTTTAGCTGTGAATTCTCTTCTATTGCAATCCCTGCAGCTTCCTTGGCCACACTGGCTTCATT
TAAGGCCTGTTTTAGAATGTCCCTCACTTTCTTAATCTCTTCTTTTGATGTCATGTTCTTAAGCTCTGCTAGCCTCAGGGCATCCGTCAAACGGCGATTCTCTTGCTGGG
CATTGTATCTGTCTTCTTCAGCACTTCTTATGCAGTCCACTAAGCTTGTTTCTCTCCCACTCCATGCCACTAGAGACTCCTCTGCTTCCGATCTCAGCCTGTCCACTGTG
CTCTTGTACAAATCTGCATCTTTTCTTGCATCTTGCAACAAAGATTTGTACTTTTCCTCTGTGTTTTTCAATGTTGTTTTCAAATTCTCTGCTTCATCTTTTGTTTGCAT
TAACTCCTCTTCGACTGTGCTGTATTTTCCCTTCAAATGACTCGCTTCTGTTGCTACCTCTTTCAATGCCATTGCTAGATCATCCATGGCTTTCTTGTTGTTCTCCTCTG
CCTCTGTAGCCAACTTTAGCTCGTTGTTTAGCAGAAACAGACAGTGTTTTGTCGATTCGAGCTCGGATTTCATCCTGTCGAATTCAATTTTTGTTGTAATGCTCTGTTTT
GGATTGTAGATGTAATTGGTGATGGAAGAACATTTCTCAAGTTTTTCACGAAGGGATTGGATCTCTAGTTTTGATTCTTCGAGCGCAATCTTCGTCCGTTCGAGCTGTTT
TGTCTGTGCAGATAGCAAATCCTTGGTTTTCTCTTCTGATTCTTTGGAACTTTTGTACTTTTTTTCGAGTTCTTCAATTCTTGTTTGGCTTCGTGATTCAGAGCGTTTCA
AAACAGAGAGCTCCTTCGCCATCTTTGCCAAATTAGAATCCTTCTCTACCAATTTAAGCTCCAATTCCTTAGCTTTTCCCATCGCCAAGTCTCTGTCTCTCTTCACAGAT
TTCAACTCTGTAATGAGTTCTATATCGACCTCGTCCGCCATTTTCTGTATCTCATTAAACATTAGAAGCTCTGTTAAAGCTACTTCGTCTTCCTCTGCCACTACAGATTT
ATCATCTGTCTCCTCTGTTTGCATCTCAGAAACTCAGATCAAATCTCCTGATGAATGGGAAAAACAAACAAAACCCAGATTTTCTAGACATCAAAATTCATTAGTAAATG
AAAAGAAACGAAGAAAGAATCAAAGTTGAAGAACAGGAAGATTCAAGGAAGAAGAACAGGTCAAATGGCAGAAAACCTGAATCATTCAAACCAAACTTCTGCAAGAAAGG
GAAACATACCGAACCCGAGTTGGTGAAAGGTGAAAGAACAGGCAATTCCATTCAAAGGAAAACGATAGAGAATTGGGAGATTGCAGTAAATTTGGGAGAAGAAGATGAAT
TGAGAAGAAGAAATTTGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAG
Protein sequenceShow/hide protein sequence
MSLPSQILFRCSSRLKSCCFPNSFPTNNSFFFSLPISPRSFNQLHQFHLHAHNNSTSRFRSYCQYGIGAFESEDVAQSNDKDGGDFDLESVLLFSELFSLFSSAVFLVVF
VVNFVGSSSKRALRVLMGDRGLIWGFSLLVATAVLNSWIRRRQWRRICGRKRSGGLKVDLLDRIEKLEEDFRSLTTVIRALSRKLEKLGIRFRVTQKTLMDPIVETAGLA
QRNYEDTQTSAVQEDVLEKELLEIQKVLLAMQEQQQKQLELIIAIGEKGKLMESKQALD