; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1036 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1036
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTransmembrane protein
Genome locationMC01:15901633..15907022
RNA-Seq ExpressionMC01g1036
SyntenyMC01g1036
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026852.1 hypothetical protein SDJN02_10859 [Cucurbita argyrosperma subsp. argyrosperma]1.65e-15579.25Show/hide
Query:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA
        MS+AFQYLSLSS SPSPPPS TFYFS+F SRNPC SLRFAP  FP+ L HFQ LDHKLRSPFNF SIN HQF PRVS S G GRRD  D  F++DS LSA
Subjt:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA

Query:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ
        AELFCLV+SLLASVG ALN VKA SKS+FLAVFGD I VGA LFLVAGVAIGAWIRRRQWNRIYR TAK  LE++LVERTNKLEEDL++SATLIRVLSRQ
Subjt:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ

Query:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI
        LEKLG RFR TRKALKKP+EE         TA LAQKTSEATRALAVRGDILE EL EIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQE +  QSP 
Subjt:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI

Query:  GRHDLVDERLNKKEVHDI
        GRHDL+DER+N+KEV D+
Subjt:  GRHDLVDERLNKKEVHDI

XP_022132906.1 uncharacterized protein LOC111005633 isoform X1 [Momordica charantia]1.10e-20497.16Show/hide
Query:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
        MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
Subjt:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA

Query:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
        ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
Subjt:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG
        EKLGIRFRVTRKALKKPIEE         TAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG
Subjt:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG

Query:  RHDLVDERLNKKEVHDI
        RHDLVDERLNKKEVHDI
Subjt:  RHDLVDERLNKKEVHDI

XP_022132907.1 uncharacterized protein LOC111005633 isoform X2 [Momordica charantia]4.68e-16796.62Show/hide
Query:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
        MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
Subjt:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA

Query:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
        ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
Subjt:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ
        EKLGIRFRVTRKALKKPIEE         TAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ
Subjt:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ

XP_023518369.1 uncharacterized protein LOC111781875 [Cucurbita pepo subsp. pepo]3.72e-15478.62Show/hide
Query:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA
        MS+AFQYLSLSS SPSPPPS TFYFS+F SRNPC SLR+AP  F ++ LHFQ LDHKLRSPFNF SIN HQF PRVS S G GRRD  D  F++DS LSA
Subjt:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA

Query:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ
        AELFCLV+SLLASVG ALN VKA SKS+FLAVFGD I VGA LFLVAGVAIGAWIRRRQWNRIYR TAK  LE++LVERTNKLEEDL++SATLIRVLSRQ
Subjt:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ

Query:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI
        LEKLG RFR TRKALKKP+EE         TA LAQKTSEATRALAVRGDILE EL EIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQE +  QSP 
Subjt:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI

Query:  GRHDLVDERLNKKEVHDI
        GRHDL+DER+N+KEV D+
Subjt:  GRHDLVDERLNKKEVHDI

XP_038881992.1 uncharacterized protein LOC120073309 [Benincasa hispida]1.03e-16380.19Show/hide
Query:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
        MS+AFQ LSL+SPSPSPPPST  FS+FFSRNPC SLRFAP  FPN L HFQ L+HK RSPFNF SIN HQFCPRVSTSGGVGR+   DGDF++DS LSAA
Subjt:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA

Query:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
        ELFCLV+SL+ SVG ALN  KARSKS+FLAVFGDGIFVGA LFLVAGVAIGAWIRRRQWNRI+R TAK  L ++L+E+TN+LEEDL+SSATLIRVLSRQL
Subjt:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEG-QSPI
        EKLGIRFRVTRKALKKP+EE         TA LAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQE + G QS I
Subjt:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEG-QSPI

Query:  GRHDLVDERLNKKEVHDI
        GRHDL+DERLN+KEV D+
Subjt:  GRHDLVDERLNKKEVHDI

TrEMBL top hitse value%identityAlignment
A0A0A0KK16 Uncharacterized protein3.56e-15075.39Show/hide
Query:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
        MS+ FQ LSL+SPSPS   STF FS+F SRNPC SL F P  FPN L HFQ LD+K RSPFNF SIN H FCPRVSTSGGVGRR     DF++DS LSA 
Subjt:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA

Query:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
        E FCLV+SL+ SVG ALN  K RSKSLFLAVFGDG+ VG  LFLVAGVAIGAWIRRRQWNR++R TAK  LE++L+E+TNKLEEDL+SSATLIRVLSRQL
Subjt:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG
        EKLGIRFRVTRKALKKP+EE         TA LAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQL+LILAIG SGK+WESRQE + GQS +G
Subjt:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG

Query:  RHDLVDERLNKKEVHDI
        RHDL+DE LN KEV D+
Subjt:  RHDLVDERLNKKEVHDI

A0A6J1BTL1 uncharacterized protein LOC111005633 isoform X15.33e-20597.16Show/hide
Query:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
        MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
Subjt:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA

Query:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
        ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
Subjt:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG
        EKLGIRFRVTRKALKKPIEE         TAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG
Subjt:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIG

Query:  RHDLVDERLNKKEVHDI
        RHDLVDERLNKKEVHDI
Subjt:  RHDLVDERLNKKEVHDI

A0A6J1BV51 uncharacterized protein LOC111005633 isoform X22.26e-16796.62Show/hide
Query:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
        MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA
Subjt:  MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAA

Query:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
        ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL
Subjt:  ELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQL

Query:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ
        EKLGIRFRVTRKALKKPIEE         TAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ
Subjt:  EKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQ

A0A6J1HDZ6 uncharacterized protein LOC1114633227.31e-15478.62Show/hide
Query:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA
        MS+AFQYLSLSS SPSPPPS T YFS+F SRNPC SLRFAP  FP+ L HFQ LDHKLRSP+NF SIN HQF PRVS S G GRRD  D  F++DS LSA
Subjt:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA

Query:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ
        AELFCLV+SLLASVG ALN VKA SKS+FLAVFGD I VGA LFLVAGVAIGAWIRRRQWNRIYR TAK  LE +LVERTNKLEEDL++SATLIRVLSRQ
Subjt:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ

Query:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI
        LEKLG RFR TRKALKKP+EE         TA LAQKTSEATRALAVRGDILE EL EIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQE +  QSP 
Subjt:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI

Query:  GRHDLVDERLNKKEVHDI
        GRHDL+DER+N+KEV D+
Subjt:  GRHDLVDERLNKKEVHDI

A0A6J1KQT6 uncharacterized protein LOC1114968082.56e-15478.93Show/hide
Query:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA
        MS+A QYLSLSS SPSPPPS TFYFS+F SRNPC SLRFAP  FP+AL HFQ LDHKLRSPFNF SIN HQF PRVS S G GRRD  D  F +DS LSA
Subjt:  MSIAFQYLSLSSPSPSPPPS-TFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSA

Query:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ
        AELFCLV+SLLASVG ALN VKA SKS+F AVFGD I VGA LFLVAGVAIGAWIRRRQWNRIYR TAK  LE+DLVERTNKLEEDL++SATLIRVLSRQ
Subjt:  AELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQ

Query:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI
        LEKLG RFR TRKALKKP+EE         TA LAQKTSEATRALAVRGDILE EL EIQKVLLAMQEQQQKQLELILAIGKSGK+WESRQE +  +SP 
Subjt:  LEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPI

Query:  GRHDLVDERLNKKEVHDI
        GRHDL+DER+N+KEV D+
Subjt:  GRHDLVDERLNKKEVHDI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G65250.1 unknown protein7.3e-4547.57Show/hide
Query:  FLDHKLRSPFNF-----YSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAAELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVA
        FL H + +  NF     Y  +         +S      D+S   F+L SF+S AE  C++SS + SV +A+N V        +   G  +    F+ LV 
Subjt:  FLDHKLRSPFNF-----YSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAAELFCLVSSLLASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVA

Query:  GVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQLEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAV
         VA G+W+RRRQW RI +G A+ +   +L+ R  KLE+DLKSS +++RVLSR LEKLGIRFRVTRKALK+PI E         TA LAQK SEATR L  
Subjt:  GVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQLEKLGIRFRVTRKALKKPIEEPLRIDLFLKTAVLAQKTSEATRALAV

Query:  RGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIGRHDLVDERLNKKE
        + +ILEKEL EIQKVLLAMQEQQ+KQLELIL I KS KL+ES    +  QSP       ++R NK E
Subjt:  RGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIGRHDLVDERLNKKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGATTGCTTTCCAATACCTTTCACTGAGCTCGCCTTCTCCTTCCCCTCCCCCTTCTACCTTTTACTTCTCCAGCTTCTTTTCCAGGAATCCGTGCTCGTCTCTTCG
ATTTGCCCCTAGACATTTCCCCAACGCCCTGCTGCATTTTCAATTTCTCGATCACAAACTTCGAAGCCCTTTTAATTTCTATTCGATTAATACCCATCAGTTCTGTCCTC
GAGTTTCTACTTCTGGAGGAGTTGGACGTAGAGACAGTAGTGATGGTGATTTTAATCTCGATTCCTTTCTTTCAGCTGCCGAGTTGTTTTGCCTCGTTTCGTCTTTGCTC
GCTTCTGTTGGGGTCGCTCTGAATAGCGTGAAAGCCAGGTCTAAGAGTCTGTTCTTGGCGGTGTTTGGTGATGGGATTTTCGTTGGCGCATTCTTATTTTTGGTGGCTGG
GGTTGCAATTGGTGCTTGGATTCGCAGACGGCAGTGGAATCGGATTTATCGAGGGACAGCGAAGGCCGCATTGGAGATAGATTTGGTGGAAAGAACTAACAAGCTGGAGG
AGGATTTGAAGAGCTCGGCAACGCTAATTCGAGTCTTGTCGAGGCAGCTGGAGAAGCTAGGGATTAGGTTTAGAGTTACTCGAAAGGCTCTGAAGAAGCCCATCGAGGAG
CCTCTAAGAATTGATCTATTTCTGAAGACTGCAGTGTTAGCTCAAAAAACTTCTGAGGCTACTCGAGCATTGGCAGTTCGAGGAGATATTCTGGAGAAGGAGCTTGCTGA
AATCCAGAAGGTCTTACTGGCTATGCAGGAACAGCAACAAAAGCAACTTGAGTTGATTCTAGCTATAGGGAAGTCGGGAAAGCTATGGGAAAGCAGACAGGAGCTTGCCG
AAGGACAGAGTCCTATTGGGAGGCATGATTTGGTCGATGAACGCTTAAATAAGAAGGAAGTCCATGACATTTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAATTAAAAAATTTCAAATTTCAAATATATCCTATAACCATTTTTAAAAAATTATAAATAAATGAACATTTTTTTAAATGGGCGGTAAAATTTATCACTTACTTAAA
TTTCCCATAAAAGAAAGCCTGGGCCAAAGCCGTGGACCAAGAAGGCCCTGTTGGACGCTGGTGGTTATCCAGTTTAACGGTTACAGTTATAAAGTAATAGATCGGATAAA
ACTCGCGCGTAATTCATTGAATTCATCGCGGTTTTCCCTTCCACCGCGAAGTCCGTCTCTCTCCCCTTCTGTCCGAAGTCAAATCTGTCTCCAATTTCATTTTCTGTTCT
GAAGGAGGGACCTATGTCCCGTCTTCCATGTCTGACAAAACCCTAGATTCTTGAATCACACGCTCTGCAAGTCTACCCTCCATAACGATGTCGATTGCTTTCCAATACCT
TTCACTGAGCTCGCCTTCTCCTTCCCCTCCCCCTTCTACCTTTTACTTCTCCAGCTTCTTTTCCAGGAATCCGTGCTCGTCTCTTCGATTTGCCCCTAGACATTTCCCCA
ACGCCCTGCTGCATTTTCAATTTCTCGATCACAAACTTCGAAGCCCTTTTAATTTCTATTCGATTAATACCCATCAGTTCTGTCCTCGAGTTTCTACTTCTGGAGGAGTT
GGACGTAGAGACAGTAGTGATGGTGATTTTAATCTCGATTCCTTTCTTTCAGCTGCCGAGTTGTTTTGCCTCGTTTCGTCTTTGCTCGCTTCTGTTGGGGTCGCTCTGAA
TAGCGTGAAAGCCAGGTCTAAGAGTCTGTTCTTGGCGGTGTTTGGTGATGGGATTTTCGTTGGCGCATTCTTATTTTTGGTGGCTGGGGTTGCAATTGGTGCTTGGATTC
GCAGACGGCAGTGGAATCGGATTTATCGAGGGACAGCGAAGGCCGCATTGGAGATAGATTTGGTGGAAAGAACTAACAAGCTGGAGGAGGATTTGAAGAGCTCGGCAACG
CTAATTCGAGTCTTGTCGAGGCAGCTGGAGAAGCTAGGGATTAGGTTTAGAGTTACTCGAAAGGCTCTGAAGAAGCCCATCGAGGAGCCTCTAAGAATTGATCTATTTCT
GAAGACTGCAGTGTTAGCTCAAAAAACTTCTGAGGCTACTCGAGCATTGGCAGTTCGAGGAGATATTCTGGAGAAGGAGCTTGCTGAAATCCAGAAGGTCTTACTGGCTA
TGCAGGAACAGCAACAAAAGCAACTTGAGTTGATTCTAGCTATAGGGAAGTCGGGAAAGCTATGGGAAAGCAGACAGGAGCTTGCCGAAGGACAGAGTCCTATTGGGAGG
CATGATTTGGTCGATGAACGCTTAAATAAGAAGGAAGTCCATGACATTTGAGCTATTGAAGAAAAGAATGGAGATGGCTTTAGGATTCTTTCGATACAAATTAGGATGTT
AGTTGTTGCACAATGACAGAATAGGATGGTGGTGGGATGTAATTCTTGTGGGCAGTTTTAGACTCTGGCATCACTCCATACTTTTTGCCAGAAAAAATTTTGGGTTTCCA
TAATGTTCAATTTAATGGGTTGCATTCAGAGAGCAAGTGTGAAGCGGCAGGAGAAGTGTAATCAAAGAAATGTTTGTTGTGGTCATTGTGTTAATGGGAGAAACATTCTG
CTTAACCATTGTAATTTTGATTTCTCATTGAATTCTGGTGGTTCGCATCGACTTCCATCGTCTATAGCAGTTTGCATCTTAAATTCCCATCCCTCCCCCCGGGCATGTTC
AGTGGAGTCGGTTCCAGGGAGCTTTTACACGAGCAACGAGGACAGCTATTCTAATGGAGAAACATCAGGCTGGGAACCTGCAACTTGCTTTTTACATGTTATGGAAGCTA
CGAAATGCAACACTTACCCTATCAAAACCATACAATTTGTTGGCTACGGAGGGAAGCCGATGCTCAGACACAGGCATGTAGCAGCAGTTCGTCAGGTTCGGTTTTTTGCT
CTCTTTTCGCCCAGAGAATTTAAAAGCAGTAAAACACAATAAAAGAAAAAGAAGACGATTATTAATATCATAATATCACCGGAAGTAGAATAGGGCTTGTGTGGTTCTGT
TCTGTTCATAAAGATGCAGTCTTGTAATGGTCTGATCAATAGTCTGGAAAGTACGTTGTGATTTTTACCGACGACTTAACAACTCCATGACACTGGAACCAAGCAGCCTC
AGGTACACTTTTCACCATACTCGCGTTAGTATTTTCGTCGATTTATGGGAACCAGATACTGGTTAGGCTAAGAGAGTGCAGGTGCAGAGAAAGTTTCCCTTTTTCTGCTG
AGTGCTGACTGATTCAATGTTGAATACCAAAAGAACTCGAAAGGTAGTAATCATATTTATGTTGATTAGAGTTTAAGAAGTTAAAGCTAAAAGGTATTTCATTTGGGTCT
GTTTAAAAAATAATGGAAGTGATCAGGGAAAATTGAAAAAAAAAAAAATTTTTAAAATAGAGAACAAACAAGGAAAATATGAGGGGGAAAAAACCCATTTTACTCCAAAC
TTGGCATCAATTTTTACTTAATATGAGGGAAAAAATATCAATTTTACTCAAATTTTTATAACAAAACTTATCATTGATAAATGAAAAGGAACAAAATTATTCAAATGATA
CGAACTCTCAACGGGAGGAAAAACAAAATAAAAAATAAAGAAAAAGGGGAAATAGAAACCCAAGGAAATACAACACAAATAAACAAGAAATATAACCGACATAAAAAGGG
GGAGAAGCTTTAGATAAATCTATCCTTGAAGCAATTTTAATCAACACCGAAGAACTTGAAGAATGCCAAAGCTTTAAACTTTCTAGGAAATAGCGAAATGTTGCTGAAAA
AAATTCTTCAAATTCTCCAACCATAAAGATTGATGCTTCGAAAATGGAGACCCTTTGAACTTCTCAAATCAGAAAAATGATATAGGACAGTTAAGACTGAGGAGAAGCTC
AATGAAGACCCTTCAAAATTCTAACCCTAAACCCAAGGACCAAAAAAAAAAAAAAATTGTTGCTGAAAATCCTTCAAATTCTCCAACCACAAAGAATGAAGCTACAGAAA
AGGACCTTTCAAATTCTCAAAAGCATAAAGATTGATGCAAGAACAACTAAAACTCAGGAGAATCTCATTGAAACCAATCAACCAGCTGTAAAAAATCCATTTTACTTGTG
CAGGATAGAATTTCTGAATTCTTTAAACCCTTGTGATCATGCATGCAGATTTATCAATAAAGTTGTTCCGACAAATACCATATTGATTTCAGTTTGACTCCTCAAACAAT
TCTTTCTACTTACGGCCTACGGGGTCTATGGTTAATTGAGGCTCTCGTTCTTGGAGTTTGGTAAGATGCATGGTTGACTAAATGGCCTGTAAAAAAAATCAACAGATCTG
GAGAGGCAGTCGAGGAATGTGGCAGATTATTTGCAGCATTCTTTTCGCCTTTAACGAGCAAACACCACATATTCTTT
Protein sequenceShow/hide protein sequence
MSIAFQYLSLSSPSPSPPPSTFYFSSFFSRNPCSSLRFAPRHFPNALLHFQFLDHKLRSPFNFYSINTHQFCPRVSTSGGVGRRDSSDGDFNLDSFLSAAELFCLVSSLL
ASVGVALNSVKARSKSLFLAVFGDGIFVGAFLFLVAGVAIGAWIRRRQWNRIYRGTAKAALEIDLVERTNKLEEDLKSSATLIRVLSRQLEKLGIRFRVTRKALKKPIEE
PLRIDLFLKTAVLAQKTSEATRALAVRGDILEKELAEIQKVLLAMQEQQQKQLELILAIGKSGKLWESRQELAEGQSPIGRHDLVDERLNKKEVHDI