; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1058 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1058
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionARM repeat superfamily protein
Genome locationMC09:16460464..16464424
RNA-Seq ExpressionMC09g1058
SyntenyMC09g1058
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146451.1 uncharacterized protein LOC101212969 isoform X1 [Cucumis sativus]5.16e-11287.89Show/hide
Query:  MSMLASKLNTY--LCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSID
        M ++ASKL T+  LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV+LL+VKDPLFKRMGASRLARFSID
Subjt:  MSMLASKLNTY--LCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        D++RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  DKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_011655026.1 uncharacterized protein LOC101212969 isoform X2 [Cucumis sativus]7.13e-11488.83Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL T+LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV+LL+VKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        +RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_022146798.1 uncharacterized protein LOC111015917 [Momordica charantia]7.85e-127100Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_022150134.1 uncharacterized protein LOC111018388 [Momordica charantia]2.23e-11792.55Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MS+L  KLN YLCRREPVRTLQFRTFSAY ESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGD+LMQQSVALLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKS P+SVED K+NE+KSNL+KRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_038891777.1 uncharacterized protein LOC120081166 isoform X1 [Benincasa hispida]2.49e-11489.36Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL T+LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATF+GYQIFPYMGDNL+QQSV+LLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DL YDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KM85 Uncharacterized protein3.45e-11488.83Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL T+LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV+LL+VKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        +RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A1S3C497 uncharacterized protein LOC103496719 isoform X23.86e-11187.77Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL  +LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV LL+VKDPLFKRMGASRLARFSIDDK
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVE GGAQELLNML  AKDDRTRKEALKAL+AISHSDEAV  LHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A6J1CZI6 uncharacterized protein LOC1110159173.80e-127100Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A6J1D932 uncharacterized protein LOC1110183881.08e-11792.55Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MS+L  KLN YLCRREPVRTLQFRTFSAY ESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGD+LMQQSVALLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKS P+SVED K+NE+KSNL+KRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A6J1FTA5 uncharacterized protein LOC111447100 isoform X23.86e-11186.7Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ SKL  +LCRREP RTLQFR FSAY+E EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNL+QQSV+LLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        +RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDS EDT++NE+KSNL+KRF DL YDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56210.1 ARM repeat superfamily protein1.4e-5656.54Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein1.4e-5656.54Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

AT3G56210.4 ARM repeat superfamily protein1.4e-5656.54Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

AT3G56210.5 ARM repeat superfamily protein4.6e-5555.96Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S   EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATGCTCGCATCGAAGTTGAACACTTATCTCTGCAGAAGAGAGCCTGTGCGGACCCTGCAATTTCGCACTTTTTCAGCTTACAACGAAAGCGAGATCGAGAAGGA
GGCTGAAAGAAAAGTAGGATGGTTATTGAAACTAATCTTTGCTGGGACTGCCACATTTCTGGGTTACCAGATTTTTCCATACATGGGGGATAACTTGATGCAGCAATCTG
TGGCACTCTTGCAAGTCAAGGATCCATTGTTTAAGAGGATGGGAGCATCTAGATTGGCTCGTTTTTCGATAGACGATAAAAGAAGAATGAAAATAGTGGAGATGGGTGGA
GCTCAAGAGCTCTTAAACATGTTGGGGGCTGCCAAAGACGACCGCACGCGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCTCATTCAGATGAAGCTGTCAGTGCTCT
GCATAAAGCAGGGGCAATCTTGGTTATTAAATCTACTCCGGATTCGGTTGAAGATACGAAACTGAACGAGTTCAAGTCCAACCTAATAAAGAGATTTGAAGATCTTAGAT
ACGATGTCTCGTCTTGA
mRNA sequenceShow/hide mRNA sequence
GCTCAATAATCCATTTAGAAGTATTTCTATACCAAAAATTTCACGAACCAAATAAAGTATGTTTGTCAAAGCGAGCATAACTCTAATTGACATATACATCGTTCAAATTC
AGTCTCAACATGTTGTATTGAAAATAAAATAATAATAAAATAAAGGGTATGTTTTTTTTTTCAAAAAAAAAGAAAAAAAAAAGAAAAAAAACTATATTTTAAACTTTCTT
TAAGCTATTTCCATCCAAATCTCTCAACCTAAGCAATTTCATACATTGCCCATCGACACAGCGCCGGAGCCGGCGGCCGAGCTTCCGTTGTGCACCATGAGCATGCTCGC
ATCGAAGTTGAACACTTATCTCTGCAGAAGAGAGCCTGTGCGGACCCTGCAATTTCGCACTTTTTCAGCTTACAACGAAAGCGAGATCGAGAAGGAGGCTGAAAGAAAAG
TAGGATGGTTATTGAAACTAATCTTTGCTGGGACTGCCACATTTCTGGGTTACCAGATTTTTCCATACATGGGGGATAACTTGATGCAGCAATCTGTGGCACTCTTGCAA
GTCAAGGATCCATTGTTTAAGAGGATGGGAGCATCTAGATTGGCTCGTTTTTCGATAGACGATAAAAGAAGAATGAAAATAGTGGAGATGGGTGGAGCTCAAGAGCTCTT
AAACATGTTGGGGGCTGCCAAAGACGACCGCACGCGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCTCATTCAGATGAAGCTGTCAGTGCTCTGCATAAAGCAGGGG
CAATCTTGGTTATTAAATCTACTCCGGATTCGGTTGAAGATACGAAACTGAACGAGTTCAAGTCCAACCTAATAAAGAGATTTGAAGATCTTAGATACGATGTCTCGTCT
TGACATGTAAGAGTTTTATTTATTTCATTTGTAGTTGTTTAGTGTGGGGGTATCAAGATGAAGCCATGAAGCCTCTAAAGATATCACAAGGTTATGAACTGCATTCACTT
CAAATGAGTGGTGGAAATTCAAGACGGGCGTTGTACATTTTTTTACTGTCTGTTTTACATTCTTGCAAAGCGGGTTTTCTCGGGTTGAGATACTTTATAAGTAGTTATCT
CGTCGTTGCGGGAACCTCTCCTGTTTAACAAAGCGGGTTTTCTTGGAAAGAATTGAGATGCTGAAACTAGCTTTGTTATGCTTTTAGGAATAAAATTGATGCATCACCCT
TTTGCTCATGTTTTCATATTTGATTTTCAGATTAAGGACCCCTCTTTTGGCATTTTCAGATGTAATTTTCAGACCCAAAGTTTAATTGACCTTACACAATATTTGCAATC
AATTTCTCACAAACAAATACAGAACTTAGAAAACCCACTCACAATTTAACATTACATCTTTCAAATATACAAAACCTGAACTTTTATTCCCTTTTTTTTTGGTTCAAATC
GACCTACCTTAACCAAGAGCTGGGCTCAGATTAACATACAAAACTGAACTTAAAATGGCAAAAAAATAGTGTTTACAAACTATACAACCAATCATCAACTCCAGAACACA
AAGAATTGGAACTTTCAAAACAATTAACGATATCTAAGACTTACTTTTCAATGCATTGTTTATGTTCGTAGAAATGGCAATGGTGCTGGTCGCTACTGCCAGAATTACCA
TTATTGAAGCAACCAACTTGTCCTTCTTTGTTGATATTCCATTCACATCCCTA
Protein sequenceShow/hide protein sequence
MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDKRRMKIVEMGG
AQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS