; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g20760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g20760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr3:14165094..14166756
RNA-Seq ExpressionMoc03g20760
SyntenyMoc03g20760
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.9e-11377.24Show/hide
Query:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF  EFL RTGL+ AQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTS
        KWF+ASGEWLA DESG  FFDVP RFGNLVSI+P+PELTQASFDTLK+YK+ F RGRK+GTLVTD+LLLESGLLDYNP VRP+E SRPNS  AMVC F S
Subjt:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTS

Query:  SVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD
         VKRKSKGRAHAL+  QSS P TPAV         GP++E P PVI+L+S+G  SREKR R ++EA+D
Subjt:  SVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]3.1e-9285.34Show/hide
Query:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF  EFL RTGL+ AQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSE
        KWF+ASGEWLA DESG  FFDVP RFGNLVSI+P+PELTQASFDTLK+YK+ F RGRK+GTLVTD+LLLESGLLDYNP VRP+E+SRPNSE
Subjt:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]4.8e-9385.05Show/hide
Query:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF  EFL RTGL+ AQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAM
        KWF+ASGEWLA DESG  FFDVP RFGNLVSI+P+PELTQASFDTLK+YK+HF RGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSE  M
Subjt:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.5e-15578.22Show/hide
Query:  LARRLESELNEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGLLRKGFKIPNDILLRIPEEGERADNPSEGWVTLYLKMFEYGLRLPLHPFAHEFLN
        LARRLES+L EIEN R SDDGEDSD STSGQGLEYPS++PEHYLG LR+GF IP +ILLR+PEEGERADNP EGWVTLY KMFEYGLRLPLHPF  EFL 
Subjt:  LARRLESELNEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGLLRKGFKIPNDILLRIPEEGERADNPSEGWVTLYLKMFEYGLRLPLHPFAHEFLN

Query:  RTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPF
        RTGL+ AQVAPNGWGVIFALAILFWLRARD +E EL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLA DESG  F
Subjt:  RTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPF

Query:  FDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSS
        FDVP RFGNLVSI+P+PELTQASFDTLK+YK+ F RGRK+GTLVTD+LLLESGLLDYNP VRP+E+SRPNSE AMVCGF S VKRKSKGRAHAL+  QSS
Subjt:  FDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSS

Query:  DPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD
         P TPAV         GP++E P  VI+L+S+G  SREKR R ++EA+D
Subjt:  DPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]6.4e-12278.45Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLA DESG  FFDVP RFGNLVSIK IPEL QA+FDTLK YKDHF R RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALDVSPLREVREG
        LVR +EASRPNSE AMVCGFT SVKRKSKGRAHALKTV  ++PVTP V +  AQ  +GPS+ VPTPVI+LD +G RS EKRSR ESEALDVSPL EVR  
Subjt:  LVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALDVSPLREVREG

Query:  SPLKRRKKKKKATSFLEVGPRGPLPSSHADLIDDSTARMGGTSDVKMRFRTEPSSSGVKDQVSRVSAACLDRCLRRASKFVSDLGSVLQRTIDHTVE
        SPL+RR+KKKK +S  E G RG LP+SHADL+DD  ARM GTS+V+MRF  EPSSSGVKDQVSR+SA CLDR LRRASKFVSD GSVLQRTID+  E
Subjt:  SPLKRRKKKKKATSFLEVGPRGPLPSSHADLIDDSTARMGGTSDVKMRFRTEPSSSGVKDQVSRVSAACLDRCLRRASKFVSDLGSVLQRTIDHTVE

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138269.0e-11477.24Show/hide
Query:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF  EFL RTGL+ AQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTS
        KWF+ASGEWLA DESG  FFDVP RFGNLVSI+P+PELTQASFDTLK+YK+ F RGRK+GTLVTD+LLLESGLLDYNP VRP+E SRPNS  AMVC F S
Subjt:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTS

Query:  SVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD
         VKRKSKGRAHAL+  QSS P TPAV         GP++E P PVI+L+S+G  SREKR R ++EA+D
Subjt:  SVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD

A0A6J1DWD2 uncharacterized protein LOC1110246801.5e-9285.34Show/hide
Query:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF  EFL RTGL+ AQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSE
        KWF+ASGEWLA DESG  FFDVP RFGNLVSI+P+PELTQASFDTLK+YK+ F RGRK+GTLVTD+LLLESGLLDYNP VRP+E+SRPNSE
Subjt:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSE

A0A6J1DWF1 uncharacterized protein LOC1110251082.3e-9385.05Show/hide
Query:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG
        MFEYGLRLPLHPF  EFL RTGL+ AQVAPNGWGVIFALAILFWLRARD +E ELL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVG

Query:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAM
        KWF+ASGEWLA DESG  FFDVP RFGNLVSI+P+PELTQASFDTLK+YK+HF RGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSE  M
Subjt:  KWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAM

A0A6J1DXS5 uncharacterized protein LOC1110255023.6e-15578.22Show/hide
Query:  LARRLESELNEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGLLRKGFKIPNDILLRIPEEGERADNPSEGWVTLYLKMFEYGLRLPLHPFAHEFLN
        LARRLES+L EIEN R SDDGEDSD STSGQGLEYPS++PEHYLG LR+GF IP +ILLR+PEEGERADNP EGWVTLY KMFEYGLRLPLHPF  EFL 
Subjt:  LARRLESELNEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGLLRKGFKIPNDILLRIPEEGERADNPSEGWVTLYLKMFEYGLRLPLHPFAHEFLN

Query:  RTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPF
        RTGL+ AQVAPNGWGVIFALAILFWLRARD +E EL  VDQLL CFEAKRIAKKPGR+YMCARKGAGGIVKGPTSIKGWV KWF+ASGEWLA DESG  F
Subjt:  RTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPF

Query:  FDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSS
        FDVP RFGNLVSI+P+PELTQASFDTLK+YK+ F RGRK+GTLVTD+LLLESGLLDYNP VRP+E+SRPNSE AMVCGF S VKRKSKGRAHAL+  QSS
Subjt:  FDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSS

Query:  DPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD
         P TPAV         GP++E P  VI+L+S+G  SREKR R ++EA+D
Subjt:  DPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALD

A0A6J1DZB3 uncharacterized protein LOC1110256653.1e-12278.45Show/hide
Query:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNP
        MCARKG GGIVKGPTSIKGWVGKWFFASGEWLA DESG  FFDVP RFGNLVSIK IPEL QA+FDTLK YKDHF R RKI TLVTDKLLLESGLLDYNP
Subjt:  MCARKGAGGIVKGPTSIKGWVGKWFFASGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNP

Query:  LVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALDVSPLREVREG
        LVR +EASRPNSE AMVCGFT SVKRKSKGRAHALKTV  ++PVTP V +  AQ  +GPS+ VPTPVI+LD +G RS EKRSR ESEALDVSPL EVR  
Subjt:  LVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKTVQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALDVSPLREVREG

Query:  SPLKRRKKKKKATSFLEVGPRGPLPSSHADLIDDSTARMGGTSDVKMRFRTEPSSSGVKDQVSRVSAACLDRCLRRASKFVSDLGSVLQRTIDHTVE
        SPL+RR+KKKK +S  E G RG LP+SHADL+DD  ARM GTS+V+MRF  EPSSSGVKDQVSR+SA CLDR LRRASKFVSD GSVLQRTID+  E
Subjt:  SPLKRRKKKKKATSFLEVGPRGPLPSSHADLIDDSTARMGGTSDVKMRFRTEPSSSGVKDQVSRVSAACLDRCLRRASKFVSDLGSVLQRTIDHTVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAGTGATACCATAGGTAGTGCGGGTCGGACCATAAGTAGTTCGCCACCCAAACCAAGCGATTCTGGGGAGGTCCTAGCTCG
TAGGTTAGAGTCTGAGCTAAATGAAATAGAGAACTTTAGGTTCTCAGATGATGGAGAGGATAGTGATACCTCCACTTCGGGCCAAGGTCTGGAGTACCCTTCAAAGATGC
CCGAGCACTATCTCGGACTCCTCCGTAAGGGGTTTAAAATTCCGAACGATATCCTTCTTAGGATTCCAGAGGAAGGGGAAAGAGCTGACAATCCCTCAGAGGGATGGGTC
ACTCTTTATTTAAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCCCATGAGTTTTTAAACCGAACTGGACTGTCTCTTGCTCAAGTGGCCCCCAATGG
GTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGTCGAGCTGCTAAGCGTTGACCAACTTCTTGGGTGTTTTGAGGCCAAGA
GGATAGCTAAAAAACCAGGTCGGTACTATATGTGCGCGAGGAAGGGCGCGGGTGGCATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGTAAATGGTTCTTTGCC
TCAGGTGAATGGCTAGCAAATGACGAGTCAGGTCATCCCTTCTTTGATGTGCCTGTTAGGTTTGGGAACCTAGTGTCGATCAAACCAATCCCTGAGCTTACACAAGCCTC
TTTCGACACCCTCAAATTTTACAAAGATCACTTCCTTAGAGGTCGAAAGATCGGAACTTTGGTGACCGATAAGCTGCTTCTGGAATCTGGGTTGTTAGACTACAACCCCT
TAGTGCGCCCGGTTGAAGCTTCAAGGCCAAATTCCGAGCACGCCATGGTGTGTGGATTCACGAGCAGCGTGAAACGTAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACA
GTTCAAAGCTCTGATCCAGTGACTCCCGCTGTGGACCAACCTGCAGCTCAGGACCAGGCTGGGCCATCAACTGAAGTTCCAACTCCGGTGATCGACTTGGATTCTACTGG
GGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCTGAAGCGTTGGACGTGTCACCTCTCCGCGAAGTGAGAGAGGGTTCTCCTTTGAAGAGAAGGAAGAAAAAGAAGA
AAGCCACCTCCTTCTTGGAGGTTGGACCTCGTGGCCCCCTGCCCTCAAGCCACGCCGACCTGATAGATGACTCGACAGCTCGGATGGGGGGGACATCCGACGTGAAGATG
CGGTTTAGAACGGAACCGTCAAGCTCCGGGGTGAAAGACCAGGTGTCACGCGTCTCGGCTGCCTGCTTGGATCGCTGTCTCAGAAGAGCATCCAAGTTTGTAAGCGATCT
TGGGTCCGTACTGCAACGGACAATCGATCATACTGTCGAGGTAAAATATACTTCTGAACTTTGTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTAGTTTTCTTGTCTTCCCCCTCCAGTAGTGATACCATAGGTAGTGCGGGTCGGACCATAAGTAGTTCGCCACCCAAACCAAGCGATTCTGGGGAGGTCCTAGCTCG
TAGGTTAGAGTCTGAGCTAAATGAAATAGAGAACTTTAGGTTCTCAGATGATGGAGAGGATAGTGATACCTCCACTTCGGGCCAAGGTCTGGAGTACCCTTCAAAGATGC
CCGAGCACTATCTCGGACTCCTCCGTAAGGGGTTTAAAATTCCGAACGATATCCTTCTTAGGATTCCAGAGGAAGGGGAAAGAGCTGACAATCCCTCAGAGGGATGGGTC
ACTCTTTATTTAAAGATGTTTGAGTACGGCCTCAGACTTCCCCTTCATCCCTTTGCCCATGAGTTTTTAAACCGAACTGGACTGTCTCTTGCTCAAGTGGCCCCCAATGG
GTGGGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGACGAGGATGAGGTCGAGCTGCTAAGCGTTGACCAACTTCTTGGGTGTTTTGAGGCCAAGA
GGATAGCTAAAAAACCAGGTCGGTACTATATGTGCGCGAGGAAGGGCGCGGGTGGCATAGTCAAGGGGCCGACCTCCATCAAGGGATGGGTAGGTAAATGGTTCTTTGCC
TCAGGTGAATGGCTAGCAAATGACGAGTCAGGTCATCCCTTCTTTGATGTGCCTGTTAGGTTTGGGAACCTAGTGTCGATCAAACCAATCCCTGAGCTTACACAAGCCTC
TTTCGACACCCTCAAATTTTACAAAGATCACTTCCTTAGAGGTCGAAAGATCGGAACTTTGGTGACCGATAAGCTGCTTCTGGAATCTGGGTTGTTAGACTACAACCCCT
TAGTGCGCCCGGTTGAAGCTTCAAGGCCAAATTCCGAGCACGCCATGGTGTGTGGATTCACGAGCAGCGTGAAACGTAAGTCTAAGGGTCGTGCTCACGCCCTTAAGACA
GTTCAAAGCTCTGATCCAGTGACTCCCGCTGTGGACCAACCTGCAGCTCAGGACCAGGCTGGGCCATCAACTGAAGTTCCAACTCCGGTGATCGACTTGGATTCTACTGG
GGAGCGCTCCAGGGAGAAGCGCTCGAGGAGCGAGTCTGAAGCGTTGGACGTGTCACCTCTCCGCGAAGTGAGAGAGGGTTCTCCTTTGAAGAGAAGGAAGAAAAAGAAGA
AAGCCACCTCCTTCTTGGAGGTTGGACCTCGTGGCCCCCTGCCCTCAAGCCACGCCGACCTGATAGATGACTCGACAGCTCGGATGGGGGGGACATCCGACGTGAAGATG
CGGTTTAGAACGGAACCGTCAAGCTCCGGGGTGAAAGACCAGGTGTCACGCGTCTCGGCTGCCTGCTTGGATCGCTGTCTCAGAAGAGCATCCAAGTTTGTAAGCGATCT
TGGGTCCGTACTGCAACGGACAATCGATCATACTGTCGAGGTAAAATATACTTCTGAACTTTGTTTTTAA
Protein sequenceShow/hide protein sequence
MVVFLSSPSSSDTIGSAGRTISSSPPKPSDSGEVLARRLESELNEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGLLRKGFKIPNDILLRIPEEGERADNPSEGWV
TLYLKMFEYGLRLPLHPFAHEFLNRTGLSLAQVAPNGWGVIFALAILFWLRARDEDEVELLSVDQLLGCFEAKRIAKKPGRYYMCARKGAGGIVKGPTSIKGWVGKWFFA
SGEWLANDESGHPFFDVPVRFGNLVSIKPIPELTQASFDTLKFYKDHFLRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSEHAMVCGFTSSVKRKSKGRAHALKT
VQSSDPVTPAVDQPAAQDQAGPSTEVPTPVIDLDSTGERSREKRSRSESEALDVSPLREVREGSPLKRRKKKKKATSFLEVGPRGPLPSSHADLIDDSTARMGGTSDVKM
RFRTEPSSSGVKDQVSRVSAACLDRCLRRASKFVSDLGSVLQRTIDHTVEVKYTSELCF