CuGenDBv2

Moc04g21570 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g21570
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:15706284..15708282
RNA-Seq Expression: Moc04g21570
Synteny: Moc04g21570
Gene Ontology terms: NA
InterPro domains: NA
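The genome location above uses a `chromosome:start..end` string. A minimal sketch of how such a string could be split and its span computed follows; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chr4:15706284..15708282' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr4:15706284..15708282")
print(chrom, start, end, span_length(start, end))  # chr4 15706284 15708282 1999
```

If the database actually uses 0-based or half-open coordinates, the `+ 1` in `span_length` would need to be dropped.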


Homology
GenBank top hits (e value, % identity, alignment)
XP_022138041.1 uncharacterized protein LOC111009298 [Momordica charantia] (e value: 5.9e-72; % identity: 64.38)
Query:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK
        EWLAKDE               V+I+P+PEL QA+FDTLK+YK+HFPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSELAMVCGF+SNVKRKSK
Subjt:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK

Query:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPL-NEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASH
        G+AHAL+A ++++P  P           GP+SE   PV+ELES+   SREK PRD++EA+DVSPL  EVR E PLK+R+KKKKTTS  EVG RG LPAS 
Subjt:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPL-NEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASH

Query:  VELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQ
         + VDD EAR+GGT DV  +FR+EPSSS V+DQ
Subjt:  VELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQ

XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] (e value: 1.5e-91; % identity: 64.41)
Query:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------
        MFEYGLRLPLHPF Q+FL RT LAP QVAPNGW VIFALAILFWLRARD +EAELL+VDQLL CFEAKRIAKKPGR                        
Subjt:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------

Query:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSS
               EWLAKDE GR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LLLESGLLDYNP VRP+E SRPNS LAMVC F+S
Subjt:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSS

Query:  NVKRKSKGRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEAL-------DVSPLNE
         VKRKSKGRAHAL+A ++++P  P           GP+SE   PV+ELES+G  SREK PRD++EA+       DV PL E
Subjt:  NVKRKSKGRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEAL-------DVSPLNE

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] (e value: 1.6e-72; % identity: 71.65)
Query:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------
        MFEYGLRLPLHPF Q+FL RT LAP QVAPNGW VIFALAILFWLRARD +EAELL+VDQLL CFEAKRIAKKPGR                        
Subjt:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------

Query:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAM
               EWLAKDE GR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+HFPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSEL M
Subjt:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] (e value: 1.7e-132; % identity: 69.14)
Query:  DLAFKLESELQEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEGERADNPPGGWVTLHLKMFEYGLRLPLHPFAQKFL
        DLA +LES+L+EIEN R SDDGEDSD STSGQGLEYPS++PEHYLG LRRGF IP +ILLR+PEEGERADNPP GWVTL+ KMFEYGLRLPLHPF Q+FL
Subjt:  DLAFKLESELQEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEGERADNPPGGWVTLHLKMFEYGLRLPLHPFAQKFL

Query:  NRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR-------------------------------EWLAKDEFGRP
         RT LAP QVAPNGW VIFALAILFWLRARD +EAEL +VDQLL CFEAKRIAKKPGR                               EWLAKDE GR 
Subjt:  NRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR-------------------------------EWLAKDEFGRP

Query:  FFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSKGRAHALKAVRN
        FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LLLESGLLDYNP VRP+E+SRPNSELAMVCGF+S VKRKSKGRAHAL+A ++
Subjt:  FFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSKGRAHALKAVRN

Query:  TEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALD
        ++P  P           GP+SE    V+ELES+G  SREK PRD++EA+D
Subjt:  TEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 3.5e-101; % identity: 76.56)
Query:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK
        EWLAKDE GR FFDVP RFGNLVSIK IPELAQATFDTLKHYKDHFPR RKI TLVTDKLLLESGLLDYNPLVR +EASRPNSELAMVCGF+ +VKRKSK
Subjt:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK

Query:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPLNEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASHV
        GRAHALK V  TEP  P   +  AQ  +GPSS V TPV+EL+ +G  S EK  R+ESEALDVSPLNEVRGESPL++R+KKKKT+SSSE G RG LP SH 
Subjt:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPLNEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASHV

Query:  ELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQVSRISAACLDLCLRRASKFISDPG
        +LVDD EAR+ GTS+V+M+F +EPSSS VKDQVSRISA CLD  LRRASKF+SDPG
Subjt:  ELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQVSRISAACLDLCLRRASKFISDPG
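In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts those marks for one alignment segment. The function names are hypothetical, and a per-segment figure computed this way need not equal the % identity reported for the hit, since how the page derives that number is not stated.

```python
def midline_stats(query, midline):
    """Return (identical, positive, length) for one alignment segment.

    The midline is padded or trimmed to the query length because trailing
    spaces are often lost when alignments are copied as plain text.
    """
    length = len(query)
    padded = midline.ljust(length)[:length]
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = identical + padded.count("+")
    return identical, positive, length


def percent(count, total):
    """Percentage rounded to two decimals."""
    return round(100.0 * count / total, 2)


# Small illustrative segment (not a full alignment from this page).
identical, positive, length = midline_stats("EWLAKDEF", "EWLAKDE ")
print(identical, positive, length, percent(identical, length))  # 7 7 8 87.5
```

Pass the query and midline strings with the `Query:` label and leading indentation already removed, so that column positions line up.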

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C8K9 uncharacterized protein LOC111009298 (e value: 2.9e-72; % identity: 64.38)
Query:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK
        EWLAKDE               V+I+P+PEL QA+FDTLK+YK+HFPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSELAMVCGF+SNVKRKSK
Subjt:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK

Query:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPL-NEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASH
        G+AHAL+A ++++P  P           GP+SE   PV+ELES+   SREK PRD++EA+DVSPL  EVR E PLK+R+KKKKTTS  EVG RG LPAS 
Subjt:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPL-NEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASH

Query:  VELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQ
         + VDD EAR+GGT DV  +FR+EPSSS V+DQ
Subjt:  VELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQ

A0A6J1CR42 uncharacterized protein LOC111013826 (e value: 7.2e-92; % identity: 64.41)
Query:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------
        MFEYGLRLPLHPF Q+FL RT LAP QVAPNGW VIFALAILFWLRARD +EAELL+VDQLL CFEAKRIAKKPGR                        
Subjt:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------

Query:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSS
               EWLAKDE GR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LLLESGLLDYNP VRP+E SRPNS LAMVC F+S
Subjt:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSS

Query:  NVKRKSKGRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEAL-------DVSPLNE
         VKRKSKGRAHAL+A ++++P  P           GP+SE   PV+ELES+G  SREK PRD++EA+       DV PL E
Subjt:  NVKRKSKGRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEAL-------DVSPLNE

A0A6J1DWF1 uncharacterized protein LOC111025108 (e value: 7.5e-73; % identity: 71.65)
Query:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------
        MFEYGLRLPLHPF Q+FL RT LAP QVAPNGW VIFALAILFWLRARD +EAELL+VDQLL CFEAKRIAKKPGR                        
Subjt:  MFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR------------------------

Query:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAM
               EWLAKDE GR FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+HFPRGRK+GTLVTDKLLLESGLLDYNP VRP+E+SRPNSEL M
Subjt:  -------EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC111025502 (e value: 8.5e-133; % identity: 69.14)
Query:  DLAFKLESELQEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEGERADNPPGGWVTLHLKMFEYGLRLPLHPFAQKFL
        DLA +LES+L+EIEN R SDDGEDSD STSGQGLEYPS++PEHYLG LRRGF IP +ILLR+PEEGERADNPP GWVTL+ KMFEYGLRLPLHPF Q+FL
Subjt:  DLAFKLESELQEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEGERADNPPGGWVTLHLKMFEYGLRLPLHPFAQKFL

Query:  NRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR-------------------------------EWLAKDEFGRP
         RT LAP QVAPNGW VIFALAILFWLRARD +EAEL +VDQLL CFEAKRIAKKPGR                               EWLAKDE GR 
Subjt:  NRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPGR-------------------------------EWLAKDEFGRP

Query:  FFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSKGRAHALKAVRN
        FFDVP RFGNLVSI+P+PEL QA+FDTLK+YK+ FPRGRK+GTLVTD+LLLESGLLDYNP VRP+E+SRPNSELAMVCGF+S VKRKSKGRAHAL+A ++
Subjt:  FFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSKGRAHALKAVRN

Query:  TEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALD
        ++P  P           GP+SE    V+ELES+G  SREK PRD++EA+D
Subjt:  TEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALD

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 1.7e-101; % identity: 76.56)
Query:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK
        EWLAKDE GR FFDVP RFGNLVSIK IPELAQATFDTLKHYKDHFPR RKI TLVTDKLLLESGLLDYNPLVR +EASRPNSELAMVCGF+ +VKRKSK
Subjt:  EWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSK

Query:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPLNEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASHV
        GRAHALK V  TEP  P   +  AQ  +GPSS V TPV+EL+ +G  S EK  R+ESEALDVSPLNEVRGESPL++R+KKKKT+SSSE G RG LP SH 
Subjt:  GRAHALKAVRNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPLNEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASHV

Query:  ELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQVSRISAACLDLCLRRASKFISDPG
        +LVDD EAR+ GTS+V+M+F +EPSSS VKDQVSRISA CLD  LRRASKF+SDPG
Subjt:  ELVDDLEARIGGTSDVKMQFRIEPSSSRVKDQVSRISAACLDLCLRRASKFISDPG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGACCGTTGTGGAAGACGTTACGATCTGCCAGGCTGTCGGAGTACTTAAGTATTCCGTCGTTACGGATCTCGAGATGATCCTAGCCGCTTGTTCATTACACGTGTACGG
TGCAGAGCTCGAACCTTCCATAGGTAGCGCAGGTCGGACTATAAGCAGTTCGCCCCCTAAACTAAATGATTCTGGGGAGGACTTAGCTTTTAAGTTAGAGTCCGAGCTTC
AAGAGATAGAGAACTTCAGGTTTTCTGATGATGGGGAGGATAGTGACACTTCCACCTCGGGCCAGGGTTTAGAATACCCTTCTAAAATGCCCGAGCACTATCTCGGACCC
CTCCGTAGGGGGTTTAAAATTCCAAACGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGGGGGATGGGTCACTCTTCACTTAAAAATGTT
TGAGTACGGCCTCAGACTTCCTCTTCACCCCTTTGCCCAGAAGTTCCTAAATCGAACTGATCTGGCTCCTACTCAAGTGGCCCCCAATGGGTGGGCTGTCATTTTTGCCC
TGGCCATTCTTTTTTGGTTGCGAGCTCGAGATGAGGACGAAGCCGAGCTACTAAATGTTGACCAGCTTCTTGGGTGCTTTGAGGCCAAGAGGATAGCCAAGAAGCCAGGT
CGGGAATGGCTGGCAAAGGACGAATTCGGTCGTCCCTTCTTTGACGTGCCTGTTAGGTTTGGGAACCTAGTATCAATCAAACCGATTCCTGAGCTCGCTCAAGCCACCTT
CGACACCCTCAAGCACTATAAAGATCACTTCCCAAGGGGCCGGAAAATCGGGACCTTGGTAACTGACAAACTGCTCCTTGAATCAGGGTTGTTAGACTACAACCCCTTGG
TGCGTCCGGTTGAAGCTTCAAGGCCAAACTCCGAACTCGCAATGGTGTGTGGATTCTCCAGCAACGTGAAACGTAAATCCAAGGGCCGTGCACACGCCCTAAAGGCTGTT
CGGAATACGGAGCCAACGATCCCCGTCGGGGCTCAGCCTGTGGCTCAAGACACTGCTGGGCCATCTTCCGAAGTCTCAACTCCGGTGGTCGAGTTGGAATCTGCTGGGGA
GCACTCCAGAGAGAAGCACCCAAGGGATGAGTCGGAGGCGCTGGACGTGTCTCCTCTGAACGAGGTGAGGGGAGAGTCCCCTTTGAAGAAGAGAAAGAAGAAGAAGAAGA
CTACCTCCTCCTCGGAGGTTGGAACTCGTGGGCCCCTACCCGCGAGTCACGTCGAATTGGTGGACGACCTCGAAGCTCGGATTGGGGGGACGTCTGATGTAAAGATGCAG
TTCAGAATTGAACCATCAAGCTCCAGGGTGAAGGACCAGGTGTCCCGCATCTCGGCTGCATGCTTGGACCTCTGTCTGAGGAGAGCGTCCAAGTTTATAAGCGATCCAGG
GCGTTCACCGCTTCCATCCATTCAGCAATTATGA
mRNA sequence
ATGACCGTTGTGGAAGACGTTACGATCTGCCAGGCTGTCGGAGTACTTAAGTATTCCGTCGTTACGGATCTCGAGATGATCCTAGCCGCTTGTTCATTACACGTGTACGG
TGCAGAGCTCGAACCTTCCATAGGTAGCGCAGGTCGGACTATAAGCAGTTCGCCCCCTAAACTAAATGATTCTGGGGAGGACTTAGCTTTTAAGTTAGAGTCCGAGCTTC
AAGAGATAGAGAACTTCAGGTTTTCTGATGATGGGGAGGATAGTGACACTTCCACCTCGGGCCAGGGTTTAGAATACCCTTCTAAAATGCCCGAGCACTATCTCGGACCC
CTCCGTAGGGGGTTTAAAATTCCAAACGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGGGGGATGGGTCACTCTTCACTTAAAAATGTT
TGAGTACGGCCTCAGACTTCCTCTTCACCCCTTTGCCCAGAAGTTCCTAAATCGAACTGATCTGGCTCCTACTCAAGTGGCCCCCAATGGGTGGGCTGTCATTTTTGCCC
TGGCCATTCTTTTTTGGTTGCGAGCTCGAGATGAGGACGAAGCCGAGCTACTAAATGTTGACCAGCTTCTTGGGTGCTTTGAGGCCAAGAGGATAGCCAAGAAGCCAGGT
CGGGAATGGCTGGCAAAGGACGAATTCGGTCGTCCCTTCTTTGACGTGCCTGTTAGGTTTGGGAACCTAGTATCAATCAAACCGATTCCTGAGCTCGCTCAAGCCACCTT
CGACACCCTCAAGCACTATAAAGATCACTTCCCAAGGGGCCGGAAAATCGGGACCTTGGTAACTGACAAACTGCTCCTTGAATCAGGGTTGTTAGACTACAACCCCTTGG
TGCGTCCGGTTGAAGCTTCAAGGCCAAACTCCGAACTCGCAATGGTGTGTGGATTCTCCAGCAACGTGAAACGTAAATCCAAGGGCCGTGCACACGCCCTAAAGGCTGTT
CGGAATACGGAGCCAACGATCCCCGTCGGGGCTCAGCCTGTGGCTCAAGACACTGCTGGGCCATCTTCCGAAGTCTCAACTCCGGTGGTCGAGTTGGAATCTGCTGGGGA
GCACTCCAGAGAGAAGCACCCAAGGGATGAGTCGGAGGCGCTGGACGTGTCTCCTCTGAACGAGGTGAGGGGAGAGTCCCCTTTGAAGAAGAGAAAGAAGAAGAAGAAGA
CTACCTCCTCCTCGGAGGTTGGAACTCGTGGGCCCCTACCCGCGAGTCACGTCGAATTGGTGGACGACCTCGAAGCTCGGATTGGGGGGACGTCTGATGTAAAGATGCAG
TTCAGAATTGAACCATCAAGCTCCAGGGTGAAGGACCAGGTGTCCCGCATCTCGGCTGCATGCTTGGACCTCTGTCTGAGGAGAGCGTCCAAGTTTATAAGCGATCCAGG
GCGTTCACCGCTTCCATCCATTCAGCAATTATGA
Protein sequence
MTVVEDVTICQAVGVLKYSVVTDLEMILAACSLHVYGAELEPSIGSAGRTISSSPPKLNDSGEDLAFKLESELQEIENFRFSDDGEDSDTSTSGQGLEYPSKMPEHYLGP
LRRGFKIPNDILLRIPEEGERADNPPGGWVTLHLKMFEYGLRLPLHPFAQKFLNRTDLAPTQVAPNGWAVIFALAILFWLRARDEDEAELLNVDQLLGCFEAKRIAKKPG
REWLAKDEFGRPFFDVPVRFGNLVSIKPIPELAQATFDTLKHYKDHFPRGRKIGTLVTDKLLLESGLLDYNPLVRPVEASRPNSELAMVCGFSSNVKRKSKGRAHALKAV
RNTEPTIPVGAQPVAQDTAGPSSEVSTPVVELESAGEHSREKHPRDESEALDVSPLNEVRGESPLKKRKKKKKTTSSSEVGTRGPLPASHVELVDDLEARIGGTSDVKMQ
FRIEPSSSRVKDQVSRISAACLDLCLRRASKFISDPGRSPLPSIQQL
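
The protein sequence above corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and reading frame 0. The `translate` helper is hypothetical and is not the tool the database used.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing anything
    other than A, C, G or T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 15 bases of the CDS listed on this page.
print(translate("ATGACCGTTGTGGAAGACGTTACG"))  # MTVVEDVT
print(translate("ATTCAGCAATTATGA"))           # IQQL
```

Both fragments agree with the start (MTVVEDVT) and end (IQQL) of the protein sequence shown above.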