; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g10340 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g10340
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionINVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: my s in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).
Genome locationchr7:7922724..7924707
RNA-Seq ExpressionMoc07g10340
SyntenyMoc07g10340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia]2.3e-7872.08Show/hide
Query:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML
        F+ASIHS +M+KAELDG EAL AKE+ENSSAALEAATT+KGELLKA+ EV ILR EV+AKAELLK+E ++HKAHLRAAHAITKGLEK+KFQLLKEKDD+ 
Subjt:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML

Query:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------
        Q LE KD +I RLTAELK  KERL NG+LLE +FRQH DFDGFAKDFSDAGFKFLMKGIAA+MPHLQIDLS LKKK                        
Subjt:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------

Query:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQDEFKEAN
        ELDSDYSD+EEEDA SQE  E+GTTQEEVPSQQD  +E N
Subjt:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQDEFKEAN

XP_022158409.1 uncharacterized protein LOC111024898 [Momordica charantia]1.8e-6768.4Show/hide
Query:  MIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDMLQALEAKDAA
        MIKAELDG EAL AKEKENS AALEAATTMK ELLKARSEV IL+ +V+ KAE+LK+E ++HKAHL AAHAITK +EK+KFQLLKEKDD+ QALE  DA 
Subjt:  MIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDMLQALEAKDAA

Query:  IERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------ELDSDYSDL
        I RL+ ELK  KERL NG LLE AF+QHPDFDGFAKDFSDAGFKFLMKGIA +M HLQIDLS++KKK                        ELDSDYSD+
Subjt:  IERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------ELDSDYSDL

Query:  EEEDALSQEQVEVGTTQEEVPSQQDEFKEAN
        EE DA SQE  EVGTTQEEVPSQ    +E N
Subjt:  EEEDALSQEQVEVGTTQEEVPSQQDEFKEAN

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]7.5e-6959.04Show/hide
Query:  SDSGEDLAHRLESELEEIENFRFSDVGEDSDASVSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+   DLA RLES+LEEIEN R SD GEDSDAS SGQGLEYPS++PEHYLG LRRGF IP +ILLR+PEE ERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDSGEDLAHRLESELEEIENFRFSDVGEDSDASVSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVL-----------------------------------------YVRK------------
         QEFL RTGLAPAQVAPNGWGVIF LAI+FWLRARD E   +                                          +VRK            
Subjt:  AQEFLNRTGLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVL-----------------------------------------YVRK------------

Query:  --------ERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL
                   RF NLVSIR +PELTQASFDT+K+YK+RFP+GRK+GTL
Subjt:  --------ERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.3e-7672.96Show/hide
Query:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML
        FIASIH  VM+KAELDG EAL AKE+ENS AALEAATT+KGELLKA+ EVDILR EV+AK +LLK+E ++HKAHLRAAHAITKGLEK+KFQLLKEKDD+ 
Subjt:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML

Query:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------
        Q LE KDA+I RLT ELK  KERL NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAA+MPHLQIDL+ LKKK                        
Subjt:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------

Query:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQ
        ELDSDYSD+EEEDA SQE  EVGTTQEEVPSQQ
Subjt:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQ

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.2e-0547.5Show/hide
Query:  GLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVLYVRKERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL
        G+     +  GW   +  A   WL A+DE   +   V     RF NLVSI+LIPEL QA+FDT+K YKD FP+ RKI TL
Subjt:  GLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVLYVRKERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia]1.4e-7571.55Show/hide
Query:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML
        FIASIHS VM+KAELDG EALTAKE+EN S  LEAATT+KGELLKA+ EVDILR EV+AK +LLK+E ++HKAHLRAAHAITKGLEK+KFQLLKEKDD+ 
Subjt:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML

Query:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKK------------------------K
        Q LE KDA+I RLT ELK  KERL +GALLE +FRQHP+FDGFAKDFSDAGFKFLMKGIAA+MPHLQIDLS+LKK                        +
Subjt:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKK------------------------K

Query:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQ
        ELDSDYSD+EEEDA SQE  +VGTTQEE PSQ
Subjt:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQ

TrEMBL top hitse value%identityAlignment
A0A6J1DF31 uncharacterized protein LOC1110199091.1e-7872.08Show/hide
Query:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML
        F+ASIHS +M+KAELDG EAL AKE+ENSSAALEAATT+KGELLKA+ EV ILR EV+AKAELLK+E ++HKAHLRAAHAITKGLEK+KFQLLKEKDD+ 
Subjt:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML

Query:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------
        Q LE KD +I RLTAELK  KERL NG+LLE +FRQH DFDGFAKDFSDAGFKFLMKGIAA+MPHLQIDLS LKKK                        
Subjt:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------

Query:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQDEFKEAN
        ELDSDYSD+EEEDA SQE  E+GTTQEEVPSQQD  +E N
Subjt:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQDEFKEAN

A0A6J1DXS5 uncharacterized protein LOC1110255023.6e-6959.04Show/hide
Query:  SDSGEDLAHRLESELEEIENFRFSDVGEDSDASVSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMFEYGLRLPLHPF
        S+   DLA RLES+LEEIEN R SD GEDSDAS SGQGLEYPS++PEHYLG LRRGF IP +ILLR+PEE ERADNPPEGWVTLY KMFEYGLRLPLHPF
Subjt:  SDSGEDLAHRLESELEEIENFRFSDVGEDSDASVSGQGLEYPSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVL-----------------------------------------YVRK------------
         QEFL RTGLAPAQVAPNGWGVIF LAI+FWLRARD E   +                                          +VRK            
Subjt:  AQEFLNRTGLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVL-----------------------------------------YVRK------------

Query:  --------ERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL
                   RF NLVSIR +PELTQASFDT+K+YK+RFP+GRK+GTL
Subjt:  --------ERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL

A0A6J1DZB3 uncharacterized protein LOC1110256656.2e-7772.96Show/hide
Query:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML
        FIASIH  VM+KAELDG EAL AKE+ENS AALEAATT+KGELLKA+ EVDILR EV+AK +LLK+E ++HKAHLRAAHAITKGLEK+KFQLLKEKDD+ 
Subjt:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML

Query:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------
        Q LE KDA+I RLT ELK  KERL NG LLE +FRQHPDFDGFAKDFSDAGFKFLMKGIAA+MPHLQIDL+ LKKK                        
Subjt:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------

Query:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQ
        ELDSDYSD+EEEDA SQE  EVGTTQEEVPSQQ
Subjt:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQQ

A0A6J1DZB3 uncharacterized protein LOC1110256655.7e-0647.5Show/hide
Query:  GLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVLYVRKERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL
        G+     +  GW   +  A   WL A+DE   +   V     RF NLVSI+LIPEL QA+FDT+K YKD FP+ RKI TL
Subjt:  GLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVLYVRKERRRFRNLVSIRLIPELTQASFDTVKFYKDRFPKGRKIGTL

A0A6J1DZB3 uncharacterized protein LOC1110256656.8e-7671.55Show/hide
Query:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML
        FIASIHS VM+KAELDG EALTAKE+EN S  LEAATT+KGELLKA+ EVDILR EV+AK +LLK+E ++HKAHLRAAHAITKGLEK+KFQLLKEKDD+ 
Subjt:  FIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDML

Query:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKK------------------------K
        Q LE KDA+I RLT ELK  KERL +GALLE +FRQHP+FDGFAKDFSDAGFKFLMKGIAA+MPHLQIDLS+LKK                        +
Subjt:  QALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKK------------------------K

Query:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQ
        ELDSDYSD+EEEDA SQE  +VGTTQEE PSQ
Subjt:  ELDSDYSDLEEEDALSQEQVEVGTTQEEVPSQ

A0A6J1DZB5 uncharacterized protein LOC1110248988.9e-6868.4Show/hide
Query:  MIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDMLQALEAKDAA
        MIKAELDG EAL AKEKENS AALEAATTMK ELLKARSEV IL+ +V+ KAE+LK+E ++HKAHL AAHAITK +EK+KFQLLKEKDD+ QALE  DA 
Subjt:  MIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKKFQLLKEKDDMLQALEAKDAA

Query:  IERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------ELDSDYSDL
        I RL+ ELK  KERL NG LLE AF+QHPDFDGFAKDFSDAGFKFLMKGIA +M HLQIDLS++KKK                        ELDSDYSD+
Subjt:  IERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKK------------------------ELDSDYSDL

Query:  EEEDALSQEQVEVGTTQEEVPSQQDEFKEAN
        EE DA SQE  EVGTTQEEVPSQ    +E N
Subjt:  EEEDALSQEQVEVGTTQEEVPSQQDEFKEAN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G32010.1 myosin heavy chain-related1.4e-0425.66Show/hide
Query:  RLESELEEIENFRFSDVGEDSDASVSGQGLEY------PSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQ
        R+ ++ +   N    D  E +D +VSG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F  
Subjt:  RLESELEEIENFRFSDVGEDSDASVSGQGLEY------PSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQ

Query:  EFLNRTGLAPAQV
         F     +A +Q+
Subjt:  EFLNRTGLAPAQV

AT5G38190.1 INVOLVED IN: biological_process unknown2.4e-0427.55Show/hide
Query:  DVGEDSDASVSGQGLEY------PSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQV
        D  E +D +VSG+  +       P+      +G       +P  + +RIP + +R  + PEG++ L+   F E GLR P+  F   F     +A +Q+
Subjt:  DVGEDSDASVSGQGLEY------PSKMPEHYLGPLRRGFKIPNDILLRIPEEEERADNPPEGWVTLYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCGTTGCGGAAGATGTTTCGATTTGCCAGGTTGTCGGAGTACTTAAGTATTCCGTCGTTACGAATTTCGAGATGATCCTGGCCACTCGTTCATTACACGTTGATAG
CCTAGGTAGCGTAGGTCGGACAATAAGTAGTTCGCCGCCCAAACCAAGTGACTCGGGGGAGGACTTAGCTCATAGGTTAGAGTCCGAACTAGAAGAGATTGAGAACTTTA
GATTTTCTGATGTTGGGGAGGATAGTGATGCTTCCGTCTCGGGTCAGGGTTTGGAATACCCTTCAAAAATGCCCGAGCACTATCTCGGACCCCTCCGTAGGGGGTTTAAA
ATTCCAAACGACATCCTCCTTAGGATTCCGGAGGAAGAGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTGAAAATGTTTGAGTACGGGCTCAGACT
TCCCCTTCACCCTTTTGCCCAAGAGTTCCTCAACCGAACTGGGTTGGCTCCTGCTCAAGTGGCCCCCAACGGATGGGGTGTCATTTTTGTGTTAGCCATCATTTTCTGGT
TGAGAGCTCGGGATGAAGAAGCCTGGTCGGTACTATATGTGCGCAAGGAAAGGCGCAGGTTCAGGAACTTAGTATCAATCAGGCTAATCCCCGAACTCACTCAAGCATCC
TTTGATACGGTTAAGTTTTACAAGGATCGTTTTCCGAAGGGTAGGAAGATCGGAACTCTAGAGTTCATTGCTTCTATCCATTCGACAGTCATGATAAAGGCCGAACTGGA
TGGAATGGAGGCTTTGACAGCAAAAGAGAAGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACAATGAAGGGCGAGCTACTGAAGGCTCGCTCCGAAGTGGATATCC
TGAGGCCCGAGGTGGAAGCTAAGGCCGAACTGCTGAAGAGGGAGGACAAGAGGCATAAGGCCCACCTCCGAGCTGCCCATGCGATCACTAAAGGGCTGGAGAAGAAGAAG
TTCCAACTCCTAAAGGAAAAGGACGACATGCTCCAGGCCCTCGAGGCGAAGGACGCTGCGATAGAGCGTCTCACTGCTGAGCTCAAAATGGAGAAGGAACGTCTTGCCAA
CGGAGCTCTTCTAGAAGCAGCCTTCAGGCAACACCCAGACTTTGATGGGTTTGCTAAGGACTTTAGCGACGCAGGCTTCAAGTTTTTGATGAAGGGTATTGCTGCCAACA
TGCCCCATCTTCAGATCGACCTCAGCGAACTGAAGAAGAAAGAGCTGGACTCTGACTACTCCGATCTCGAGGAAGAAGATGCTCTTAGTCAAGAGCAAGTCGAGGTCGGC
ACCACCCAAGAGGAGGTCCCTTCGCAGCAGGACGAGTTCAAGGAGGCCAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCGTTGCGGAAGATGTTTCGATTTGCCAGGTTGTCGGAGTACTTAAGTATTCCGTCGTTACGAATTTCGAGATGATCCTGGCCACTCGTTCATTACACGTTGATAG
CCTAGGTAGCGTAGGTCGGACAATAAGTAGTTCGCCGCCCAAACCAAGTGACTCGGGGGAGGACTTAGCTCATAGGTTAGAGTCCGAACTAGAAGAGATTGAGAACTTTA
GATTTTCTGATGTTGGGGAGGATAGTGATGCTTCCGTCTCGGGTCAGGGTTTGGAATACCCTTCAAAAATGCCCGAGCACTATCTCGGACCCCTCCGTAGGGGGTTTAAA
ATTCCAAACGACATCCTCCTTAGGATTCCGGAGGAAGAGGAAAGAGCTGACAATCCTCCAGAGGGATGGGTCACTCTTTACTTGAAAATGTTTGAGTACGGGCTCAGACT
TCCCCTTCACCCTTTTGCCCAAGAGTTCCTCAACCGAACTGGGTTGGCTCCTGCTCAAGTGGCCCCCAACGGATGGGGTGTCATTTTTGTGTTAGCCATCATTTTCTGGT
TGAGAGCTCGGGATGAAGAAGCCTGGTCGGTACTATATGTGCGCAAGGAAAGGCGCAGGTTCAGGAACTTAGTATCAATCAGGCTAATCCCCGAACTCACTCAAGCATCC
TTTGATACGGTTAAGTTTTACAAGGATCGTTTTCCGAAGGGTAGGAAGATCGGAACTCTAGAGTTCATTGCTTCTATCCATTCGACAGTCATGATAAAGGCCGAACTGGA
TGGAATGGAGGCTTTGACAGCAAAAGAGAAGGAGAACTCCTCTGCTGCCTTAGAGGCTGCCACCACAATGAAGGGCGAGCTACTGAAGGCTCGCTCCGAAGTGGATATCC
TGAGGCCCGAGGTGGAAGCTAAGGCCGAACTGCTGAAGAGGGAGGACAAGAGGCATAAGGCCCACCTCCGAGCTGCCCATGCGATCACTAAAGGGCTGGAGAAGAAGAAG
TTCCAACTCCTAAAGGAAAAGGACGACATGCTCCAGGCCCTCGAGGCGAAGGACGCTGCGATAGAGCGTCTCACTGCTGAGCTCAAAATGGAGAAGGAACGTCTTGCCAA
CGGAGCTCTTCTAGAAGCAGCCTTCAGGCAACACCCAGACTTTGATGGGTTTGCTAAGGACTTTAGCGACGCAGGCTTCAAGTTTTTGATGAAGGGTATTGCTGCCAACA
TGCCCCATCTTCAGATCGACCTCAGCGAACTGAAGAAGAAAGAGCTGGACTCTGACTACTCCGATCTCGAGGAAGAAGATGCTCTTAGTCAAGAGCAAGTCGAGGTCGGC
ACCACCCAAGAGGAGGTCCCTTCGCAGCAGGACGAGTTCAAGGAGGCCAACTGA
Protein sequenceShow/hide protein sequence
MTVAEDVSICQVVGVLKYSVVTNFEMILATRSLHVDSLGSVGRTISSSPPKPSDSGEDLAHRLESELEEIENFRFSDVGEDSDASVSGQGLEYPSKMPEHYLGPLRRGFK
IPNDILLRIPEEEERADNPPEGWVTLYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFVLAIIFWLRARDEEAWSVLYVRKERRRFRNLVSIRLIPELTQAS
FDTVKFYKDRFPKGRKIGTLEFIASIHSTVMIKAELDGMEALTAKEKENSSAALEAATTMKGELLKARSEVDILRPEVEAKAELLKREDKRHKAHLRAAHAITKGLEKKK
FQLLKEKDDMLQALEAKDAAIERLTAELKMEKERLANGALLEAAFRQHPDFDGFAKDFSDAGFKFLMKGIAANMPHLQIDLSELKKKELDSDYSDLEEEDALSQEQVEVG
TTQEEVPSQQDEFKEAN