; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g05950 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g05950
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr3:4363219..4365178
RNA-Seq ExpressionMoc03g05950
SyntenyMoc03g05950
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0140640 - catalytic activity, acting on a nucleic acid (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]5.5e-8352.26Show/hide
Query:  REHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYRVGRIHSVARI-RINDLRIPTEY
        R+H    LK  + +       SQ P       G+NV    +       Q TS     +  L  + TN +        +   ++S     RI++LR+P  Y
Subjt:  REHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYRVGRIHSVARI-RINDLRIPTEY

Query:  QHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCKQRKGEPVVEHI
        Q PKFQ FDGKGN KQH+AHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RT+SMMELTN KQRKGEPV+++I
Subjt:  QHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCKQRKGEPVVEHI

Query:  NRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALK
        NRWRALSLD KD+L E+  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+ LV +VKK+KKE+   EKI K   KESMVVNT  LK
Subjt:  NRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALK

Query:  FSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        FS   K KE R+EK+ DG+  RRL+L++RQE VYP  D D+ +MLEQLLEK LI
Subjt:  FSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]4.2e-8363.4Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK
        RI++LR+P  YQ PKFQ FDGKGN KQH+AHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RT+SMMELTN K
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK

Query:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK
        QRKGEPV+++INRWRALSLD KD+L E+  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+ LV +VKK+KKE+   EKI K   K
Subjt:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK

Query:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        ESMVVNT  LKFS   K KE R+EK+ DG+  RRL+L++RQE VYP  D D+ +MLEQLLEK LI
Subjt:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]4.2e-8363.4Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK
        RI++LR+P  YQ PKFQ FDGKGN KQH+AHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RT+SMMELTN K
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK

Query:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK
        QRKGEPV+++INRWRALSLD KD+L E+  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+ LV +VKK+KKE+   EKI K   K
Subjt:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK

Query:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        ESMVVNT  LKFS   K KE R+EK+ DG+  RRL+L++RQE VYP  D D+ +MLEQLLEK LI
Subjt:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]1.9e-8363.4Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK
        RI++LR+P  YQ PKFQ FDGKGN KQH+AHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RT+SMMELTN K
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK

Query:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK
        QRKGEPV+++INRWRALSLD KD+L E+  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+ LV +VKK+KKE+   EKI K  +K
Subjt:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK

Query:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        ESMVVNT  LKFS   K KE R+EK+ DG+  RRL+L++RQE VYP  D D+ +MLEQLLEK LI
Subjt:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]5.5e-8352.26Show/hide
Query:  REHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYRVGRIHSVARI-RINDLRIPTEY
        R+H    LK  + +       SQ P       G+NV    +       Q TS     +  L  + TN +        +   ++S     RI++LR+P  Y
Subjt:  REHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYRVGRIHSVARI-RINDLRIPTEY

Query:  QHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCKQRKGEPVVEHI
        Q PKFQ FDGKGN KQH+AHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RT+SMMELTN KQRKGEPV+++I
Subjt:  QHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCKQRKGEPVVEHI

Query:  NRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALK
        NRWRALSLD KD+L E+  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+ LV +VKK+KKE+   EKI K   KESMVVNT  LK
Subjt:  NRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALK

Query:  FSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        FS   K KE R+EK+ DG+  RRL+L++RQE VYP  D D+ +MLEQLLEK LI
Subjt:  FSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

TrEMBL top hitse value%identityAlignment
A0A5A7STA5 Ty3-gypsy retrotransposon protein3.3e-8167.5Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL
        RI++LR+P  YQ PKFQ FDGKGN KQHIAHF+ET ENA +RGD+  +QFVR+LKGNAF    RT+SMMELTN KQRKGEPV+++INRWRALSLD KD+L
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL

Query:  MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEK
         E+  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K   V +V+K+KKE  S EK+ K ++KESMVVNT  LKFS   K KE R EK
Subjt:  MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEK

Query:  QQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        + DG+  RRL+L++RQ+ VYP    D+ +MLEQLLEK LI
Subjt:  QQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

A0A5A7UWW2 Ty3-gypsy retrotransposon protein4.0e-7964.58Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL
        RI++LR+P  YQ  KFQ FDGK N KQHIAHF+ETCENAG RGD+  +QF+R+LKGNAF CT RT+SMMELTN KQ KGEPV+++INRWRALSLD +D+L
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL

Query:  MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEK
         ++  VEMCTQG HW LL+ILQGIKPRTFEELATR HDME+SI+ RG K   V +V+K+KKE  S EK+ K  +KESMVVNT ALKFS   K KE R +K
Subjt:  MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEK

Query:  QQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        + DG+  +  +L++RQ+ VYP  D D+ +MLEQLLEK LI
Subjt:  QQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

A0A5A7V8K5 Ty3-gypsy retrotransposon protein5.9e-8351.93Show/hide
Query:  REHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYRV---GRIHSVARI------RIN
        R+H  T L+         +  SQ P       G+NV            Q   P  Q +  +A L    L  +I N  R    G+  +          RI+
Subjt:  REHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYRV---GRIHSVARI------RIN

Query:  DLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCKQRK
        +L +P  YQ PKFQ FDGKGN KQHIAHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RTISMMELTN KQ K
Subjt:  DLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCKQRK

Query:  GEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESM
        GEPV+++INRWRALSLD KD+L ++  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+ LV +VKK+KKE  S EK+ K ++KESM
Subjt:  GEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESM

Query:  VVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        VVNTI LKFS   K KE R EK+ DG+  RRL+L++RQE VYP  D D+ +MLEQLLEK LI
Subjt:  VVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

A0A5D3C0W6 Ty3-gypsy retrotransposon protein1.2e-8062.64Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK
        RI++L +P  YQ PKFQ FDGKGN KQHIAHF+ETCENAG+RGD+  +QFVR+LKGNAF                           T RTISMMELTN K
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFN-------------------------CTTRTISMMELTNCK

Query:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK
        Q KGEPV+++INRWRAL LD KD+L ++  VEMCTQG HWGLLYILQGIKPRTFEELATR HDME+SI+SRG K+  V +VKK+KKE  S EK+ K ++K
Subjt:  QRKGEPVVEHINRWRALSLDYKDKLMEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMK

Query:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        ESMVVNTI LKFS   K KE R EK+ DG+  RRL+L++RQE VYP  D D  +MLEQLLEK LI
Subjt:  ESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

A0A5D3DR73 Ty3-gypsy retrotransposon protein4.0e-7964.58Show/hide
Query:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL
        RI++LR+P  YQ  KFQ FDGK N KQHIAHF+ETCENAG RGD+  +QF+R+LKGNAF CT RT+SMMELTN KQ KGEPV+++INRWRALSLD +D+L
Subjt:  RINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL

Query:  MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEK
         ++  VEMCTQG HW LL+ILQGIKPRTFEELATR HDME+SI+ RG K   V +V+K+KKE  S EK+ K  +KESMVVNT ALKFS   K KE R +K
Subjt:  MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEK

Query:  QQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI
        + DG+  +  +L++RQ+ VYP  D D+ +MLEQLLEK LI
Subjt:  QQDGNVGRRLSLEDRQENVYPLTDPDLVNMLEQLLEKGLI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAACCCAGTTATGGTAAAACCAATAACCGGAAGTCCCTTTCGGCCAATGAGAAGGTGGGGCCCCTTGTTCAAGTCCCGGAGTCAGAATTTAAGGGAACACTCATC
TACTCCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAACTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGTCGACGACTCGGG
CCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGATACCCCCACTCGCATGTCTCCACACGAACGACCTGGATCGAGTCATCTGTAACCGTTACAGA
GTGGGTCGTATCCATAGTGTTGCCAGGATAAGGATCAATGATCTGAGAATTCCTACTGAATACCAACATCCAAAGTTCCAGCATTTCGATGGGAAGGGCAATTCTAAGCA
ACATATTGCTCACTTTATGGAAACTTGTGAGAATGCTGGTACAAGAGGCGATAAACGGGCCAAGCAGTTTGTCCGAACGTTGAAGGGAAACGCCTTTAACTGTACGACGA
GAACTATTAGCATGATGGAGCTCACCAACTGCAAACAAAGAAAAGGTGAACCGGTTGTCGAACATATCAACCGGTGGAGAGCTCTAAGTCTTGACTACAAGGATAAACTT
ATGGAAGTACCTGTTGTAGAAATGTGCACCCAAGGTAGGCACTGGGGACTTCTTTACATCCTTCAAGGGATAAAACCCCGCACATTCGAAGAACTAGCAACTCGCGTTCA
TGATATGGAGATAAGTATTTCTAGTAGAGGTAATAAAGAGCCTCTCGTCTCAGATGTGAAGAAAGAGAAAAAGGAAGTGACGAGCTTTGAGAAGATTTCCAAGCGTGCTA
TGAAAGAATCTATGGTCGTTAATACGATCGCTCTCAAATTTTCCTCGAAAGGAAAAGAAAAAGAAAATAGGATAGAAAAGCAACAAGATGGAAACGTAGGACGTCGCCTA
TCTCTAGAAGATAGACAAGAAAATGTGTATCCTCTTACAGATCCTGATCTTGTCAATATGTTAGAACAACTATTAGAAAAGGGACTCATATAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAACCCAGTTATGGTAAAACCAATAACCGGAAGTCCCTTTCGGCCAATGAGAAGGTGGGGCCCCTTGTTCAAGTCCCGGAGTCAGAATTTAAGGGAACACTCATC
TACTCCCCTAAAGTCGGGGAGGAGTGAATTCCATCTTGTGAAGTTATGTTCCCAACTCCCCACTCGGTCTCGTCCCCAAAATGGTAGGAATGTTGAGTCGACGACTCGGG
CCACTCTCACCCATACAGATCAAAGGACGAGCCCTCACGGGCAGGAGATACCCCCACTCGCATGTCTCCACACGAACGACCTGGATCGAGTCATCTGTAACCGTTACAGA
GTGGGTCGTATCCATAGTGTTGCCAGGATAAGGATCAATGATCTGAGAATTCCTACTGAATACCAACATCCAAAGTTCCAGCATTTCGATGGGAAGGGCAATTCTAAGCA
ACATATTGCTCACTTTATGGAAACTTGTGAGAATGCTGGTACAAGAGGCGATAAACGGGCCAAGCAGTTTGTCCGAACGTTGAAGGGAAACGCCTTTAACTGTACGACGA
GAACTATTAGCATGATGGAGCTCACCAACTGCAAACAAAGAAAAGGTGAACCGGTTGTCGAACATATCAACCGGTGGAGAGCTCTAAGTCTTGACTACAAGGATAAACTT
ATGGAAGTACCTGTTGTAGAAATGTGCACCCAAGGTAGGCACTGGGGACTTCTTTACATCCTTCAAGGGATAAAACCCCGCACATTCGAAGAACTAGCAACTCGCGTTCA
TGATATGGAGATAAGTATTTCTAGTAGAGGTAATAAAGAGCCTCTCGTCTCAGATGTGAAGAAAGAGAAAAAGGAAGTGACGAGCTTTGAGAAGATTTCCAAGCGTGCTA
TGAAAGAATCTATGGTCGTTAATACGATCGCTCTCAAATTTTCCTCGAAAGGAAAAGAAAAAGAAAATAGGATAGAAAAGCAACAAGATGGAAACGTAGGACGTCGCCTA
TCTCTAGAAGATAGACAAGAAAATGTGTATCCTCTTACAGATCCTGATCTTGTCAATATGTTAGAACAACTATTAGAAAAGGGACTCATATAA
Protein sequenceShow/hide protein sequence
MNNPVMVKPITGSPFRPMRRWGPLFKSRSQNLREHSSTPLKSGRSEFHLVKLCSQLPTRSRPQNGRNVESTTRATLTHTDQRTSPHGQEIPPLACLHTNDLDRVICNRYR
VGRIHSVARIRINDLRIPTEYQHPKFQHFDGKGNSKQHIAHFMETCENAGTRGDKRAKQFVRTLKGNAFNCTTRTISMMELTNCKQRKGEPVVEHINRWRALSLDYKDKL
MEVPVVEMCTQGRHWGLLYILQGIKPRTFEELATRVHDMEISISSRGNKEPLVSDVKKEKKEVTSFEKISKRAMKESMVVNTIALKFSSKGKEKENRIEKQQDGNVGRRL
SLEDRQENVYPLTDPDLVNMLEQLLEKGLI