CuGenDBv2

MS000366 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS000366
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: scaffold44:1233556..1235625
RNA-Seq Expression: MS000366
Synteny: MS000366
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase
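The genome location in this record follows a seqid:start..end pattern. A minimal Python sketch of splitting it into its parts is given here. The helper name parse_location is hypothetical, and the span length assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span_length)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    return seqid, start, end, end - start + 1


print(parse_location("scaffold44:1233556..1235625"))
```

Under that assumption the gene spans 2070 bp on scaffold44.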


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo] (e value: 9.4e-119; % identity: 87.24)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT P++NWP IKPKQNLQ+  LK NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LFADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKY+FRSDV F
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia] (e value: 5.9e-137; % identity: 99.59)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
        DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTFC
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KYIFRSDVTFC
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTFC

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata] (e value: 1.9e-119; % identity: 89.07)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKT P+++    WPAIKPKQ+LQI RLKENDLFTVPSF TSVESK FI  AESMGFVHQGSLGP+KGEA+RDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKY+FRSDV F
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

XP_023541473.1 uncharacterized protein LOC111801646 [Cucurbita pepo subsp. pepo] (e value: 7.2e-119; % identity: 88.66)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND
        MAE QG+KRKMAGRR EGDSKKT P+++    WPAIKPKQ+LQI RLKENDLFTVPSF TSVESK FI  AESMGFVHQGSLGP+KGEA+RDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKY+FRSDV F
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida] (e value: 6.5e-120; % identity: 88.48)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGD KKT P++ WPAIKPKQNLQI RLK+NDLFTVPSF + VESKGFI  AES+GFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
        D IWRSGLD LF+DIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
        VVAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKY+FRSDV F
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein (e value: 1.0e-118; % identity: 87.24)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGD KKT P++NWP IKPKQNLQ+  LK+NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSS+ LVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKY+FRSDV F
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

A0A1S3AY48 uncharacterized protein LOC103484033 (e value: 4.5e-119; % identity: 87.24)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT P++NWP IKPKQNLQ+  LK NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LFADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKY+FRSDV F
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

A0A6J1CIV9 uncharacterized protein LOC111011396 (e value: 2.8e-137; % identity: 99.59)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
        DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTFC
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KYIFRSDVTFC
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTFC

A0A6J1FZS7 uncharacterized protein LOC111449413 (e value: 9.2e-120; % identity: 89.07)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKT P+++    WPAIKPKQ+LQI RLKENDLFTVPSF TSVESK FI  AESMGFVHQGSLGP+KGEA+RDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKY+FRSDV F
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

A0A6J1HX87 uncharacterized protein LOC111467697 (e value: 2.3e-118; % identity: 88.26)
Query:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKT P+++    WPAIKPKQ+LQI RLKENDLFTVPSF  SVESK FI  AESMGFVHQGSLGP+KGEA+RDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTTPTTN----WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKY+FRSDV F
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYIFRSDVTF

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 2.7e-87; % identity: 72.6)
Query:  WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLADTIWRSGLDKLFADIKIRGKVAVGLNPN
        WP IK K NL +  LK +DLFTV + LTS ESK F+ IAES+GF HQGS GP  GEA+RDN RISVNDP LADT+W+SGL  LF DIKIR KVAVGLNPN
Subjt:  WPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLADTIWRSGLDKLFADIKIRGKVAVGLNPN

Query:  IRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KGKTKND---TNDSKGPSSEPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLLH
        IRFYRY+ GQ FGRHIDES DL  G RTYYTLLIYLSG S K K+K+    TNDS   S+EPLVGGETVFYGSRN +VAEVAP EGMAL H+HGDKC+LH
Subjt:  IRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KGKTKND---TNDSKGPSSEPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLLH

Query:  EARNVTKGVKYIFRSDVTF
        E RNV+KGVKY+FRSDV F
Subjt:  EARNVTKGVKYIFRSDVTF
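In BLAST-style alignments such as the ones listed here, the middle line shows a letter where query and subject are identical, a + for a conservative substitution, and a blank for a mismatch or gap. A rough Python sketch of deriving percent identity from one such middle line follows. The function name identity_from_midline is hypothetical, and how this database computes its reported % identity over the full alignment is an assumption, not something the record states.

```python
def identity_from_midline(midline: str, aligned_length: int) -> float:
    """Percent identity for one alignment block, from its BLAST-style midline."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    # Letters mark identical positions; '+' and spaces are not identities.
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / aligned_length


# Last block of the Arabidopsis hit: 19 aligned columns.
query = "EARNVTKGVKYIFRSDVTF"
midline = "E RNV+KGVKY+FRSDV F"
print(round(identity_from_midline(midline, len(query)), 2))
```

This covers a single block only; a whole-alignment figure would sum identical positions and aligned columns across all blocks before dividing.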


Sequences
CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTCCAAGAAAACAACCCCAACAACGAATTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCAGATCGTCCGCCTCAAAGAAAATGATCTCTTCACGGTGCCAAGTTTTCTCACATCTGTTGAGTCAAAAGGATTCATCACGATTGCAGAATCGATGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCCACGAAAGGAGAAGCTTTCAGAGATAATGATCGGATCTCGGTGAATGATCCCGATTTAGCAGACACCATTTGGCGTTCGGGACTTGATAAA
CTATTTGCTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTTTACAGATACAATGTTGGTCAACGCTTCGGACGCCATATTGATGAAAG
TGTTGATCTTGGAGGGGGAAAGCGCACATATTACACTTTGTTAATATATTTAAGTGGAGGTTCTAAAGGCAAAACAAAAAACGATACCAACGATTCCAAAGGTCCTTCTT
CCGAGCCTCTGGTTGGAGGGGAAACCGTTTTCTATGGTTCAAGGAATGGCGTTGTGGCCGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAATGTTACGAAGGGTGTCAAATACATTTTCCGTTCAGATGTCACATTTTGT
mRNA sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTCCAAGAAAACAACCCCAACAACGAATTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCAGATCGTCCGCCTCAAAGAAAATGATCTCTTCACGGTGCCAAGTTTTCTCACATCTGTTGAGTCAAAAGGATTCATCACGATTGCAGAATCGATGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCCACGAAAGGAGAAGCTTTCAGAGATAATGATCGGATCTCGGTGAATGATCCCGATTTAGCAGACACCATTTGGCGTTCGGGACTTGATAAA
CTATTTGCTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTTTACAGATACAATGTTGGTCAACGCTTCGGACGCCATATTGATGAAAG
TGTTGATCTTGGAGGGGGAAAGCGCACATATTACACTTTGTTAATATATTTAAGTGGAGGTTCTAAAGGCAAAACAAAAAACGATACCAACGATTCCAAAGGTCCTTCTT
CCGAGCCTCTGGTTGGAGGGGAAACCGTTTTCTATGGTTCAAGGAATGGCGTTGTGGCCGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAATGTTACGAAGGGTGTCAAATACATTTTCCGTTCAGATGTCACATTTTGT
Protein sequence
MAETQGKKRKMAGRRGEGDSKKTTPTTNWPAIKPKQNLQIVRLKENDLFTVPSFLTSVESKGFITIAESMGFVHQGSLGPTKGEAFRDNDRISVNDPDLADTIWRSGLDK
LFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKGKTKNDTNDSKGPSSEPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDK
CLLHEARNVTKGVKYIFRSDVTFC
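The CDS and protein sequences can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal Python example that assumes the standard genetic code and reading frame 1; the names CODON_TABLE and translate are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons of the CDS listed in this record.
print(translate("ATGGCGGAAACACAGGGGAAGAAGAGAAAA"))
```

The first ten codons give MAETQGKKRK, which matches the start of the protein sequence in this record. Pasting the full CDS lines into translate should allow the same comparison over the whole sequence.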