CuGenDBv2

CmoCh18G012350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh18G012350
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: Cmo_Chr18:12203709..12206057
RNA-Seq Expression: CmoCh18G012350
Synteny: CmoCh18G012350
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase
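
The genome location field follows a `sequence:start..end` pattern. A minimal Python sketch for parsing it is given here; the `Location` type and `parse_location` helper are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cmo_Chr18:12203709..12206057'."""
    match = re.fullmatch(r"\s*(\S+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Location(seqid, start, end)


loc = parse_location("Cmo_Chr18:12203709..12206057")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 2349 bp on Cmo_Chr18.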


Homology
GenBank top hits (e value, % identity, alignment)
XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia] (e value 3.3e-119, 88.66% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKT P+++    WPAIKPKQ+LQI RLKENDLFTVPSF TSVESK FI  AESMGFVHQGSLGP+KGEA+RDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata] (e value 4.9e-139, 100% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022968459.1 uncharacterized protein LOC111467697 [Cucurbita maxima] (e value 3.5e-137, 98.79% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFF SVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_023541473.1 uncharacterized protein LOC111801646 [Cucurbita pepo subsp. pepo] (e value 1.8e-138, 99.6% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAE QGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida] (e value 1.1e-119, 88.31% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI+RLK+NDLFTVPSFF+ VESK FIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLAD IWRSGLD  F+DIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein (e value 5.6e-117, 86.69% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAET G+KRKMAGRR EGD KKT     PSS WP IKPKQ+LQ+N LK+NDLFTVPSFFT VESKAFIK AES+GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDPSS+ LVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC103484033 (e value 2.5e-117, 86.69% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD  KT     PSS WP IKPKQ+LQ+N LK NDLFTVPSFFT VESKAFIK AES+GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLAD IWRSGL+  FADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC111011396 (e value 1.6e-119, 88.66% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKT P+++    WPAIKPKQ+LQI RLKENDLFTVPSF TSVESK FI  AESMGFVHQGSLGP+KGEA+RDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC111449413 (e value 2.4e-139, 100% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC111467697 (e value 1.7e-137, 98.79% identity)
Query:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
        MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFF SVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND
Subjt:  MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK FADIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  WRNVVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value 7.1e-88, 71.04% identity)
Query:  SSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVNDPDLADTIWRSGLDKQFADIKIRGKVAVGL
        S +WP IK K +L ++ LK +DLFTV +  TS ESKAF+K AES+GF HQGS GP+ GEAYRDN RISVNDP LADT+W+SGL   F DIKIR KVAVGL
Subjt:  SSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVNDPDLADTIWRSGLDKQFADIKIRGKVAVGL

Query:  NPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGS-KTKTKNDTNNSKDPSS-EPLVGGETVFYGWRNVVVAEVAPTEGMALLHLHGDKCLL
        NPNIRFYRY+ GQ FGRHIDES DL +G RTYYTLLIYLSG S K+K+K+ ++ + D SS EPLVGGETVFYG RN +VAEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGS-KTKTKNDTNNSKDPSS-EPLVGGETVFYGWRNVVVAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS
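
In the alignments above, the line between each Query and Subjt row is the match line: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following Python sketch shows how a percent-identity figure could be derived from such blocks. It assumes identity is the number of identical positions divided by the aligned length (gap columns included); `percent_identity` is a hypothetical helper, and the page does not say exactly how its reported values were computed.

```python
def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Compute percent identity from (query_row, match_row) alignment blocks.

    Assumption: identity = identical positions / aligned length, where the
    aligned length is the total length of the query rows, gaps included.
    """
    identical = 0
    aligned_length = 0
    for query_row, match_row in blocks:
        aligned_length += len(query_row)
        # Letters in the match row mark identical residues;
        # '+' and spaces are not counted.
        identical += sum(1 for ch in match_row if ch.isalpha())
    if aligned_length == 0:
        raise ValueError("no aligned positions")
    return 100.0 * identical / aligned_length


# Toy block for illustration only, not taken from the record.
blocks = [("MAETQGQK", "MAE QG+K")]
print(round(percent_identity(blocks), 2))
```

The row strings must be passed without their `Query:` and `Subjt:` prefixes, and trailing spaces in a match row can be dropped safely because only letters are counted.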


Sequences
CDS sequence
ATGGCAGAAACACAGGGGCAGAAGAGAAAAATGGCGGGTAGAAGAAGGGAGGGAGATTCGAAGAAAACAGCCCCATCTTCCCACCCATCTTCCCAGTGGCCAGCC
ATTAAACCCAAGCAGGATCTTCAGATCAATCGCCTCAAAGAAAATGATCTTTTCACCGTACCAAGTTTTTTCACAAGTGTTGAGTCAAAAGCATTCATCAAGACG
GCAGAGTCGATGGGTTTTGTTCATCAGGGGAGCCTCGGTCCTTCTAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGAC
ACCATTTGGCGTTCGGGACTTGATAAACAATTTGCTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTTTACAGATACAACGTT
GGGCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGAAGGCAAGCGAACGTACTATACTTTGTTAATATATTTAAGTGGAGGTTCCAAAACTAAG
ACAAAAAATGATACCAACAATTCCAAAGATCCTTCTTCTGAGCCTCTAGTTGGTGGGGAAACTGTTTTCTATGGTTGGAGGAATGTCGTTGTGGCTGAGGTGGCT
CCTACAGAAGGGATGGCTCTTCTGCATCTTCATGGAGACAAGTGTTTGTTGCATGAAGCTCGCAACGTTACCAAGGGTGTCAAATATGTTTTCCGTTCAGACGTC
ATATTTTCTTGA
mRNA sequence
ATGGCAGAAACACAGGGGCAGAAGAGAAAAATGGCGGGTAGAAGAAGGGAGGGAGATTCGAAGAAAACAGCCCCATCTTCCCACCCATCTTCCCAGTGGCCAGCC
ATTAAACCCAAGCAGGATCTTCAGATCAATCGCCTCAAAGAAAATGATCTTTTCACCGTACCAAGTTTTTTCACAAGTGTTGAGTCAAAAGCATTCATCAAGACG
GCAGAGTCGATGGGTTTTGTTCATCAGGGGAGCCTCGGTCCTTCTAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGAC
ACCATTTGGCGTTCGGGACTTGATAAACAATTTGCTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTTTACAGATACAACGTT
GGGCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGAAGGCAAGCGAACGTACTATACTTTGTTAATATATTTAAGTGGAGGTTCCAAAACTAAG
ACAAAAAATGATACCAACAATTCCAAAGATCCTTCTTCTGAGCCTCTAGTTGGTGGGGAAACTGTTTTCTATGGTTGGAGGAATGTCGTTGTGGCTGAGGTGGCT
CCTACAGAAGGGATGGCTCTTCTGCATCTTCATGGAGACAAGTGTTTGTTGCATGAAGCTCGCAACGTTACCAAGGGTGTCAAATATGTTTTCCGTTCAGACGTC
ATATTTTCTTGACGATACGTTTAGGTTGTCAGTCTCGACGAAAAATAATTGAATTACCTACACATGGTTAAGATTGTGCTATTGACTTGGTTATCTACAGGCCCC
ATGATGTCGTGAAAACTGCCAAACAGGTCACTAAGGTATCCTCCTCATTTGATTCAATGTCTAAAAACCAGATTTTATATTGTAAAAGAGCTCGTCCAAAGTTGA
TTTTCTTGCGTCCAGGTTGGTCCCTGGAATAGCTAAAGAATTCAGTGTATCACTACTTGAAATGGAACTGTTCATAGTCTTATAAGCTGTATTACTTGGTGGCTT
GTTGAAATATATGACATTCTACTGCATATGCCACTGATATATGCTGTTTGTTATATTTACCTTCTTTCCTTTTCTGCCAAAGGAATTTTATTTTCTAAATTTGAT
GTTTCATCTCGTGTCATGATTGTAGTTCAAAATTTCCATCTTTTTGTGGCTTTTGAATTAGGGCATTACAAAATGGTTCATTTTTATCGTATTCAGAAACTTAGA
TTTATAGGACGGGTGAGTGTATTGTTTGAAAAAACTCTTCTTGATTGAAGAGTTGTGTTCAACAATATGACATTAGAGCGAGGATGGTATAGGTTATCAGATCTC
CTAAAACTCTTAGTATGCTA
Protein sequence
MAETQGQKRKMAGRRREGDSKKTAPSSHPSSQWPAIKPKQDLQINRLKENDLFTVPSFFTSVESKAFIKTAESMGFVHQGSLGPSKGEAYRDNDRISVNDPDLAD
TIWRSGLDKQFADIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKTKTKNDTNNSKDPSSEPLVGGETVFYGWRNVVVAEVA
PTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
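
The protein sequence should correspond to a translation of the CDS. As a hedged illustration, the Python sketch below translates a nucleotide string with the standard genetic code, which is assumed to apply to this nuclear gene; `translate` is a hypothetical helper, not a CuGenDB tool. The example input is the first 36 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence; stop codons are written as '*'."""
    # Remove line breaks and other whitespace, then normalize case.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 36 bases of the CDS sequence given in this record.
print(translate("ATGGCAGAAACACAGGGGCAGAAGAGAAAAATGGCG"))  # MAETQGQKRKMA
```

Passing the full CDS, with its line breaks left in, should let the result be compared directly against the protein sequence listed here.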