; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C006861 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C006861
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationchr06:6757360..6759742
RNA-Seq ExpressionMELO3C006861
SyntenyMELO3C006861
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]9.7e-13299.57Show/hide
Query:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNN
        MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGL+N
Subjt:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNN

Query:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]8.7e-13396.72Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]9.0e-138100Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]4.2e-11986.83Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT P++NWP IKPKQNLQ+  LK NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LFADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]1.4e-12792.21Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGDF KTDPSS WP IKPKQNLQ++ LK NDLFTVPSFF+CVESK FIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGL+NLF+DIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        V+AEV+PTEGMALLHLHGDKCLLHEARNV +GVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein4.2e-13396.72Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840334.4e-138100Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 14.7e-13299.57Show/hide
Query:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNN
        MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGL+N
Subjt:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNN

Query:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113962.0e-11986.83Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT P++NWP IKPKQNLQ+  LK NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LFADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494136.6e-11886.69Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKT----DPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD  KT     PSS WP IKPKQ+LQ+N LK NDLFTVPSFFT VESKAFIK AES+GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFNKT----DPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLAD IWRSGL+  FADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADIIWRSGLNNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
         RN V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.1e-8871.49Show/hide
Query:  SSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNNLFADIKIRGKIAVGL
        S  WP IK K NL V+ LK +DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL+NLF DIKIR K+AVGL
Subjt:  SSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNNLFADIKIRGKIAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQ FGRHIDES DL  G RTYYTLLIYLSG S K+K+K+ ++ + D SS EPLVGGETVFYGSRN ++AEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVMKGVKYVFRSDVIFS
        HE RNV KGVKYVFRSDV+F+
Subjt:  HEARNVMKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAATAAAACAGACCCCTCTTCGAACTGGCCACCCATTAAACCCAAGCAGAA
TCTTCAAGTCAACCTGCTGAAAGGAAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTCTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTAATAAT
CTATTTGCCGATATTAAAATTCGGGGCAAAATTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTGGGTCAGCGCTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACCTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAACGATACGAACAATTCCAAAGATCCTTCAT
CTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGTGTTATTGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAACGTTATGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGA
mRNA sequenceShow/hide mRNA sequence
GAAGAATAGAGGCACCAGTTTATTTCTGATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAATAAAACAGACCCCTCTTCGA
ACTGGCCACCCATTAAACCCAAGCAGAATCTTCAAGTCAACCTGCTGAAAGGAAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATC
AAGGCTGCAGAGTCATTGGGTTTTCTTCATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGA
CATCATTTGGCGTTCCGGGCTTAATAATCTATTTGCCGATATTAAAATTCGGGGCAAAATTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTGGGTC
AGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGGAGGCAAGCGCACATATTATACCTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAAC
GATACGAACAATTCCAAAGATCCTTCATCTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGTGTTATTGCTGAGGTGGCTCCTACTGAAGGGAT
GGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAACGTTATGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGATTAGTTC
AGGTTGTTGGTGACCTCAATGGAAAGAAAAAAAAGAAATCGAATTATCTACACATGACCAAGAATGTAGTACTGATGATTTGGTTGTTTATACGCCTCCTGGTGTCGTGA
AAACTGCAAAACAATCACTTAGGTAGCCTCCTTATTTGATTGGATGACCCCAACATATCTCGGTAAAGAGCTCGTTCAATGTCGATTTTTTTTAACGTCCAGGTTGGTTC
CAGGAATAAGCTATAGAATTCAGTGTATCATTTGTTTGAAATGAAGCCTTCATAATCTTGTAAGTCGTAATACGAAGTGGCTGGTTGAATTATATCATTACATGACATTC
TACTGCATATGATGCCACTCTGATATATGTACCCTCGTCACTTTTCTGCCTAAGGAATTTACTTTTCTAAATTTGATGTTTCATTTCTTATGTTAGGCTTGTTACTCAAA
GTTGCCATATTTTGGTGGCCTTTGAATATATTGGGAAACTATGCCATTTGTTACTCTTCTATTAGAGTCACTTTTGTCTTGATGACT
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLNN
LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDK
CLLHEARNVMKGVKYVFRSDVIFS