; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0017161 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0017161
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationtig00002012:23105..25437
RNA-Seq ExpressionIVF0017161
SyntenyIVF0017161
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]6.20e-172100Show/hide
Query:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]5.39e-17397.13Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGLDNLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]5.01e-17899.59Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]5.24e-15587.24Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT P++NWP IKPKQNLQ+  LK NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]4.05e-16692.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGDF KTDPSS WP IKPKQNLQ++ LK NDLFTVPSFF+CVESK FIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGLDNLF+DIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        V+AEV+PTEGMALLHLHGDKCLLHEARNV +GVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein8.5e-13497.13Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGLDNLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840331.3e-13799.59Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 19.4e-133100Show/hide
Query:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113965.4e-12087.24Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT P++NWP IKPKQNLQ+  LK NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494131.3e-11887.1Show/hide
Query:  MAETQGKKRKMAGRRGEGDFNKT----DPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD  KT     PSS WP IKPKQ+LQ+N LK NDLFTVPSFFT VESKAFIK AES+GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFNKT----DPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS
         RN V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVMKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.1e-8871.49Show/hide
Query:  SSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKIAVGL
        S  WP IK K NL V+ LK +DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR K+AVGL
Subjt:  SSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKIAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQ FGRHIDES DL  G RTYYTLLIYLSG S K+K+K+ ++ + D SS EPLVGGETVFYGSRN ++AEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVMKGVKYVFRSDVIFS
        HE RNV KGVKYVFRSDV+F+
Subjt:  HEARNVMKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAATAAAACAGACCCCTCTTCGAACTGGCCACCCATTAAACCCAAGCAGAA
TCTTCAAGTCAACCTGCTGAAAGGAAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTCTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAAT
CTATTTGCCGATATTAAAATTCGGGGCAAAATTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTGGGTCAGCGCTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACCTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAACGATACGAACAATTCCAAAGATCCTTCAT
CTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGTGTTATTGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAACGTTATGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGA
mRNA sequenceShow/hide mRNA sequence
AGAGGCACCAGTTGATTTCTGATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAATAAAACAGACCCCTCTTCGAACTGGCC
ACCCATTAAACCCAAGCAGAATCTTCAAGTCAACCTGCTGAAAGGAAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTG
CAGAGTCATTGGGTTTTCTTCATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATT
TGGCGTTCCGGGCTTGATAATCTATTTGCCGATATTAAAATTCGGGGCAAAATTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTGGGTCAGCGCTT
TGGACGTCATATTGATGAAAGTGTTGATCTTGGAGGAGGCAAGCGCACATATTATACCTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAACGATACGA
ACAATTCCAAAGATCCTTCATCTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGTGTTATTGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTC
CTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAACGTTATGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGATTAGTTCAGGTTGT
TGGTGACCTCAATGGAAAGAAAAAAAAGAAATCGAATTATCTACACATGACCAAGAATGTAGTACTGATGATTTGGTTGTTTATACGCCTCCTGGTGTCGTGAAAACTGC
AAAACAATCACTTAGGTAGCCTCCTTATTTGATTGGATGACCCCAACATATCTCGGGAAAGAGCTCGTTCAATGTCGATTTTTTTTAAAGTCCAGGTTGGTTCCAGGAAT
AAGCTATAGAATTCAGTGTATCATTTGTTTGAAATGAAGCCTTTCATAATCTTGTAAGTTGTAATACGAAGTGGCTGGTTGAATTATATCATTACATGACATTCTACTGC
ATATGATGCCACTCTGATATATGTACCCTCGTCACTTTTCTGCCTAAGGAATTTACTTTTCTAAATTTGATGTTTCATTTCTTATGTTAGGCTTGTTACTCAAAGTTGCC
ATATTTTGGTGGCCTTTGAATATATTGGGAAACTATG
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDFNKTDPSSNWPPIKPKQNLQVNLLKGNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
LFADIKIRGKIAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDK
CLLHEARNVMKGVKYVFRSDVIFS