; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G016570 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G016570
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationGy14Chr3:12319107..12321623
RNA-Seq ExpressionCsGy3G016570
SyntenyCsGy3G016570
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]3.44e-16797.44Show/hide
Query:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]2.58e-178100Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]3.25e-17296.72Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]1.28e-15386.83Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGD KKT P++NWP IKPKQNLQ+  LK+NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]2.44e-16592.62Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRR EGDFKKTDPSS WP IKPKQNLQ++ LKDNDLFTVPSFF+CVESK FIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        V+AEV+PTEGMALLHLHGDKCLLHEARNV +GVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein1.25e-178100Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840331.57e-17296.72Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 11.67e-16797.44Show/hide
Query:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113966.21e-15486.83Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGD KKT P++NWP IKPKQNLQ+  LK+NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494134.84e-15286.69Show/hide
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSN----WPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND
        MAET G+KRKMAGRR EGD KKT PSS+    WP IKPKQ+LQ+N LK+NDLFTVPSFFT VESKAFIK AES+GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSN----WPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDPSS+ LVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYG

Query:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
         RN V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.5e-8771.04Show/hide
Query:  SSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S  WP IK K NL V+ LK++DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-DTLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQ FGRHIDES DL  G RTYYTLLIYLSG S K+K+K+ ++ + D SS + LVGGETVFYGSRN ++AEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-DTLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVRKGVKYVFRSDVIFS
        HE RNV KGVKYVFRSDV+F+
Subjt:  HEARNVRKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACTGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCCAACTGGCCACCCATTAAACCCAAGCAGAA
TCTTCAAGTTAACCTCCTGAAAGACAATGATCTTTTCACCGTTCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTCTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGACAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAAT
CTATTTGCTGATATTAAAATTCGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTCGGTCAGCGCTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAATGATACGAACAATTCCAAAGATCCTTCTT
CTGACACTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCTAGGAATGGTGTTATCGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAACGTTAGGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAACAAATTCAAAATGTCCATGGATAAAAGAAGGTATTTTTATTGCTGGTTATTCTCTGGGTACTATAATCTGGAAGGCTGGAAGGCTGGAAGGCCCGCCCAGTTACT
TTCATCAGCCCACTAATTCGTCCAAGAAATTGGTAAGTTTTCACTGGGTGTTCTTAATTTGGGCCGAATATCAAAGTTTCATCGAAGAATAGAGGCACCGGTTGATCTCT
GATGGCGGAAACACTGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCCAACTGGCCACCCATTAAACCCAAGCAGA
ATCTTCAAGTTAACCTCCTGAAAGACAATGATCTTTTCACCGTTCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTCTT
CATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGACAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAA
TCTATTTGCTGATATTAAAATTCGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTCGGTCAGCGCTTTGGACGTCATATTGATGAAA
GTGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAATGATACGAACAATTCCAAAGATCCTTCT
TCTGACACTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCTAGGAATGGTGTTATCGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAA
GTGTTTGTTGCATGAAGCTCGCAACGTTAGGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGATTAGTTCAGGTTGTTGGTGACCTCAACGGAAAAA
AAAAAAAGAAATTGAATTATCTACACATGACCAAGAACGTGGTAGTGATGATTTGGTTGTTTATACGCCTCATGGTGTCGTGTCAACTGCAAAACAATCACTTAGGTAGC
CTCCTTATTTGATTGAATGACCCCATCGTATCTTGGAAAGGAGCTCGTTCAATGTCGATTTTTTTTCACGTCCAGGTTGGTTCCTGGAATAAGCTATAGAATTCAGTGTA
TCATTTGTTGAAATGAAGCTGTTCATAATCTTGTCAGTTGTAATACTAAGTGGATGGTTGAATTATATCATTATATGACATTCTACTGCATATGATGCCACTCTGATATA
TGTACCCTCGTCACTTTTCTGCCTAAAGGAATTTACTTTTCTAAATTTGATGTTTCATTTCATATGTCAGGCTTGTTATCAAAGTTGTCATTTTTTTGTGGCCTTTGAAT
ATATTGGGAAACTATGCTATTTATTACACATCTATTAGAGTCA
Protein sequenceShow/hide protein sequence
MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDK
CLLHEARNVRKGVKYVFRSDVIFS