CuGenDBv2

CSPI03G16460 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G16460
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: Chr3:12372888..12375317
RNA-Seq Expression: CSPI03G16460
Synteny: CSPI03G16460
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa] (e value: 5.9e-129; % identity: 97.44)
Query:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus] (e value: 9.1e-138; % identity: 100)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo] (e value: 3.9e-133; % identity: 96.72)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia] (e value: 5.5e-119; % identity: 86.83)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGD KKT P++NWP IKPKQNLQ+  LK+NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida] (e value: 6.5e-128; % identity: 92.62)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRR EGDFKKTDPSS WP IKPKQNLQ++ LKDNDLFTVPSFF+CVESK FIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        V+AEV+PTEGMALLHLHGDKCLLHEARNV +GVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein (e value: 4.4e-138; % identity: 100)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC103484033 (e value: 1.9e-133; % identity: 96.72)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        VIAEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 (e value: 2.9e-129; % identity: 97.44)
Query:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSSNWPPIKPKQNLQVNLLK NDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSS+ LVGGETVFYGSRNGVIAEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC111011396 (e value: 2.7e-119; % identity: 86.83)
Query:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGD KKT P++NWP IKPKQNLQ+  LK+NDLFTVPSF T VESK FI  AES+GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSS+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNG

Query:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KG+KY+FRSDV F
Subjt:  VIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC111449413 (e value: 6.6e-118; % identity: 86.69)
Query:  MAETLGKKRKMAGRRGEGDFKKT----DPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND
        MAET G+KRKMAGRR EGD KKT     PSS WP IKPKQ+LQ+N LK+NDLFTVPSFFT VESKAFIK AES+GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETLGKKRKMAGRRGEGDFKKT----DPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDPSS+ LVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYG

Query:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS
         RN V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  SRNGVIAEVAPTEGMALLHLHGDKCLLHEARNVRKGVKYVFRSDVIFS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 3.5e-87; % identity: 71.04)
Query:  SSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S  WP IK K NL V+ LK++DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-DTLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQ FGRHIDES DL  G RTYYTLLIYLSG S K+K+K+ ++ + D SS + LVGGETVFYGSRN ++AEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPSS-DTLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVRKGVKYVFRSDVIFS
        HE RNV KGVKYVFRSDV+F+
Subjt:  HEARNVRKGVKYVFRSDVIFS
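
The % identity values listed for each hit can be related to the alignment blocks by counting identical positions on the middle line. The sketch below is only an illustration and not the database's own calculation. It assumes BLAST-style midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `midline_identity` is hypothetical. The reported values cover the whole alignment, so a single block will not reproduce them.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, aligned_length, percent_identity) for one alignment block.

    Assumes a BLAST-style midline: a letter equal to the query residue marks an
    identity; '+' and ' ' mark non-identical positions.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Count positions where the midline repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    length = len(query)
    return identities, length, 100.0 * identities / length


# Last block of the AT5G51880.1 alignment shown above.
query = "HEARNVRKGVKYVFRSDVIFS"
midline = "HE RNV KGVKYVFRSDV+F+"
ident, length, pct = midline_identity(query, midline)
print(f"{ident}/{length} identical ({pct:.2f}%)")
```

To estimate identity for a whole hit under the same assumptions, the identity counts and lengths of all blocks would be summed before dividing.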


Sequences
CDS sequence
ATGGCGGAAACACTGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCCAACTGGCCACCCATTAAACCCAAGCAGAA
TCTTCAAGTTAACCTCCTGAAAGACAATGATCTTTTCACCGTTCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTCTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGACAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAAT
CTATTTGCTGATATTAAAATTCGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTCGGTCAGCGCTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAATGATACGAACAATTCCAAAGATCCTTCTT
CTGACACTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCTAGGAATGGTGTTATCGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAACGTTAGGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGA
mRNA sequence
CTAATTCGTCCAAGAAATTGGTAAGTTTTCACTGGGTGTTCTTAATTTGGGCCGAATATCAAAGTTTCATCGAAGAATAGAGGCACCGGTTGATCTCTGATGGCGGAAAC
ACTGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCCAACTGGCCACCCATTAAACCCAAGCAGAATCTTCAAGTTA
ACCTCCTGAAAGACAATGATCTTTTCACCGTTCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTCTTCATCAGGGGAGC
CTTGGTCCTACTAAAGGAGAAGCTTATAGAGACAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAATCTATTTGCTGA
TATTAAAATTCGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTCGGTCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTG
GAGGAGGCAAGCGCACATATTATACTTTGTTGATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAATGATACGAACAATTCCAAAGATCCTTCTTCTGACACTCTG
GTTGGAGGGGAAACTGTTTTCTATGGTTCTAGGAATGGTGTTATCGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCA
TGAAGCTCGCAACGTTAGGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGATTAGTTCAGGTTGTTGGTGACCTCAACGGAAAAAAAAAAAAGAAAT
TGAATTATCTACACATGACCAAGAACGTGGTAGTGATGATTTGGTTGTTTATACGCCTCATGGTGTCGTGTCAACTGCAAAACAATCACTTAGGTAGCCTCCTTATTTGA
TTGAATGACCCCATCGTATCTTGGAAAGGAGCTCGTTCAATGTCGATTTTTTTTCACGTCCAGGTTGGTTCCTGGAATAAGCTATAGAATTCAGTGTATCATTTGTTGAA
ATGAAGCTGTTCATAATCTTGTCAGTTGTAATACTAAGTGGATGGTTGAATTATATCATTATATGACATTCTACTGCATATGATGCCACTCTGATATATGTACCCTCGTC
ACTTTTCTGCCTAAAGGAATTTACTTTTCTAAATTTGATGTTTCATTTCATATGTCAGGCTTGTTATCAAAGTTGTCATTTTTTTGTGGCCTTTGAATATATTGGGAAAC
TATGCTATTTATTACACATCTATTAGAGTCACTTTTGTCTTGATGACTCCAAATTACTGCCCG
Protein sequence
MAETLGKKRKMAGRRGEGDFKKTDPSSNWPPIKPKQNLQVNLLKDNDLFTVPSFFTCVESKAFIKAAESLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPSSDTLVGGETVFYGSRNGVIAEVAPTEGMALLHLHGDK
CLLHEARNVRKGVKYVFRSDVIFS
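
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The helper name `translate` is hypothetical, and only the first 60 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in T, C, A, G order, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the CDS sequence listed above.
cds_start = "ATGGCGGAAACACTGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTC"
print(translate(cds_start))
```

Applied to the full CDS with line breaks removed, the same function should stop at the terminal TGA codon, and its output can then be compared with the protein sequence above.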