; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0005611 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0005611
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationchr06:6817753..6821870
RNA-Seq ExpressionPI0005611
SyntenyPI0005611
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]7.9e-12695.3Show/hide
Query:  MAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSSNWP IKPKQNLQVN LK NDLFTVPSFFTCVESKAFIKAAE LGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNGVIAEVTPTEG
        LF DIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDP SEPLVGGETVFYGSRNGVIAEV PTEG
Subjt:  LFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNGVIAEVTPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]3.1e-13095.08Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSSNWP IKPKQNLQVN LK+NDLFTVPSFFTCVESKAFIKAAE LGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        DIIWRSGLDNLF DIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDP S+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VIAEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]1.4e-13095.08Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSSNWP IKPKQNLQVN LK NDLFTVPSFFTCVESKAFIKAAE LGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        DIIWRSGL+NLF DIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDP SEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VIAEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]1.0e-12087.65Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD KKT P++NWPAIKPKQNLQ+ RLKENDLFTVPSF T VESK FI  AE +GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        D IWRSGLD LF DIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK P SEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        V+AEV PTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]1.2e-12993.44Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGDFKKTDPSS WPAIKPKQNLQ++RLK+NDLFTVPSFF+CVESK FIKAAE LGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLLIYLSGGSKNKTKNDTNNSKDP SEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFS
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein1.5e-13095.08Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSSNWP IKPKQNLQVN LK+NDLFTVPSFFTCVESKAFIKAAE LGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        DIIWRSGLDNLF DIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDP S+ LVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VIAEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840336.8e-13195.08Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSSNWP IKPKQNLQVN LK NDLFTVPSFFTCVESKAFIKAAE LGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        DIIWRSGL+NLF DIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDP SEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VIAEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 13.8e-12695.3Show/hide
Query:  MAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSSNWP IKPKQNLQVN LK NDLFTVPSFFTCVESKAFIKAAE LGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNGVIAEVTPTEG
        LF DIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDP SEPLVGGETVFYGSRNGVIAEV PTEG
Subjt:  LFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNGVIAEVTPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113964.9e-12187.65Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD KKT P++NWPAIKPKQNLQ+ RLKENDLFTVPSF T VESK FI  AE +GF+HQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG
        D IWRSGLD LF DIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK P SEPLVGGETVFYGSRNG
Subjt:  DIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNG

Query:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        V+AEV PTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  VIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494131.6e-11987.5Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS WPAIKPKQ+LQ+NRLKENDLFTVPSFFT VESKAFIK AE +GF+HQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYG
        PDLAD IWRSGLD  F DIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTNNSKDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYG

Query:  SRNGVIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN V+AEV PTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGVIAEVTPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.6e-8771.04Show/hide
Query:  SSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFTDIKIRGKVAVGL
        S  WP IK K NL V+ LK +DLFTV +  T  ESKAF+K AE LGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLFTDIKIR KVAVGL
Subjt:  SSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFTDIKIRGKVAVGL

Query:  NPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPFS-EPLVGGETVFYGSRNGVIAEVTPTEGMALLHLHGDKCLL
        NPNIR YRY  GQ FGRHIDES DL  G RTYYTLLIYLSG S K+K+K+ ++ + D  S EPLVGGETVFYGSRN ++AEV P EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KNKTKNDTNNSKDPFS-EPLVGGETVFYGSRNGVIAEVTPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCAAACTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCAAGTCAACCGCCTGAAAGAAAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTTATTGGGTTTTCTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGATAACGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAAT
CTATTTACTGATATTAAAATTCGGGGAAAGGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTCGGTCAGTGCTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTGTTAATATACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAATGATACGAACAATTCCAAAGATCCTTTTT
CTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATGGTGTTATTGCTGAGGTGACTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAACGTTACGAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCATGA
mRNA sequenceShow/hide mRNA sequence
TTTGGTTATTCATTGGGTGCTGTAATCTCGAAGGCCCGCCCAATTAAGGCAGCCAACCAAACTTTCATCAGCCCACTAATTCGTCCAAGAAATTGGGAAATTTTCACTGG
GTGTTCTTAATTTGGGCCGAATATCAAAGTTTCATCGAAGAATAGAGGCACTGGTTGATTTCTGATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAG
GGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCAAACTGGCCAGCCATTAAACCCAAGCAGAATCTTCAAGTCAACCGCCTGAAAGAAAATGATCTTTTCACCGTGCCA
AGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTTATTGGGTTTTCTTCATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTATAGAGATAA
CGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCCGGGCTTGATAATCTATTTACTGATATTAAAATTCGGGGAAAGGTTGCTGTTGGGTTGA
ATCCAAATATCAGATTATACAGATACAAGGTCGGTCAGTGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTGTTAATA
TACTTAAGTGGAGGTTCCAAAAACAAAACAAAAAATGATACGAACAATTCCAAAGATCCTTTTTCTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAA
TGGTGTTATTGCTGAGGTGACTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAACGTTACGAAGGGAGTCAAATATG
TTTTCCGTTCAGATGTCATATTCTCATGATTAGTTAAGGTTGTTGGTGACCTCAACGGATAGGAAAAAATGAAATTGAATTATCTACACATGACCAAGAATCTGGTATTG
ATGATTTGGTTGTTTATACGCCTCCTCTTGTCGTGACAACTGCAAAACAATCACTTAGGTTGGTTCCTGGAATAAGCTATAGAATTCAGTGTATCATTTGTTGAAATGAA
GCCGTTCATAATCTTGTAAGTTGTAATACTAAGTGGCTGGTTGAATTATATCATTATATGACATTCTACTGCATATGATGCCACTCTGATATATGTACCCTCGTCACTTT
TCTGCCTAAGGAATTTACTTTCCTAAATTTGATGTTTCATTTCTTATGTTAGGCTTGTTACTCAAAGTTGCCTTCTTTTGGTGGCCTTTGAATACATTGGAAAACTATGC
CATTTGTTGCTCTTCTATTTGAGTTACTTTTGTCTTGATGACTCCAAATGACTGCTCGTGAGAATGATTCGCTTCTACACCTAACTAATCGGCTGTTGTTTTGATAAAGT
ATGAGATGATAAGATTTTCTCGTAATTCATACATTTGCTTGTTATACTAAAAGTCCTAAGTGATAAGTTTCTGACAATTTTTGTTGTCCAATAACATAATTAGAAAGTAT
GTTGATGGATAGAATAATTTAGTTTATTTCTTGTTTTATGTTCGCTTTTCATACCTAGGTTGCTTTATCTAAAGTTTTCGCTTAAAATTGAAAAATAAAATAATTTGGTC
CCTTTGAACTTTGAAAGGCCTTTGTTTTACTCTAAATTTACACAGAATTAGTTTATGGTCTTGTTGTCAGTTTTTAATTAATTGTTAGCATTAATGTAAATTCGTTTCGA
AACATGACAAATTTGTCATGTACATGTCACTTTAATGCAAAGTTATAGCTAAATGTCGAGAAAATTATTGAACGTGAATGTATCAAATTCCACAATAGCAGAGCAGCAAC
TTTGGCCGAGTGGTTGCTCCCTTGATGGTAGAGCAACGAGTTCAATAGGTATCAATTCCAACTCCTTCAAGTCAAAATTCTGTTACATAGCTGCCATGGATCCTCAAACG
ATTAGAGAAATGATTGTTCTATGTGAAACCATAATTGATAGTTCTTCAATTATAGAAACATGAGGGGAAGGGATTATCATGAATCCTATTGCATTTGCCATCCCCTTGCC
ATTTGCAGTAGTTGTTTCTCAACAAGCTGCCAGTTTCTTCATTACATAACACCAACCTAATATTAACATTCGAACCACATCTGTTGTATCTAAATTTTGCCTTTTTAATC
CTTTTGAAGGATTGACTTTCATGGTTGATGGCATTTTAAGGGACACAAGAACCACTATTCAATTTGGTCAAACTTGGAAGGATCATACAGCTCCTCTTGTTGCTGAGTTT
TCCTCTAGAAGCTCCCAACAAATCAATTCAGACATGCTCAAGTTATGCCTTGAAACCTTATTTTGAGTGTAATTTACATGTACTGCATAAAATAAAGCTAAACCAGGTTT
GGTTTTAGCTGACAATGGCGGGAAAGAGGGATTCTTCACAGTAAGGATTACAAGTTCAGAGACATTGAAATCCTCCACTGCGGTCCAATAAGTTCTACTACAAATTCAAT
ATGACTAACTATGTTTATGCTATCGAGGTTAACTTCTTTATATGCCATATGAGGATTTTACTGACATGGTTAAGTTAGGAGCATGACTTTTATACCACTTGAAAGAGATA
ACATAACCCCTACATATCTTATAATCTTGGTCGTACAAAAAATATCAGGGTTCTATCACTTGGAAAGAGATGTCCATGAGATTAGAAGTCCAGTTGTGGTTTACACAACA
CTTCTACCATCCTCTTCAACAAAGAGTCCACTCAACTTGAGCAATGGATACCAGATTTGATGTTTGAAAAAAAAAAGCATTTTCGGCAGAAACTAAAGGTAGAAAATATC
TCTCTGATACTGTTTTATCTCTTAAGAAAATAACAAAGAATTATGATGCAGATGAAGATGGTTTTTTGTTCGAGGCACTATTGTTTAGTGAGCT
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDFKKTDPSSNWPAIKPKQNLQVNRLKENDLFTVPSFFTCVESKAFIKAAELLGFLHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
LFTDIKIRGKVAVGLNPNIRLYRYKVGQCFGRHIDESVDLGGGKRTYYTLLIYLSGGSKNKTKNDTNNSKDPFSEPLVGGETVFYGSRNGVIAEVTPTEGMALLHLHGDK
CLLHEARNVTKGVKYVFRSDVIFS