; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G007720 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G007720
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationCG_Chr05:8301325..8303869
RNA-Seq ExpressionClCG05G007720
SyntenyClCG05G007720
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]4.8e-12392.74Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]8.5e-12893.03Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]8.5e-12892.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]1.7e-12088.71Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]1.3e-13195.1Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGDFKKTDPSSKWPAIKPKQNLQI+RLKDNDLFTVPSFF+CVESK FIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS
        VVAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFSS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein4.1e-12893.03Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840334.1e-12892.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 12.3e-12392.74Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1FZS7 uncharacterized protein LOC1114494138.3e-12188.71Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676971.1e-12088.71Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFF  VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRYKVGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.4e-9072.4Show/hide
Query:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S KWP IK K NL ++ LK++DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPYS-EPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQHFGRHIDES DL +G RTYYTLL+YLSG S K+K+K+ ++   D  S EPLVGGETVFYGSRNS+VAEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPYS-EPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCAGGGCTTGATAAC
CTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCACTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAATGATACCAACAATTTCAAAGATCCTTATT
CTGAACCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTTCGTTCAGATGTCATATTCTCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCAGGGCTTGATAAC
CTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCACTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAATGATACCAACAATTTCAAAGATCCTTATT
CTGAACCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTTCGTTCAGATGTCATATTCTCTTCATGATTAGTTCAGGTTGTCGTCCTCCACAAAAAATA
AATGAAATTGAATTATCTACAAGTGATCAAGAATGTGCTATTGATGATTTGGTTGTTTGTAGGCCTCTTGATGTTGTGACAACTGCAAAACGATCACTCAGGTTGGTTCC
TAGAATAAGCTATAGAGTTCAGTGTATTATTTGTTGAAATGAAGCAGTTCATAATCTTGTAAGTCGTATTTACCAAGTGGCTGGTTGAATTATATGACATTCTACTGCAT
ATGATGCCATTCTGATATATTTACCCTCCTCACTGTTCTACCTAAGGAATTTATTTTCTAAATTTGATGTTTCATTTCTTATGTTAGGCAGGGCTTGTTACTCAAAGTTG
CCATCTTTTTGTGGCCTTTAAATATATTGGTAAACTATGCCATTTGTTA
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDK
CLLHEARNVTKGVKYVFRSDVIFSS