; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G09750 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G09750
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationClcChr05:7884301..7889116
RNA-Seq ExpressionClc05G09750
SyntenyClc05G09750
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]5.3e-12292.31Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVND DLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]1.2e-12692.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVND DLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]1.2e-12692.21Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVND DLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]2.5e-11988.31Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  SDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
         DLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  SDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]1.8e-13094.69Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA
        MAETQGKKRKMAGRR EGDFKKTDPSSKWPAIKPKQNLQI+RLKDNDLFTVPSFF+CVESK FIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND DLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS
        VVAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFSS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein5.9e-12792.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVND DLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840335.9e-12792.21Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVND DLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 12.6e-12292.31Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVND DLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1FZS7 uncharacterized protein LOC1114494131.2e-11988.31Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  SDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
         DLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  SDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676971.6e-11988.31Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFF  VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  SDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
         DLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRYKVGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  SDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.9e-8971.95Show/hide
Query:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWRSGLDNLFADIKIRGKVAVGL
        S KWP IK K NL ++ LK++DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVND  LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPYS-EPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQHFGRHIDES DL +G RTYYTLL+YLSG S K+K+K+ ++   D  S EPLVGGETVFYGSRNS+VAEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPYS-EPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAG
CAGAATCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTG
GGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGTGAATGATTCTGATTTAGCAGACATCATTTGGCGT
TCAGGGCTTGATAACCTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCACTTT
GGACGTCATATTGATGAAAGTGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAATGAT
ACCAACAATTTCAAAGATCCTTATTCTGAACCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGGCTCCTACTGAAGGG
ATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTTCGTTCAGATGTCATATTCTCTTCA
TGA
mRNA sequenceShow/hide mRNA sequence
ATCCATAAAGATAAAAATAAAATAAAATAATAAAAAAAAGGGCATTTTTATTCATTTGATGCTATTACTGCTATAATCTCGAAGGCCCAATTAAGGCAGCCGACC
AAACTCTCGTAAGCCCACTTATTCGTCCACGAAATTGGGAAAATTTCAATGGATGTCCTTAATTTGGGCCGAACATCGTCGAGAAATGGAGGCGCTGGTTGATCT
CTGATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCC
AAGCAGAATCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCA
TTGGGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGTGAATGATTCTGATTTAGCAGACATCATTTGG
CGTTCAGGGCTTGATAACCTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCAC
TTTGGACGTCATATTGATGAAAGTGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAAT
GATACCAACAATTTCAAAGATCCTTATTCTGAACCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGGCTCCTACTGAA
GGGATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTTCGTTCAGATGTCATATTCTCT
TCATGATTAGTTCAGGTTGTCGTCCTCCACAAAAAATAAATGAAATTGAATTATCTACAAGTGATCAAGAATGTGCTATTGATGATTTGGTTGTTTGTAGGCCTC
TTGATGTTGTGACAACTGCAAAACGATCACTCAGAGCATATCGAGGATTATCACTTGGATAGATGGTGTCCATGAGATGAGAAGCCCAGTTGTGGTTTACACAAC
ACTTCGACCTGTCTTCTTCAACAACTATTGGGAGAGTCTACTCAGTCAGAGCAATCAATATTAGAGGACTATTTGATGTTTGAACACAAAGGCATCTTTGGTAGA
AACCAAGGGTAGAAAATATGTTTACCTGATAAAAAGAAGTATGATATAAAAGAAGATGGTTATTGTTAGAGGTGCTATTGTTTAGTGAACTTGTTCTTAAATTTT
TCTGTTACTTTGATTTATAATATTTGACACACTGTATATAAAAACTTTCTTAATGGAAGAGACATCTTTCTCACTTTATTTCAAACAGTAATGATTCAAC
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDSDLADIIWR
SGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS