; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC05G089140 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC05G089140
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationCmU531Chr05:7936937..7938857
RNA-Seq ExpressionCmUC05G089140
SyntenyCmUC05G089140
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]4.8e-12392.74Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]8.5e-12893.03Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]8.5e-12892.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]1.7e-12088.71Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]1.3e-13195.1Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGDFKKTDPSSKWPAIKPKQNLQI+RLKDNDLFTVPSFF+CVESK FIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS
        VVAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFSS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein4.1e-12893.03Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840334.1e-12892.62Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNS

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 12.3e-12392.74Show/hide
Query:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQ FGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1FZS7 uncharacterized protein LOC1114494138.3e-12188.71Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676971.1e-12088.71Show/hide
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFF  VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRYKVGQ FGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYG

Query:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.4e-9072.4Show/hide
Query:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S KWP IK K NL ++ LK++DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPYS-EPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDKCLL
        NPNIR YRY  GQHFGRHIDES DL +G RTYYTLL+YLSG S K+K+K+ ++   D  S EPLVGGETVFYGSRNS+VAEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPYS-EPLVGGETVFYGSRNSVVAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAG
CAGAATCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTG
GGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGT
TCAGGGCTTGATAACCTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCACTTT
GGACGTCATATTGATGAAAGTGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAATGAT
ACCAACAATTTCAAAGATCCTTATTCTGAACCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGGCTCCTACTGAAGGG
ATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTTCGTTCAGATGTCATATTCTCTTCA
TGA
mRNA sequenceShow/hide mRNA sequence
ATCGTCGAGAAATGGAGGCGCTGGTTGATCTCTGATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTTCAAGAAAACAGA
CCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAGCAGAATCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGA
GTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGT
GAATGATCCTGATTTAGCAGACATCATTTGGCGTTCAGGGCTTGATAACCTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATAT
CAGATTATACAGATACAAGGTTGGTCAGCACTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTT
AAGCGGAGGTTCCAAGAACAAAACGAAAAATGATACCAACAATTTCAAAGATCCTTATTCTGAACCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAA
TAGTGTTGTTGCCGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAA
ATATGTTTTTCGTTCAGATGTCATATTCTCTTCATGA
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWR
SGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQHFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPYSEPLVGGETVFYGSRNSVVAEVAPTEG
MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS