; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G090440 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G090440
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationCicolChr05:8146184..8150896
RNA-Seq ExpressionCcUC05G090440
SyntenyCcUC05G090440
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]1.8e-12292.31Show/hide
Query:  MAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGE DF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNSVVAEVPPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEV PTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNSVVAEVPPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]4.2e-12792.62Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGE DFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS

Query:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]4.2e-12792.21Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGE DF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS

Query:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]8.5e-12088.31Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR E D KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEEDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYG

Query:  SRNSVVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEV PTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]2.8e-13195.1Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR E DFKKTDPSSKWPAIKPKQNLQI+RLKDNDLFTVPSFF+CVESK FIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS

Query:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS
        VVAEV PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFSS
Subjt:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein2.0e-12792.62Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGE DFKKTDPSS WP IKPKQNLQ+  LKDNDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP S+ LVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS

Query:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840332.0e-12792.21Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGE DF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNS

Query:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEV PTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 18.9e-12392.31Show/hide
Query:  MAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGE DF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNSVVAEVPPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GKRTYYTLL+YLSGGSKNKTKNDTNN KDP SEPLVGGETVFYGSRN V+AEV PTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNSVVAEVPPTEG

Query:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1FZS7 uncharacterized protein LOC1114494134.1e-12088.31Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR E D KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFFT VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEEDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYG

Query:  SRNSVVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEV PTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676975.4e-12088.31Show/hide
Query:  MAETQGKKRKMAGRRGEEDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR E D KKT     PSS+WPAIKPKQ+LQI RLK+NDLFTVPSFF  VESKAFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEEDFKKT----DPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRYKVGQRFGRHIDESVDLGEGKRTYYTLL+YLSGGSK KTKNDTNN KDP SEPLVGGETVFYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYG

Query:  SRNSVVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEV PTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSVVAEVPPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.9e-8971.49Show/hide
Query:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S KWP IK K NL ++ LK++DLFTV +  T  ESKAF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPCS-EPLVGGETVFYGSRNSVVAEVPPTEGMALLHLHGDKCLL
        NPNIR YRY  GQ FGRHIDES DL +G RTYYTLL+YLSG S K+K+K+ ++   D  S EPLVGGETVFYGSRNS+VAEV P EGMAL H+HGDKC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGS-KNKTKNDTNNFKDPCS-EPLVGGETVFYGSRNSVVAEVPPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGAGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAG
CAGAATCTTCAAATCACCCGCCTGAAAGATAATGATCTTTTCACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTG
GGTTTCGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGGGAGGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGT
TCAGGGCTTGATAACCTATTTGCTGATATTAAAATCCGTGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCGCTTT
GGACGTCATATTGATGAAAGTGTTGATCTTGGGGAAGGCAAGCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAATGAT
ACCAACAATTTCAAAGATCCTTGTTCTGAGCCTCTGGTTGGAGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGCCTCCTACTGAAGGG
ATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCATGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCTTCA
TGA
mRNA sequenceShow/hide mRNA sequence
TTTTTATTCATTTGATGCTATTACTTCTATAATCTCGAAGGCCCAATTATGGCAGCCGACTAAACTCTCGTAAGCCCACTTATTCGTCCACGAAATTGGGAAAAT
TTCAATGGATGTCCTTAATTTGGGCCGAACATCGTCGAGAAATGGAGGCACTGGTTGATCTCTGATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAG
AAGAGGGGAGGAGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAGCCATTAAACCCAAGCAGAATCTTCAAATCACCCGCCTGAAAGATAATGATCTTTT
CACCGTGCCAAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTCGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGGGA
GGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCAGGGCTTGATAACCTATTTGCTGATATTAAAATCCGTGG
AAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGGGAAGGCAA
GCGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAACGAAAAATGATACCAACAATTTCAAAGATCCTTGTTCTGAGCCTCTGGTTGG
AGGGGAAACTGTTTTCTATGGTTCAAGGAATAGTGTTGTTGCCGAGGTGCCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGGACAAGTGTTTGTTGCA
TGAAGCTCGCAATGTTACAAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCTTCATGATTAGTTCAGGTTGTCGTCCTCCACAAAAAATAAATGAA
ATTGAATTATCTACAAGTGATCAAGAATGTGCTATTGATGATTTGGTTGTTTATAGGCCTCTTAATGTTGTGACAACTGCAAAACGATCACTCAGGTTGGTTCCT
GGAATAAGCTATAGAGTTCAGTGTATTATTTGTTGAAATGAAGCAGTTCATAATCTTTTATAGTTGGAGCTATCAAATTTGACATTGGCGTAGCAACAACAGGAG
TCCCAAGTATAATGTTTTTGTCTGGAAATTCAACGAAGATGACATTTTAGTTCTGATTAGTTAAAACCTGCAAAAGCAGTTGATCCCCGGTTGATTCAACATATC
AGAATCAGATTACTTCAACTTTCTTTGCTAACAAGGCTACAACGGTACAACACTCCAGTTACTGATGGGAGTTCGCAATAGTTCTACACATGGAAGAGGATGGAC
CTCACCAATTTCAGCTTACCATGTTGAAGTTTTATAAACCTGAAGCTATTAAGGGTCAAAGTTGAGTCAACCGTTCTTTCATTCTCCACAATGCTGGGACGAGAG
ATACTTCACAGCAAGGATTACAAGCCCAAAGACATCGAAATCCTCCACTGCTGGTCCGATAAGCTCAACCACAAATTTACTATGACTAACTATGATTATGTTTCC
ATAACTTCTTTATTTGACGTTGGGGGACTTTACTGACACTGTTAAGTTAGGAACATGACTTTGATACAATTTGAAAGGAATAACATAACTCCTATATATCTTATG
ATGTTGATCGCATTTTCAACAAATATCTTTTCAAGTGCAGAGCATATCAAGGATTATCAGTTGGATAGATGGTGTCCATGAGATGAGAAGCCCAGTTGTGGTTTA
CACAACACTTCGACCTGTCTTCTTCAACAACTATTGGGAGAGTCTACTCACTCAGAGCAATCAATACTAGAGGACTATTTGATGTTTGAACACAAAGGCATTTTT
GGTAGAAACCAAGGGTAGAAAATATGTTTACCTGATAAAAAGAAGTATGATATAAAAGAAGATGGTTATTGTTAGAGGTGGTATTGTTTAGTGAACTTGTTCTTA
AATTCT
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEEDFKKTDPSSKWPAIKPKQNLQITRLKDNDLFTVPSFFTCVESKAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWR
SGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKRTYYTLLVYLSGGSKNKTKNDTNNFKDPCSEPLVGGETVFYGSRNSVVAEVPPTEG
MALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFSS