; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002671 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002671
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionFe2OG dioxygenase domain-containing protein
Genome locationscaffold6:1842050..1844772
RNA-Seq ExpressionSpg002671
SyntenySpg002671
Gene Ontology termsGO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]8.8e-12591.36Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDSKKT P++ WPAIKPKQNL I RLK+NDLFTVP+F T+VESK FI  AESMGFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDKLF DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata]3.3e-12491.94Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLK+NDLFTVP+FFT+VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022968459.1 uncharacterized protein LOC111467697 [Cucurbita maxima]8.2e-12391.13Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLK+NDLFTVP+FF +VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_023541473.1 uncharacterized protein LOC111801646 [Cucurbita pepo subsp. pepo]1.3e-12391.53Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAE QG+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLK+NDLFTVP+FFT+VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]4.4e-12491.8Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGD KKT PSSKWPAIKPKQNL I+RLKDNDLFTVP+FF+ VESK FIK AES+GFVHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGLD LF+DIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        VVAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

TrEMBL top hitse value%identityAlignment
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein2.6e-12290.16Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGD KKT PSS WP IKPKQNL +  LKDNDLFTVP+FFT VESKAFIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGLD LF DIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSS+ LVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC1034840332.6e-12289.75Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD  KT PSS WP IKPKQNL +  LK NDLFTVP+FFT VESKAFIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        D IWRSGL+ LF DIKIRGK+AVGLNPNIR YRY VGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        V+AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC1110113964.2e-12591.36Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDSKKT P++ WPAIKPKQNL I RLK+NDLFTVP+F T+VESK FI  AESMGFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG
        DTIWRSGLDKLF DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFYGSRNG
Subjt:  DTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNG

Query:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        VVAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC1114494131.6e-12491.94Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLK+NDLFTVP+FFT+VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC1114676974.0e-12391.13Show/hide
Query:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGDSKKTA    PSS+WPAIKPKQ+L I RLK+NDLFTVP+FF +VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDSKKTA----PSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG
        PDLADTIWRSGLDK F DIKIRGKVAVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFYG
Subjt:  PDLADTIWRSGLDKLFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYG

Query:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN VVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGVVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.4e-9073.3Show/hide
Query:  SSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFTDIKIRGKVAVGL
        S KWP IK K NL ++ LK++DLFTV    T+ ESKAF+K AES+GF HQGS GP  GEAYRDN RISVNDP LADT+W+SGL  LFTDIKIR KVAVGL
Subjt:  SSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFTDIKIRGKVAVGL

Query:  NPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KSKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLL
        NPNIRFYRY+ GQ FGRHIDES DL  G RTYYTLLIYLSG S KSK+K+ ++ + D SS EPLVGGETVFYGSRN +VAEVAP EGMAL H+HGDKC+L
Subjt:  NPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGS-KSKTKNDTNNSKDPSS-EPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTCGAAGAAAACAGCCCCATCTTCAAAGTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCTGATCACCCGCCTCAAAGATAATGATCTTTTCACCGTGCCAACTTTTTTCACAAATGTTGAGTCAAAAGCATTCATCAAGACTGCAGAGTCGATGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCTACAAAAGGAGAAGCTTATAGAGATAATGATCGAATTTCGGTGAATGATCCTGATTTAGCAGACACCATTTGGCGTTCGGGACTTGATAAA
CTGTTTACTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCGAATATCAGATTTTACAGATACAACGTTGGTCAGCGCTTTGGACGCCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTATTAATATATTTAAGCGGAGGTTCCAAAAGCAAAACAAAAAATGATACCAATAATTCCAAAGATCCTTCTT
CTGAGCCTCTAGTTGGAGGGGAGACTGTTTTCTATGGTTCAAGGAATGGCGTTGTGGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCATCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAATGTTACGAAGGGTGTCAAATACGTTTTCCGTTCAGACGTCATATTTTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAACACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGGGATTCGAAGAAAACAGCCCCATCTTCAAAGTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCTGATCACCCGCCTCAAAGATAATGATCTTTTCACCGTGCCAACTTTTTTCACAAATGTTGAGTCAAAAGCATTCATCAAGACTGCAGAGTCGATGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCTACAAAAGGAGAAGCTTATAGAGATAATGATCGAATTTCGGTGAATGATCCTGATTTAGCAGACACCATTTGGCGTTCGGGACTTGATAAA
CTGTTTACTGATATTAAAATACGGGGAAAAGTTGCTGTTGGGTTGAATCCGAATATCAGATTTTACAGATACAACGTTGGTCAGCGCTTTGGACGCCATATTGATGAAAG
TGTTGATCTTGGAGGAGGCAAGCGCACATATTATACTTTATTAATATATTTAAGCGGAGGTTCCAAAAGCAAAACAAAAAATGATACCAATAATTCCAAAGATCCTTCTT
CTGAGCCTCTAGTTGGAGGGGAGACTGTTTTCTATGGTTCAAGGAATGGCGTTGTGGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCATCTTCATGGGGACAAG
TGTTTGTTGCATGAAGCTCGCAATGTTACGAAGGGTGTCAAATACGTTTTCCGTTCAGACGTCATATTTTCTTGA
Protein sequenceShow/hide protein sequence
MAETQGKKRKMAGRRGEGDSKKTAPSSKWPAIKPKQNLLITRLKDNDLFTVPTFFTNVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDK
LFTDIKIRGKVAVGLNPNIRFYRYNVGQRFGRHIDESVDLGGGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYGSRNGVVAEVAPTEGMALLHLHGDK
CLLHEARNVTKGVKYVFRSDVIFS