CuGenDBv2

Tan0016034 (gene) of Snake gourd v1 genome

Gene ID: Tan0016034
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: LG01:2629400..2635528
RNA-Seq Expression: Tan0016034
Synteny: Tan0016034
Gene Ontology terms:
GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006620 - Prolyl 4-hydroxylase, alpha subunit
IPR045054 - Prolyl 4-hydroxylase


Homology
GenBank top hits (e value, % identity, alignment)
XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia] (e value: 2.2e-123; identity: 90.53%)
Query:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAE+QGKKRKMAGRRGEGDSKKTTP + WPAIKPKQNLQI+RLKENDLFTVPSF T VESK FI  AESMGFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG
        DTIWRSGLDKLFADIK+RGK AVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFY SRNG
Subjt:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        +VAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

XP_022945055.1 uncharacterized protein LOC111449413 [Cucurbita moschata] (e value: 1.8e-122; identity: 90.73%)
Query:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAE+QG+KRKMAGRR EGDSKKT     P S+WPAIKPKQ+LQI RLKENDLFTVPSFFT VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA
        PDLADTIWRSGLDK FADIK+RGK AVGLNPNIRFYRY VGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY 
Subjt:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_022968459.1 uncharacterized protein LOC111467697 [Cucurbita maxima] (e value: 2.4e-122; identity: 90.73%)
Query:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAE+QG+KRKMAGRR EGDSKKT     P S+WPAIKPKQ+LQI RLKENDLFTVPSFF  VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA
        PDLADTIWRSGLDK FADIK+RGK AVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY 
Subjt:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_023541473.1 uncharacterized protein LOC111801646 [Cucurbita pepo subsp. pepo] (e value: 1.8e-122; identity: 90.73%)
Query:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAE+QG+KRKMAGRR EGDSKKT     P S+WPAIKPKQ+LQI RLKENDLFTVPSFFT VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA
        PDLADTIWRSGLDK FADIK+RGK AVGLNPNIRFYRY VGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY 
Subjt:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida] (e value: 1.5e-124; identity: 90.98%)
Query:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAE+QGKKRKMAGRR EGD KKT P SKWPAIKPKQNLQI RLK+NDLFTVPSFF+CVESK FIK AES+GFVHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG
        D IWRSGLD LF+DIK+RGK AVGLNPNIR YRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY SRNG
Subjt:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        +VAEV+PTEGMALLHLHGDKCLLHEARNVT+GVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein (e value: 1.7e-121; identity: 88.93%)
Query:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAE+ GKKRKMAGRRGEGD KKT P S WP IKPKQNLQ+  LK+NDLFTVPSFFTCVESKAFIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG
        D IWRSGLD LFADIK+RGK AVGLNPNIR YRYKVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSS+ LVGGETVFY SRNG
Subjt:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC103484033 (e value: 4.4e-122; identity: 89.34%)
Query:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAE+QGKKRKMAGRRGEGD  KT P S WP IKPKQNLQ+  LK NDLFTVPSFFTCVESKAFIK AES+GF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG
        D IWRSGL+ LFADIK+RGK AVGLNPNIR YRYKVGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY SRNG
Subjt:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHGDKCLLHEARNV KGVKYVFRSDVIFS
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC111011396 (e value: 1.0e-123; identity: 90.53%)
Query:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAE+QGKKRKMAGRRGEGDSKKTTP + WPAIKPKQNLQI+RLKENDLFTVPSF T VESK FI  AESMGFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG
        DTIWRSGLDKLFADIK+RGK AVGLNPNIRFYRY VGQRFGRHIDESVDLG GKRTYYTLLIYLSGGSK KTKNDTN+SK PSSEPLVGGETVFY SRNG
Subjt:  DTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNG

Query:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF
        +VAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F
Subjt:  LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC111449413 (e value: 8.8e-123; identity: 90.73%)
Query:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAE+QG+KRKMAGRR EGDSKKT     P S+WPAIKPKQ+LQI RLKENDLFTVPSFFT VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA
        PDLADTIWRSGLDK FADIK+RGK AVGLNPNIRFYRY VGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY 
Subjt:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1HX87 uncharacterized protein LOC111467697 (e value: 1.2e-122; identity: 90.73%)
Query:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND
        MAE+QG+KRKMAGRR EGDSKKT     P S+WPAIKPKQ+LQI RLKENDLFTVPSFF  VESKAFIKTAESMGFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAESQGKKRKMAGRRGEGDSKKTT----PFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA
        PDLADTIWRSGLDK FADIK+RGK AVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSK+KTKNDTNNSKDPSSEPLVGGETVFY 
Subjt:  PDLADTIWRSGLDKLFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYA

Query:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNGLVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein (e value: 7.0e-88; identity: 71.69%)
Query:  KWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFADIKMRGKFAVGLNP
        KWP IK K NL +  LK +DLFTV +  T  ESKAF+K AES+GF HQGS GP  GEAYRDN RISVNDP LADT+W+SGL  LF DIK+R K AVGLNP
Subjt:  KWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDKLFADIKMRGKFAVGLNP

Query:  NIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGS-KSKTKNDTNNSKDPSS-EPLVGGETVFYASRNGLVAEVAPTEGMALLHLHGDKCLLHE
        NIRFYRY  GQ FGRHIDES DL +G RTYYTLLIYLSG S KSK+K+ ++ + D SS EPLVGGETVFY SRN +VAEVAP EGMAL H+HGDKC+LHE
Subjt:  NIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGS-KSKTKNDTNNSKDPSS-EPLVGGETVFYASRNGLVAEVAPTEGMALLHLHGDKCLLHE

Query:  ARNVTKGVKYVFRSDVIFS
         RNV+KGVKYVFRSDV+F+
Subjt:  ARNVTKGVKYVFRSDVIFS
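
The % identity values reported for the hits can be related to the middle line of each alignment. The following is a minimal sketch, assuming the usual BLAST-style convention: a letter in the middle line marks an identical residue, "+" a conservative substitution, and a blank a mismatch, with identity taken as identical columns divided by alignment length. The helper name midline_identity is hypothetical and not part of CuGenDB. Under this convention the Momordica charantia alignment appears to give 220 identical columns out of 243, which agrees with the listed 90.53.

```python
def midline_identity(query, midline):
    """Return (identical, aligned_length, percent_identity) for one alignment block.

    Assumes BLAST-style midlines: letters mark identities, '+' and ' ' do not.
    """
    if not query:
        raise ValueError("query segment must not be empty")
    # Trailing blanks are often stripped from a midline, so pad it to the query length.
    padded = midline.ljust(len(query))[:len(query)]
    # Count only columns where the midline repeats the residue letter.
    identical = sum(1 for ch in padded if ch.isalpha())
    aligned = len(query)
    return identical, aligned, 100.0 * identical / aligned


# Last block of the Momordica charantia alignment shown above.
query = "LVAEVAPTEGMALLHLHGDKCLLHEARNVTKGVKYVFRSDVIF"
midline = "+VAEVAPTEGMALLHLHGDKCLLHEARNVTKG+KY+FRSDV F"
identical, aligned, pct = midline_identity(query, midline)
print(identical, aligned, round(pct, 2))
```

To reproduce a whole-hit value, concatenate the query segments and the middle-line segments of all blocks for that hit before calling the helper, taking care to preserve the blanks inside each middle line.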


Sequences
CDS sequence
ATGGCGGAATCACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGGAGGGAGATTCGAAGAAAACAACCCCATTTTCGAAGTGGCCAGCCATTAAACCCAAGCAGAA
TCTTCAGATCATCCGCCTGAAAGAAAATGATCTGTTCACCGTGCCTAGTTTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGACTGCAGAGTCGATGGGTTTTGTTC
ATCAGGGGAGCCTCGGTCCTACGAAAGGAGAAGCTTATAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCTGACACCATTTGGCGTTCGGGACTTGATAAA
CTATTTGCTGATATTAAAATGCGGGGAAAATTTGCTGTTGGGTTGAATCCAAATATCAGATTTTACAGATATAAGGTTGGTCAGCGCTTTGGACGCCATATTGATGAAAG
TGTTGATCTTGGAGAAGGCAAGCGCACATATTATACTTTGTTAATATATTTAAGCGGAGGTTCCAAAAGCAAAACAAAAAATGATACCAACAATTCCAAAGATCCTTCTT
CTGAGCCTCTAGTTGGAGGGGAAACTGTTTTCTATGCTTCTAGGAATGGCCTTGTGGCTGAGGTGGCGCCTACTGAAGGGATGGCTCTCCTGCATCTTCATGGGGATAAG
TGTTTGTTGCACGAAGCTCGCAACGTTACGAAAGGTGTAAAATATGTTTTCCGTTCAGATGTCATATTTTCTTGA
mRNA sequence
GCACAATTTGGGCCGAACAAACAAAATTTGATCGAGAAATAGAGCGACCGGTTGATTTCTGATGGCGGAATCACAGGGGAAGAAGAGAAAAATGGCGGGTAGAAGAGGGG
AGGGAGATTCGAAGAAAACAACCCCATTTTCGAAGTGGCCAGCCATTAAACCCAAGCAGAATCTTCAGATCATCCGCCTGAAAGAAAATGATCTGTTCACCGTGCCTAGT
TTTTTCACATGTGTTGAGTCAAAAGCATTCATCAAGACTGCAGAGTCGATGGGTTTTGTTCATCAGGGGAGCCTCGGTCCTACGAAAGGAGAAGCTTATAGAGATAATGA
TCGAATCTCGGTGAATGATCCTGATTTAGCTGACACCATTTGGCGTTCGGGACTTGATAAACTATTTGCTGATATTAAAATGCGGGGAAAATTTGCTGTTGGGTTGAATC
CAAATATCAGATTTTACAGATATAAGGTTGGTCAGCGCTTTGGACGCCATATTGATGAAAGTGTTGATCTTGGAGAAGGCAAGCGCACATATTATACTTTGTTAATATAT
TTAAGCGGAGGTTCCAAAAGCAAAACAAAAAATGATACCAACAATTCCAAAGATCCTTCTTCTGAGCCTCTAGTTGGAGGGGAAACTGTTTTCTATGCTTCTAGGAATGG
CCTTGTGGCTGAGGTGGCGCCTACTGAAGGGATGGCTCTCCTGCATCTTCATGGGGATAAGTGTTTGTTGCACGAAGCTCGCAACGTTACGAAAGGTGTAAAATATGTTT
TCCGTTCAGATGTCATATTTTCTTGATGATAAGTTCAGGCGATGCTCTTGTCTCGTGGATATCCGAAAAGCAAACTACAGTCTCACGGTCATCTGCGGAGGTTGAATATC
GTGCTCCTGTGGTCACACAAGTGAGATTTTATGGATTACTTGTTGTTTGACTGTCCCTCTTCGTCTCTGCTTTTCTGTGATAACATTGCAGCTATTCATATTGCTTCCAA
GCCCATGTTTCACAAGCGGACAAAATACATTGAATTAGACTGCCATTTTGTTAGAGACAAGGTAATCGTCGGACAGATAAAGCTCTTGCCAGTCAGGTACCGACTCCACT
TGTTGATATTTTTACCAAAGCTCTTCCCTTACTGTCTTTTTCAGACTTACTGTCCAAGATGGGCTGTTTTAAACATTTTAGCTCCATCTTGAGGGGGAGTATTAGGGGTA
ATAAGTTTTGTAATAGTTGATTAGTTGGAGGTTAGTTGTTCGTTTAGTTGGCTGGATGTTATAAGTTGTTGGACAGGTGTCTATTTATACTGTTTTGATATAATGAATAA
GATTCAAGCAG
Protein sequence
MAESQGKKRKMAGRRGEGDSKKTTPFSKWPAIKPKQNLQIIRLKENDLFTVPSFFTCVESKAFIKTAESMGFVHQGSLGPTKGEAYRDNDRISVNDPDLADTIWRSGLDK
LFADIKMRGKFAVGLNPNIRFYRYKVGQRFGRHIDESVDLGEGKRTYYTLLIYLSGGSKSKTKNDTNNSKDPSSEPLVGGETVFYASRNGLVAEVAPTEGMALLHLHGDK
CLLHEARNVTKGVKYVFRSDVIFS
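
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code and a CDS that starts in frame at its first base. The helper name translate is hypothetical and not part of CuGenDB. The two example inputs are the first 30 bases and the last line of the CDS as listed above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, and normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGGCGGAATCACAGGGGAAGAAGAGAAAA"
cds_end = ("TGTTTGTTGCACGAAGCTCGCAACGTTACGAAAGGTGTAAAATATG"
           "TTTTCCGTTCAGATGTCATATTTTCTTGA")
print(translate(cds_start))  # MAESQGKKRK
print(translate(cds_end))    # CLLHEARNVTKGVKYVFRSDVIFS
```

Both results match the corresponding ends of the listed protein sequence. Passing the full CDS, with or without its line breaks, should yield the complete 244-residue protein if the record is internally consistent.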