CuGenDBv2

Lsi05G012760 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G012760
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Fe2OG dioxygenase domain-containing protein
Genome location: chr05:20488411..20491199
RNA-Seq Expression: Lsi05G012760
Synteny: Lsi05G012760
Gene Ontology terms:
  GO:0018401 - peptidyl-proline hydroxylation to 4-hydroxy-L-proline (biological process)
  GO:0005783 - endoplasmic reticulum (cellular component)
  GO:0004656 - procollagen-proline 4-dioxygenase activity (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0031418 - L-ascorbic acid binding (molecular function)
InterPro domains:
  IPR006620 - Prolyl 4-hydroxylase, alpha subunit
  IPR045054 - Prolyl 4-hydroxylase
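
The genome location string in this record can be unpacked into chromosome, start, and end. The following is a minimal sketch, assuming the `chrom:start..end` string uses 1-based, inclusive coordinates (an assumption; the record does not state its coordinate convention). The function names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumes 1-based inclusive coordinates, so both endpoints are counted.
    return end - start + 1

chrom, start, end = parse_location("chr05:20488411..20491199")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2,789 bp on chr05. This genomic span is longer than the mRNA sequence given below, which would be consistent with introns, although the record does not list the gene structure.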


Homology

GenBank top hits (e-value, % identity, alignment)
KAA0033599.1 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1 [Cucumis melo var. makuwa]; e-value 4.5e-121; identity 90.6%
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_004150303.1 uncharacterized protein LOC101210552 [Cucumis sativus]; e-value 2.3e-125; identity 90.57%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LK+NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSS+ LVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_008439151.1 PREDICTED: uncharacterized protein LOC103484033 [Cucumis melo]; e-value 1.0e-125; identity 90.57%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

XP_022140818.1 uncharacterized protein LOC111011396 [Momordica charantia]; e-value 9.4e-119; identity 86.42%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD KKT P++ WP IKPKQNLQI RLKENDLFTVPSF T VES+ FI  AES+GFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GK TYYTLL+YLSGGSK K KNDTND K PSSEPLVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIF
        +VAEVAPTEGMALLHLHG+KCLLHEARNVTKG+KY+FRSDV F
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIF

XP_038874454.1 uncharacterized protein LOC120067107 [Benincasa hispida]; e-value 7.7e-129; identity 92.24%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRR EGDFKKTDPSSKWP IKPKQNLQI+RLK+NDLFTVPSFF+CVES+ FIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        DIIWRSGLDNLF+DIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFSS
        +VAEV+PTEGMALLHLHG+KCLLHEARNVT+GVKYVFRSDVIFSS
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFSS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L9L9 Fe2OG dioxygenase domain-containing protein; e-value 1.1e-125; identity 90.57%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAET GKKRKMAGRRGEGDFKKTDPSS WP IKPKQNLQ+  LK+NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSS+ LVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A1S3AY48 uncharacterized protein LOC103484033; e-value 5.0e-126; identity 90.57%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        DIIWRSGL+NLFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        ++AEVAPTEGMALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A5A7SWR3 2-oxoglutarate and Fe(II)-dependent oxygenase superfamily protein isoform 1; e-value 2.2e-121; identity 90.6%
Query:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
        MAGRRGEGDF KTDPSS WP IKPKQNLQ+  LK NDLFTVPSFFTCVES+AFIKAAESLGF+HQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
Subjt:  MAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN

Query:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG
        LFADIKIRGK+AVGLNPNIRLYRYKVGQRFGRHIDESVDLG GK TYYTLL+YLSGGSKNK KNDTN+ KDPSSEPLVGGET+FYGSRN ++AEVAPTEG
Subjt:  LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEG

Query:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
        MALLHLHG+KCLLHEARNV KGVKYVFRSDVIFS
Subjt:  MALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

A0A6J1CIV9 uncharacterized protein LOC111011396; e-value 4.6e-119; identity 86.42%
Query:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA
        MAETQGKKRKMAGRRGEGD KKT P++ WP IKPKQNLQI RLKENDLFTVPSF T VES+ FI  AES+GFVHQGSLGPTKGEA+RDNDRISVNDPDLA
Subjt:  MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLA

Query:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS
        D IWRSGLD LFADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLG GK TYYTLL+YLSGGSK K KNDTND K PSSEPLVGGET+FYGSRN 
Subjt:  DIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNS

Query:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIF
        +VAEVAPTEGMALLHLHG+KCLLHEARNVTKG+KY+FRSDV F
Subjt:  IVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIF

A0A6J1FZS7 uncharacterized protein LOC111449413; e-value 1.0e-118; identity 86.69%
Query:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND
        MAETQG+KRKMAGRR EGD KKT     PSS+WP IKPKQ+LQI RLKENDLFTVPSFFT VES+AFIK AES+GFVHQGSLGP+KGEAYRDNDRISVND
Subjt:  MAETQGKKRKMAGRRGEGDFKKT----DPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVND

Query:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYG
        PDLAD IWRSGLD  FADIKIRGKVAVGLNPNIR YRY VGQRFGRHIDESVDLGEGK TYYTLL+YLSGGSK K KNDTN+ KDPSSEPLVGGET+FYG
Subjt:  PDLADIIWRSGLDNLFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYG

Query:  SRNSIVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS
         RN +VAEVAPTEGMALLHLHG+KCLLHEARNVTKGVKYVFRSDVIFS
Subjt:  SRNSIVAEVAPTEGMALLHLHGNKCLLHEARNVTKGVKYVFRSDVIFS

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT5G51880.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein; e-value 9.2e-88; identity 71.04%
Query:  SSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL
        S KWP IK K NL ++ LK +DLFTV +  T  ES+AF+K AESLGF HQGS GP  GEAYRDN RISVNDP LAD +W+SGL NLF DIKIR KVAVGL
Subjt:  SSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDNLFADIKIRGKVAVGL

Query:  NPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGS-KNKMKNDTNDFKDPSS-EPLVGGETIFYGSRNSIVAEVAPTEGMALLHLHGNKCLL
        NPNIR YRY  GQ FGRHIDES DL +G  TYYTLL+YLSG S K+K K+ ++   D SS EPLVGGET+FYGSRNSIVAEVAP EGMAL H+HG+KC+L
Subjt:  NPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGS-KNKMKNDTNDFKDPSS-EPLVGGETIFYGSRNSIVAEVAPTEGMALLHLHGNKCLL

Query:  HEARNVTKGVKYVFRSDVIFS
        HE RNV+KGVKYVFRSDV+F+
Subjt:  HEARNVTKGVKYVFRSDVIFS
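
The alignments above follow the usual BLAST-style layout, in which the middle line repeats a letter where query and subject residues are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. As a rough sketch of how a per-block identity could be tallied from that middle line (the helper name `midline_stats` is hypothetical, and the reported % identity values are computed by the search tool over the whole alignment, not by this code):

```python
def midline_stats(query, midline):
    """Count identical and positive columns in one alignment block.

    `query` is the residue string from a Query row and `midline` is the
    match line beneath it, both with the row label and indentation removed.
    """
    # Trailing mismatches appear as spaces that are easily stripped in
    # copied text, so pad the midline back to the query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)

# Last alignment block of the AT5G51880.1 hit.
query = "HEARNVTKGVKYVFRSDVIFS"
midline = "HE RNV+KGVKYVFRSDV+F+"
identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

Summing the identical columns over all blocks of a hit and dividing by the total number of alignment columns should approximate the % identity listed for that hit, assuming the standard definition of identity as identical columns over alignment length.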


Sequences

CDS sequence
ATGGCGGAAACGCAGGGGAAGAAGAGAAAAATGGCGGGCAGAAGAGGGGAGGGGGATTTCAAGAAAACAGACCCCTCTTCGAAGTGGCCAACCATTAAACCCAAGCAGAA
TCTTCAAATCACTCGCCTGAAAGAAAATGATCTTTTCACTGTGCCAAGTTTTTTCACTTGTGTTGAGTCAAGAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTC
ATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTACAGAGATAATGATCGAATCTCGGTGAATGATCCTGATTTAGCAGACATCATTTGGCGTTCTGGGCTTGATAAC
CTATTTGCTGATATTAAAATTCGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAGATACAAGGTTGGTCAGCGCTTTGGACGTCATATTGATGAAAG
TGTTGATCTTGGAGAAGGCAAGTGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGAACAAAATGAAAAATGATACCAACGACTTCAAAGATCCTTCTT
CTGAGCCTCTGGTTGGAGGGGAAACTATTTTCTATGGTTCAAGGAATAGTATTGTTGCTGAGGTGGCTCCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGAACAAG
TGTTTGTTGCATGAAGCTCGCAACGTTACAAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATTCTCTTCATGA

mRNA sequence
ATTAAACTTTCATCGAGAAATAGGGGCACTGGTTGATTTCTGATGGCGGAAACGCAGGGGAAGAAGAGAAAAATGGCGGGCAGAAGAGGGGAGGGGGATTTCAAGAAAAC
AGACCCCTCTTCGAAGTGGCCAACCATTAAACCCAAGCAGAATCTTCAAATCACTCGCCTGAAAGAAAATGATCTTTTCACTGTGCCAAGTTTTTTCACTTGTGTTGAGT
CAAGAGCATTCATCAAGGCTGCAGAGTCATTGGGTTTTGTTCATCAGGGGAGCCTTGGTCCTACTAAAGGAGAAGCTTACAGAGATAATGATCGAATCTCGGTGAATGAT
CCTGATTTAGCAGACATCATTTGGCGTTCTGGGCTTGATAACCTATTTGCTGATATTAAAATTCGGGGAAAAGTTGCTGTTGGGTTGAATCCAAATATCAGATTATACAG
ATACAAGGTTGGTCAGCGCTTTGGACGTCATATTGATGAAAGTGTTGATCTTGGAGAAGGCAAGTGCACATATTATACTTTGTTAGTATACTTAAGCGGAGGTTCCAAGA
ACAAAATGAAAAATGATACCAACGACTTCAAAGATCCTTCTTCTGAGCCTCTGGTTGGAGGGGAAACTATTTTCTATGGTTCAAGGAATAGTATTGTTGCTGAGGTGGCT
CCTACTGAAGGGATGGCTCTCCTGCACCTTCATGGGAACAAGTGTTTGTTGCATGAAGCTCGCAACGTTACAAAGGGAGTCAAATATGTTTTCCGTTCAGATGTCATATT
CTCTTCATGATTAGTTCAGGTTGTCGGTGTCCTTGATGAAAAATAAATAAAATTGAATTATCTACACGTGATCAAGAATGTGCTATTGATGATTTGGTTGTTTATAGGCC
TCCTGATGTTGTGACAACTGCAAAATGATCACTCAGGTTGGTTCCTGAAATAAGCTATAGAATTCAGTGTACCATTTGTTGAAATGAAGCAGTTCATGATCTTGTAAGTT
GTATTACTAAGTGACTGGTTGAATTATATGACATTTATGCCACTCTGATATATAATATATTTACCCTCATCACTGTTCTGCCTAAGAAATTTACTTTCCTAAATTTGATG
TTTCATTTCTTATGTTAGGCTTGTTATTCAAAGTTGCCATCTTTTTGTGGCCTTTAAATATATTGGCAATCAATGTCATCTGTTATTACGCTTCTATTAGAATCACTTCT
GTTTTG

Protein sequence
MAETQGKKRKMAGRRGEGDFKKTDPSSKWPTIKPKQNLQITRLKENDLFTVPSFFTCVESRAFIKAAESLGFVHQGSLGPTKGEAYRDNDRISVNDPDLADIIWRSGLDN
LFADIKIRGKVAVGLNPNIRLYRYKVGQRFGRHIDESVDLGEGKCTYYTLLVYLSGGSKNKMKNDTNDFKDPSSEPLVGGETIFYGSRNSIVAEVAPTEGMALLHLHGNK
CLLHEARNVTKGVKYVFRSDVIFSS
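
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (reasonable for a nuclear plant gene, but not stated in the record); the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 36 bases of the CDS sequence listed in this record.
print(translate("ATGGCGGAAACGCAGGGGAAGAAGAGAAAAATGGCG"))
```

Passing the full CDS (all lines joined) and comparing the result with the protein sequence is a quick consistency check; the final codons of the CDS, `TTC TCT TCA TGA`, translate to `FSS` followed by a stop, matching the end of the listed protein.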