; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018134 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018134
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHydroxypyruvate reductase
Genome locationscaffold9:28025879..28043584
RNA-Seq ExpressionSpg018134
SyntenySpg018134
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0004650 - polygalacturonase activity (molecular function)
GO:0016616 - oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor (molecular function)
GO:0051287 - NAD binding (molecular function)
InterPro domainsIPR006139 - D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607638.1 hypothetical protein SDJN03_00980, partial [Cucurbita argyrosperma subsp. sororia]7.6e-6679.88Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNYH+CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

XP_022926176.1 uncharacterized protein LOC111433373 isoform X1 [Cucurbita moschata]7.6e-6679.88Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNYH+CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

XP_022926177.1 uncharacterized protein LOC111433373 isoform X2 [Cucurbita moschata]7.6e-6679.88Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNYH+CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

XP_022981502.1 uncharacterized protein LOC111480599 isoform X2 [Cucurbita maxima]6.4e-6579.29Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNY +CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

XP_023523324.1 uncharacterized protein LOC111787553 isoform X1 [Cucurbita pepo subsp. pepo]2.9e-6578.7Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKME+THEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+Q+DVVP+ DVPKVITNYH+CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

TrEMBL top hitse value%identityAlignment
A0A6J1EDT1 uncharacterized protein LOC111433373 isoform X23.7e-6679.88Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNYH+CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

A0A6J1EE52 uncharacterized protein LOC111433373 isoform X13.7e-6679.88Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNYH+CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

A0A6J1EEE7 uncharacterized protein LOC111433373 isoform X37.6e-6488.19Show/hide
Query:  MERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGI
        MERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNYH+CIPKM+KFDFDLISRASQMKLIVQ+GVGLEGVDIDAATKFGI
Subjt:  MERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGI

Query:  KVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        KVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  KVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

A0A6J1IU52 uncharacterized protein LOC111480599 isoform X23.1e-6579.29Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNY +CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

A0A6J1IWQ5 uncharacterized protein LOC111480599 isoform X13.1e-6579.29Show/hide
Query:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS
        SF+D   L    +   CN    +WKMERTHEDS+KGLTRVLFCGSQFA SHNYT+EYLQKYPF+QVDVVP+ DVPKVITNY +CIPKM+KFDFDLISRAS
Subjt:  SFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRAS

Query:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
        QMKLIVQ+GVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYL LGLLRRQ  E Q AI   RL
Subjt:  QMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

SwissProt top hitse value%identityAlignment
O29445 D-3-phosphoglycerate dehydrogenase4.5e-0536.47Show/hide
Query:  EDVPKVITNYH-ICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRR
        E++ + +  Y  I +    K D ++I  A  +K+I + GVG++ +DI+AAT+ GI V   P    GN +S AE AI L L   R+
Subjt:  EDVPKVITNYH-ICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRR

O33116 D-3-phosphoglycerate dehydrogenase2.2e-0436.99Show/hide
Query:  DFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAI
        D ++++ A ++K++ + GVGL+ VD+DAAT  G+ V   P   T N  S AE A+ L L    RQ+ E   ++
Subjt:  DFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAI

P0A545 D-3-phosphoglycerate dehydrogenase6.5e-0439.06Show/hide
Query:  DFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRR
        D ++++ A ++K++ + GVGL+ VD+DAAT  G+ V   P   T N  S AE A+ L L   R+
Subjt:  DFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRR

P73821 D-3-phosphoglycerate dehydrogenase5.0e-0429.13Show/hide
Query:  VDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIA
        +D+VP  D         I +    K    +I   SQ+K+I + GVG++ +D+ AAT+ GI V   P    GN ++ AE A+ + +  L R + +  K++ 
Subjt:  VDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIA

Query:  NTR
         ++
Subjt:  NTR

P9WNX2 D-3-phosphoglycerate dehydrogenase6.5e-0439.06Show/hide
Query:  DFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRR
        D ++++ A ++K++ + GVGL+ VD+DAAT  G+ V   P   T N  S AE A+ L L   R+
Subjt:  DFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRR

Arabidopsis top hitse value%identityAlignment
AT1G72190.1 D-isomer specific 2-hydroxyacid dehydrogenase family protein6.0e-4561.9Show/hide
Query:  VWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATK
        V K+ER  E  +  +TRVLFCG  F  S+N+TREYLQ YPF++VDVV   DVP+VI NYHIC+   ++ D ++ISRAS +KLI+Q+GVGL+GVDIDAATK
Subjt:  VWKMERTHEDSNKGLTRVLFCGSQFAGSHNYTREYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATK

Query:  FGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL
         GIKVARIPS  TGNA SC+EMAIYL LGLL++Q  E Q ++ N  L
Subjt:  FGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAIANTRL

AT1G72190.1 D-isomer specific 2-hydroxyacid dehydrogenase family protein3.3e-0379.17Show/hide
Query:  EMQIAVDQRRLGEPTGDTLLGKTV
        EMQI++  R LGEPTGDTLLGKTV
Subjt:  EMQIAVDQRRLGEPTGDTLLGKTV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCCCAATGGGATTTCACAGCTGGAAAAGTTCACCTTCATCATCAGAGATTACATGGAGCTTGCAATTAGTTCCAATGTTCTCAATTCTGTGGTTAGAACAGCTCG
TAATTTTGATATCCTAGCATTTCTTATGAACCAAATGTCATTTCAAGGAGCATATAGTAGTTTTGAAGATTTTCCAGACTTGGATTTTTCTCTTCTAGTGCTGATCTGCA
ATGCATTTCTCGTAGTGTGGAAGATGGAAAGAACACATGAGGATAGCAACAAAGGTTTAACCCGTGTTCTCTTTTGTGGGTCTCAATTTGCTGGTTCTCATAATTATACC
AGAGAATACTTGCAAAAGTATCCATTCGTTCAGGTTGATGTCGTTCCATTGGAGGATGTACCTAAAGTTATAACCAACTACCATATCTGTATTCCTAAAATGCTAAAATT
TGATTTTGATTTAATCTCTCGAGCAAGCCAGATGAAGCTCATAGTGCAGTTCGGTGTTGGCCTTGAAGGTGTGGATATTGATGCTGCCACAAAGTTTGGAATAAAAGTTG
CCAGGATACCGAGTGGAGTGACTGGAAATGCAATGTCATGTGCAGAAATGGCTATATACCTAACTTTAGGCCTCCTTCGCAGACAGTTGGGAGAAGGGCAGAAGGCTATT
GCCAATACTAGATTGTGTTTCCACGATGGACGACTTGGATATTTTGGGGCTTTAATAACACCAGTTATGGAGGTTGTTTGTGCTTCTCGAGGACTTCGATTGGAGGATCC
TCTTTCACCCTTTATGTTCCTGCTTGTGGTGGTTGTTCTAAGCAGGATTATTTCGCGTGGGGTTGAGCTTGAGCACAAGAGCAAGTTCATCTTGGGGAGATTAGGAAAAA
TGACACAAAAAAGAATAGAAGAGCGTTTGGATGTGACAGACACTAAACTCGAAGGTATCAAGAAAGAGATGCAGAAGTTACCAGCGATAGAAAGAAATCTGGCGAAGCTT
TCACAGATGTTAGAAGAGACGATAAAGGCCCTGGCAGCGATTGCAAGTGAAATGGCCTACTGGCGGACAAAACCATCATCCTTCGGCATCACGGAAGGATCAAGATCTAA
GGGAAAAGAACAGGAAAACAATCCAGTAAGAGATAGTGAGACACTGAATGTGAATGTCCCCCCTTCGAAAACCCGCAAAAGTGTGAAGAAAGAGGCTATCGGAGGCACTG
GAAGCGAACGTCTCAAGTGCATACTGAAAGAGATGCAGATTGCAGTTGACCAAAGAAGGCTTGGAGAGCCAACTGGAGATACACTTCTAGGGAAAACAGTAACTCTTTGT
CTCTCTCCCTCTCTCACTCTGATAGACCCCCAGTTAAATATGTATATAGTAGGAAACGGATCTATCTTGTATATACAGTATGGGTGTAAGATGGGTGGGGTATCTTTTGA
GCTAGGAATTCGGGTCTCGTGTTCAATTAGTACACCTATGTATATATATAAGTTTGCCAACTCACTGATAGTGCACAAGTGGGAGTTTTGGGAGGTCTCTACTCCTACTG
ATGGTAGAAGGCACCCAGTCAGTGGAGCTAAGTCTCATCCCTCAAGGCACCACTGGGCTTGTAGCGCTCTGAACTACCATTTCACTAGTGGGGCATTCATGGGTTTAAGT
CAGAAATCAACTTCCACAGTTAGTAAATGGCACCTAGTCTGCGAAGCCAAGTCCCGCCCATTAGGCCCCTCATCTCAGTCTAATACATCTCAGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTCCCAATGGGATTTCACAGCTGGAAAAGTTCACCTTCATCATCAGAGATTACATGGAGCTTGCAATTAGTTCCAATGTTCTCAATTCTGTGGTTAGAACAGCTCG
TAATTTTGATATCCTAGCATTTCTTATGAACCAAATGTCATTTCAAGGAGCATATAGTAGTTTTGAAGATTTTCCAGACTTGGATTTTTCTCTTCTAGTGCTGATCTGCA
ATGCATTTCTCGTAGTGTGGAAGATGGAAAGAACACATGAGGATAGCAACAAAGGTTTAACCCGTGTTCTCTTTTGTGGGTCTCAATTTGCTGGTTCTCATAATTATACC
AGAGAATACTTGCAAAAGTATCCATTCGTTCAGGTTGATGTCGTTCCATTGGAGGATGTACCTAAAGTTATAACCAACTACCATATCTGTATTCCTAAAATGCTAAAATT
TGATTTTGATTTAATCTCTCGAGCAAGCCAGATGAAGCTCATAGTGCAGTTCGGTGTTGGCCTTGAAGGTGTGGATATTGATGCTGCCACAAAGTTTGGAATAAAAGTTG
CCAGGATACCGAGTGGAGTGACTGGAAATGCAATGTCATGTGCAGAAATGGCTATATACCTAACTTTAGGCCTCCTTCGCAGACAGTTGGGAGAAGGGCAGAAGGCTATT
GCCAATACTAGATTGTGTTTCCACGATGGACGACTTGGATATTTTGGGGCTTTAATAACACCAGTTATGGAGGTTGTTTGTGCTTCTCGAGGACTTCGATTGGAGGATCC
TCTTTCACCCTTTATGTTCCTGCTTGTGGTGGTTGTTCTAAGCAGGATTATTTCGCGTGGGGTTGAGCTTGAGCACAAGAGCAAGTTCATCTTGGGGAGATTAGGAAAAA
TGACACAAAAAAGAATAGAAGAGCGTTTGGATGTGACAGACACTAAACTCGAAGGTATCAAGAAAGAGATGCAGAAGTTACCAGCGATAGAAAGAAATCTGGCGAAGCTT
TCACAGATGTTAGAAGAGACGATAAAGGCCCTGGCAGCGATTGCAAGTGAAATGGCCTACTGGCGGACAAAACCATCATCCTTCGGCATCACGGAAGGATCAAGATCTAA
GGGAAAAGAACAGGAAAACAATCCAGTAAGAGATAGTGAGACACTGAATGTGAATGTCCCCCCTTCGAAAACCCGCAAAAGTGTGAAGAAAGAGGCTATCGGAGGCACTG
GAAGCGAACGTCTCAAGTGCATACTGAAAGAGATGCAGATTGCAGTTGACCAAAGAAGGCTTGGAGAGCCAACTGGAGATACACTTCTAGGGAAAACAGTAACTCTTTGT
CTCTCTCCCTCTCTCACTCTGATAGACCCCCAGTTAAATATGTATATAGTAGGAAACGGATCTATCTTGTATATACAGTATGGGTGTAAGATGGGTGGGGTATCTTTTGA
GCTAGGAATTCGGGTCTCGTGTTCAATTAGTACACCTATGTATATATATAAGTTTGCCAACTCACTGATAGTGCACAAGTGGGAGTTTTGGGAGGTCTCTACTCCTACTG
ATGGTAGAAGGCACCCAGTCAGTGGAGCTAAGTCTCATCCCTCAAGGCACCACTGGGCTTGTAGCGCTCTGAACTACCATTTCACTAGTGGGGCATTCATGGGTTTAAGT
CAGAAATCAACTTCCACAGTTAGTAAATGGCACCTAGTCTGCGAAGCCAAGTCCCGCCCATTAGGCCCCTCATCTCAGTCTAATACATCTCAGTCCTAG
Protein sequenceShow/hide protein sequence
MSPNGISQLEKFTFIIRDYMELAISSNVLNSVVRTARNFDILAFLMNQMSFQGAYSSFEDFPDLDFSLLVLICNAFLVVWKMERTHEDSNKGLTRVLFCGSQFAGSHNYT
REYLQKYPFVQVDVVPLEDVPKVITNYHICIPKMLKFDFDLISRASQMKLIVQFGVGLEGVDIDAATKFGIKVARIPSGVTGNAMSCAEMAIYLTLGLLRRQLGEGQKAI
ANTRLCFHDGRLGYFGALITPVMEVVCASRGLRLEDPLSPFMFLLVVVVLSRIISRGVELEHKSKFILGRLGKMTQKRIEERLDVTDTKLEGIKKEMQKLPAIERNLAKL
SQMLEETIKALAAIASEMAYWRTKPSSFGITEGSRSKGKEQENNPVRDSETLNVNVPPSKTRKSVKKEAIGGTGSERLKCILKEMQIAVDQRRLGEPTGDTLLGKTVTLC
LSPSLTLIDPQLNMYIVGNGSILYIQYGCKMGGVSFELGIRVSCSISTPMYIYKFANSLIVHKWEFWEVSTPTDGRRHPVSGAKSHPSRHHWACSALNYHFTSGAFMGLS
QKSTSTVSKWHLVCEAKSRPLGPSSQSNTSQS