; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014249 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014249
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDAO domain-containing protein
Genome locationscaffold3:42018175..42030430
RNA-Seq ExpressionSpg014249
SyntenySpg014249
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0016491 - oxidoreductase activity (molecular function)
InterPro domainsIPR006076 - FAD dependent oxidoreductase
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593065.1 hypothetical protein SDJN03_12541, partial [Cucurbita argyrosperma subsp. sororia]1.8e-22076.81Show/hide
Query:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT
        M SSS SLR CPS S SIRT  ASCSSRH C  GF PKW  INP QWRKS + S  HDYR+GPV FCALKDVKSSSSTSRNGN FEFDVVIIGAGIIGLT
Subjt:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT

Query:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL
        IARQFL+ SDLSVAVVDKAVPCSGATGAGQGYLWM HKSP SDIWELALRSHRLWE LAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVK L
Subjt:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL

Query:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS
        +GAGLE EYLS ADLLSMEPALLIG+SCGAAFLPNDCQLDAH TAAFIEKANR+F+GRYAEFFHDPVTGLLR                            
Subjt:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS

Query:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL
                         SGSNGKIEAVQT+KTTLYSKKAIVLAAGCWSGTLLRDLL EEKTVLDVPIMPRKGHLLVIENFNS HVNHGLMEVGYVNHQAL
Subjt:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL

Query:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE
        TSAKDLE TSSISMTATMDVQGNLVLGSSREFAGFNT+VNESII RIWERASEFFPTMKE+SFS+IKS+SKVRIGLRPY                   + 
Subjt:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE

Query:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
         H       ALGTAEMI NMVLG   KV+PA FS+QGRC
Subjt:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

XP_022960152.1 uncharacterized protein LOC111460978 [Cucurbita moschata]1.8e-22076.62Show/hide
Query:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT
        M SSS SLR CPS S SIRT  ASCSSRH C  GF+PKW  INP QWRKS + S  HDYR+GPV FCALKDVKSSSSTSRNGN FEFDVVIIGAGIIGLT
Subjt:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT

Query:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL
        IARQFL+ SDLSVAVVDKAVPCSGATGAGQGYLWM HKSP SDIWELALRSHRLWE LAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVK L
Subjt:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL

Query:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS
        +GAGLE EYLS ADLLSMEPALLIG+SCGAAFLPNDCQLDAH TAAFIEKANR+F+GRYAEFFHDPVTGLLR                            
Subjt:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS

Query:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL
                         SGSNGKIEAVQT+KTTLYSKKAIVLAAGCWSGTLLRDLL EEKTVLDVPIMPRKGHLLVIENFNS HVNHGLMEVGYVNHQAL
Subjt:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL

Query:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE
        TSAKDLE TSSISMTATMDVQGNLVLGSSREFAGFNT++NESII RIWERASEFFPTMKE+SFS+IKS+SKVRIGLRPY                   + 
Subjt:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE

Query:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
         H       ALGTAEMI NMVLG   KV+PA FS+QGRC
Subjt:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

XP_023004792.1 uncharacterized protein LOC111497986 [Cucurbita maxima]9.8e-21976.44Show/hide
Query:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT
        M SSS SLR CPS S SIRT I S SSRH C  GF PKW  I+P QWRKS + S GHDYR+GPV FCALKDVKSSSSTSRNGN FEFDVVIIGAGIIGLT
Subjt:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT

Query:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL
        IARQFL+ SDLSVAVVDK VPCSGATGAGQGYLWM HKSP SDIWELALRSHRLWE LAESLRDQGLNPSEELGWKKTGSLLIGKTPEE DMLKRKVK L
Subjt:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL

Query:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS
        +GAGLE EYLS ADLLSMEPALLIG+SCGAAFLPNDCQLDAH TAAFIEKANR+F+GRYAEFFHDPVTGLLR                            
Subjt:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS

Query:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL
                         SGSNGKIEAVQT+KTTLYSKKAIVLAAGCWSGTLLRDLL EEKTVLDVPIMPRKGHLLVIENFNS HVNHGLMEVGYVNHQAL
Subjt:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL

Query:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE
        TSAKDLE TSSISMTATMDVQGNLVLGSSREFAGFNT++NESII RIWERASEFFPTMKEVSFS+IKS+SKVRIGLRPY                   + 
Subjt:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE

Query:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
         H       ALGTAEMI NMVLG   KV+PA FSVQGRC
Subjt:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

XP_023514870.1 uncharacterized protein LOC111779048 [Cucurbita pepo subsp. pepo]4.0e-22076.62Show/hide
Query:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT
        M SSS SLR CPS S SIRT  ASCSSRH C  GF+PKW  INP QWRKS + S GHDY +GPV FCALKDVKSSSSTSRNG+ FEFDVVIIGAGIIGLT
Subjt:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT

Query:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL
        IARQFL+ SD+SVAVVDKAVPCSGATGAGQGYLWM HKSP SDIWELALRSHRLWE LAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVK L
Subjt:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL

Query:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS
        +GAGLE EYLS ADLLSMEPALLIG+SCGAAFLPNDCQLDAH TAAFIEKANR+F+GRYAEFFHDPVTGLLR                            
Subjt:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS

Query:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL
                         SGSNGKIEAVQT+KTTLYSKKAIVLAAGCWSGTLLRDLL EEKTVLDVPIMPRKGHLLVIENFNS HVNHGLMEVGYVNHQAL
Subjt:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL

Query:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE
        TSAKDLE TSSISMTATMDVQGNLVLGSSREFAGFNT+VNESII RIWERASEFFPTMKE+SFS+IKS+SKVRIGLRPY                   + 
Subjt:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE

Query:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
         H       ALGTAEMI NMVLG   KV+PA FSVQGRC
Subjt:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

XP_038899358.1 D-amino acid dehydrogenase [Benincasa hispida]3.1e-22076.49Show/hide
Query:  SSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIAR
        SSSSLR CPS +PS RT+IASCS+RHFCNFGFNPKWP INP+   +S + S  HDY + PV FCA KDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIAR
Subjt:  SSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIAR

Query:  QFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTGA
        QFL+GSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAE+LRDQGLNPSEELGW+KTGSLLIG+TPEELDMLKRKVK L+ A
Subjt:  QFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTGA

Query:  GLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGVC
        GLEAEYLSGADLLSMEPALLIGD+CGAAFLPNDCQLDAH T+AFI+KANRHFKGRYAEFFHDPVTGLLR                               
Subjt:  GLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGVC

Query:  DATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTSA
                      SGS+GKIEAV+TSKTTLYSKKAIVLAAGCWSGTLLRDLL E KTVLDVPIMPRKGHLLVIENFNSLHVN GLMEVGYVNHQALT A
Subjt:  DATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTSA

Query:  KDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEYH-
        KD EQ+SS+SMTATMDVQGNL+LGSSREFAGFNTEVNE I+ RIWERASEFF TMKEVSFSDIKS+SKVRIGLRPY                   +  H 
Subjt:  KDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEYH-

Query:  ------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
              ALGTAEMIANMVLGS  KV+PA FSVQGRC
Subjt:  ------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

TrEMBL top hitse value%identityAlignment
A0A1S3BR23 D-amino acid dehydrogenase7.4e-21274.49Show/hide
Query:  SSSLRPCPSPSP-SIRTSIASCSS-RHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIA
        +SS+ PCPS SP S RT+IASCSS RHF NFGFNPKW  INP+Q      R+    YR+ PV FCALKDVKSSSS SRNGNAFEFDVVIIGAGIIGLTIA
Subjt:  SSSLRPCPSPSP-SIRTSIASCSS-RHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIA

Query:  RQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTG
        RQFL+GSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRS RLWEGLAE+LRDQGLNPSEELGWKKTGSLLIG+TP+ELDMLKRKVK L+G
Subjt:  RQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTG

Query:  AGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGV
        AGLEAEYLS  DLLSMEPALLIGDSCGAAFLPNDCQLDA+ TAAFI+KANRHFKGRYAEFFHDPVTGLLR                              
Subjt:  AGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGV

Query:  CDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTS
                       SGS+GKIEAVQTSKTTLYSKKAIV+AAGCWSGTLLRDLL E KTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALT 
Subjt:  CDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTS

Query:  AKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEYH
        AKD EQTSS+SMTATMDVQGNL+LGSSREFAGFNTE+NE I+ RIWERASEFFPT+KEVS SDIK +SKVRIGLRPY                   +  H
Subjt:  AKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEYH

Query:  -------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
               A+GTAEMI NMVLGS  KV+PA F +QGRC
Subjt:  -------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

A0A5A7V1N9 D-amino acid dehydrogenase7.4e-21274.49Show/hide
Query:  SSSLRPCPSPSP-SIRTSIASCSS-RHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIA
        +SS+ PCPS SP S RT+IASCSS RHF NFGFNPKW  INP+Q      R+    YR+ PV FCALKDVKSSSS SRNGNAFEFDVVIIGAGIIGLTIA
Subjt:  SSSLRPCPSPSP-SIRTSIASCSS-RHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIA

Query:  RQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTG
        RQFL+GSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRS RLWEGLAE+LRDQGLNPSEELGWKKTGSLLIG+TP+ELDMLKRKVK L+G
Subjt:  RQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTG

Query:  AGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGV
        AGLEAEYLS  DLLSMEPALLIGDSCGAAFLPNDCQLDA+ TAAFI+KANRHFKGRYAEFFHDPVTGLLR                              
Subjt:  AGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGV

Query:  CDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTS
                       SGS+GKIEAVQTSKTTLYSKKAIV+AAGCWSGTLLRDLL E KTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALT 
Subjt:  CDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTS

Query:  AKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEYH
        AKD EQTSS+SMTATMDVQGNL+LGSSREFAGFNTE+NE I+ RIWERASEFFPT+KEVS SDIK +SKVRIGLRPY                   +  H
Subjt:  AKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEYH

Query:  -------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
               A+GTAEMI NMVLGS  KV+PA F +QGRC
Subjt:  -------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

A0A6J1CR48 uncharacterized protein LOC111013420 isoform X12.7e-21473.42Show/hide
Query:  SSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKD--VKSSSSTSRNGNAFEFDVVIIGAGIIGLTI
        S+S L PCP P PS+RT  ASCSSRHFCNFGFNPKW    P  WRKSVD+S GHDYRF P  FCALKD    SSSSTSRN NAFEFDVVIIGAGIIGLTI
Subjt:  SSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKD--VKSSSSTSRNGNAFEFDVVIIGAGIIGLTI

Query:  ARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLT
        ARQFL+GSDLSVAVVDKAVPCSGATGAGQGYLWM HKSPGSDIWELALRSHRLWEGLAE+LRDQGLNPSEELGWKKTGSLLIG+TPEELD+LKRKVK L+
Subjt:  ARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLT

Query:  GAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISG
          GLEAE+LSG DLLSMEPAL +GD+CGAAF+PNDCQLDAHRT AFIEKANRHFKGRYAEF+H PVTGLLR                             
Subjt:  GAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISG

Query:  VCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALT
                        SGSNGKIEAVQTSKTTL+SKKAIVLAAGCWSGTLLRD+L EEKTVLDVPIMPRKGHLLVIENFNSLH+NHGLMEVGYVNHQ LT
Subjt:  VCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALT

Query:  SAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEY
        SA+ LEQTSSISMTATMDVQGNLVLGSSREFAGFNTEV+ESII RIW RASEFFP MKEVS SD+KSNSKVR+GLRPY                   +  
Subjt:  SAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SEY

Query:  H-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
        H       A GTAEM+ +MVLG+  K++P  FS +GRC
Subjt:  H-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

A0A6J1H6L0 uncharacterized protein LOC1114609788.7e-22176.62Show/hide
Query:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT
        M SSS SLR CPS S SIRT  ASCSSRH C  GF+PKW  INP QWRKS + S  HDYR+GPV FCALKDVKSSSSTSRNGN FEFDVVIIGAGIIGLT
Subjt:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT

Query:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL
        IARQFL+ SDLSVAVVDKAVPCSGATGAGQGYLWM HKSP SDIWELALRSHRLWE LAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVK L
Subjt:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL

Query:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS
        +GAGLE EYLS ADLLSMEPALLIG+SCGAAFLPNDCQLDAH TAAFIEKANR+F+GRYAEFFHDPVTGLLR                            
Subjt:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS

Query:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL
                         SGSNGKIEAVQT+KTTLYSKKAIVLAAGCWSGTLLRDLL EEKTVLDVPIMPRKGHLLVIENFNS HVNHGLMEVGYVNHQAL
Subjt:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL

Query:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE
        TSAKDLE TSSISMTATMDVQGNLVLGSSREFAGFNT++NESII RIWERASEFFPTMKE+SFS+IKS+SKVRIGLRPY                   + 
Subjt:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE

Query:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
         H       ALGTAEMI NMVLG   KV+PA FS+QGRC
Subjt:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

A0A6J1KRE1 uncharacterized protein LOC1114979864.8e-21976.44Show/hide
Query:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT
        M SSS SLR CPS S SIRT I S SSRH C  GF PKW  I+P QWRKS + S GHDYR+GPV FCALKDVKSSSSTSRNGN FEFDVVIIGAGIIGLT
Subjt:  MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLT

Query:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL
        IARQFL+ SDLSVAVVDK VPCSGATGAGQGYLWM HKSP SDIWELALRSHRLWE LAESLRDQGLNPSEELGWKKTGSLLIGKTPEE DMLKRKVK L
Subjt:  IARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVL

Query:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS
        +GAGLE EYLS ADLLSMEPALLIG+SCGAAFLPNDCQLDAH TAAFIEKANR+F+GRYAEFFHDPVTGLLR                            
Subjt:  TGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVIS

Query:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL
                         SGSNGKIEAVQT+KTTLYSKKAIVLAAGCWSGTLLRDLL EEKTVLDVPIMPRKGHLLVIENFNS HVNHGLMEVGYVNHQAL
Subjt:  GVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQAL

Query:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE
        TSAKDLE TSSISMTATMDVQGNLVLGSSREFAGFNT++NESII RIWERASEFFPTMKEVSFS+IKS+SKVRIGLRPY                   + 
Subjt:  TSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPY-------------------SE

Query:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC
         H       ALGTAEMI NMVLG   KV+PA FSVQGRC
Subjt:  YH-------ALGTAEMIANMVLGSLVKVNPATFSVQGRC

SwissProt top hitse value%identityAlignment
Q7TSQ8 Pyruvate dehydrogenase phosphatase regulatory subunit, mitochondrial4.0e-0525.27Show/hide
Query:  KSSSSTSRNGN---AFEFDVVIIGAGIIGLTIARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNP
        +SS+ST+   +     +  VVI G GI+G ++A          + ++++    +G+T    G L  A  S  S   ++A  S++L+  L +   + G+  
Subjt:  KSSSSTSRNGN---AFEFDVVIIGAGIIGLTIARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNP

Query:  SEELGWKKTGSLLIGKTPEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKA
          + G+ +TGS+ + +T + L  LKR    L   G+ +E +S   +  + P L + D  GA ++P D  + +   A  +  A
Subjt:  SEELGWKKTGSLLIGKTPEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKA

Arabidopsis top hitse value%identityAlignment
AT5G48440.1 FAD-dependent oxidoreductase family protein4.5e-12955.26Show/hide
Query:  FDVVIIGAGIIGLTIARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKT
        FDVV++G GIIGLTIARQFL GSDLSVAVVDKAVPCSGATGAGQGY+WM HK PGSD+W+L LRSH LW  LAESL D GL+P E LGWKKTGSLLIG+T
Subjt:  FDVVIIGAGIIGLTIARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKT

Query:  PEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHF--KGRYAEFFHDPVTGLLRYSQHLSKNSFYS
         EE   LK+KV  L+ AGL  EYLS A+LL  EPA+L+ D+ GAAFLP+D QLDAHR  A+IEK NR F   GRYAEF+++PVTGL              
Subjt:  PEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHF--KGRYAEFFHDPVTGLLRYSQHLSKNSFYS

Query:  ITLGVRQTGSSKDVISGVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLH
            +R  G SK V++G                         V+T K  LY KKA ++AAGCWSG+L+ +LL +    LDVP+ PRKGHLLV+ENF+S H
Subjt:  ITLGVRQTGSSKDVISGVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLH

Query:  VNHGLMEVGYVNHQ-ALTSAKDL-EQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPYSE-
        +NHG+ME GY NHQ A  S  D+ E+  SISMTATMD  GNLVLGSSREF GF+TE +E II  IWERA+EFFP ++++S  D   N KVR+GLRPY   
Subjt:  VNHGLMEVGYVNHQ-ALTSAKDL-EQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPYSE-

Query:  -------------------------YHALGTAEMIANMVLGSLVKVNPATFSVQGR
                                   AL TAEM+ +MVLG   +V+ +TF V+GR
Subjt:  -------------------------YHALGTAEMIANMVLGSLVKVNPATFSVQGR

AT5G48440.2 FAD-dependent oxidoreductase family protein2.3e-12559.4Show/hide
Query:  FDVVIIGAGIIGLTIARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKT
        FDVV++G GIIGLTIARQFL GSDLSVAVVDKAVPCSGATGAGQGY+WM HK PGSD+W+L LRSH LW  LAESL D GL+P E LGWKKTGSLLIG+T
Subjt:  FDVVIIGAGIIGLTIARQFLLGSDLSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKT

Query:  PEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHF--KGRYAEFFHDPVTGLLRYSQHLSKNSFYS
         EE   LK+KV  L+ AGL  EYLS A+LL  EPA+L+ D+ GAAFLP+D QLDAHR  A+IEK NR F   GRYAEF+++PVTGL              
Subjt:  PEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEPALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHF--KGRYAEFFHDPVTGLLRYSQHLSKNSFYS

Query:  ITLGVRQTGSSKDVISGVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLH
            +R  G SK V++G                         V+T K  LY KKA ++AAGCWSG+L+ +LL +    LDVP+ PRKGHLLV+ENF+S H
Subjt:  ITLGVRQTGSSKDVISGVCDATVEPVCLDAAILSGSNGKIEAVQTSKTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLH

Query:  VNHGLMEVGYVNHQ-ALTSAKDL-EQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPYSE
        +NHG+ME GY NHQ A  S  D+ E+  SISMTATMD  GNLVLGSSREF GF+TE +E II  IWERA+EFFP ++++S  D   N KVR+GLRPYS+
Subjt:  VNHGLMEVGYVNHQ-ALTSAKDL-EQTSSISMTATMDVQGNLVLGSSREFAGFNTEVNESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPYSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTCTTCTTCTTCTTCACTGCGTCCTTGCCCAAGCCCAAGTCCCTCTATCAGGACCAGCATAGCTTCATGTTCTAGTAGACATTTCTGCAACTTCGGATTCAATCC
AAAATGGCCGCTGATTAATCCTCACCAGTGGAGAAAATCGGTCGATAGGAGCCGTGGTCATGATTATAGGTTCGGACCGGTGTGCTTCTGTGCTTTGAAAGATGTGAAAT
CTTCATCTTCAACCTCTCGTAATGGCAATGCTTTTGAGTTCGATGTCGTGATCATTGGAGCGGGGATTATCGGGTTAACTATAGCGCGGCAGTTTCTTCTCGGGTCGGAT
TTGTCCGTCGCCGTCGTGGATAAGGCCGTGCCTTGCTCTGGAGCTACCGGTGCAGGGCAGGGCTATTTATGGATGGCGCACAAATCCCCTGGCAGTGACATTTGGGAACT
TGCTTTGAGAAGCCACAGGCTGTGGGAGGGTCTGGCTGAGTCCCTGCGTGATCAAGGCTTGAATCCATCGGAAGAATTGGGTTGGAAAAAGACAGGGAGCCTATTAATTG
GTAAAACACCTGAAGAGCTTGACATGTTAAAGAGGAAGGTAAAAGTACTAACTGGGGCCGGACTGGAAGCTGAATACTTGTCTGGTGCTGATCTGCTTTCGATGGAACCA
GCTCTGCTGATTGGGGACAGCTGTGGGGCTGCCTTTCTCCCCAATGACTGCCAATTGGATGCTCATCGTACTGCTGCCTTTATTGAAAAGGCAAACAGACATTTTAAAGG
AAGATATGCAGAGTTTTTTCATGACCCCGTTACAGGTTTATTGAGATACTCGCAACATCTATCAAAAAATTCTTTCTACAGCATAACCCTCGGAGTCCGTCAAACAGGAA
GCAGTAAGGATGTAATTTCTGGAGTGTGCGATGCAACTGTTGAGCCAGTTTGCCTTGATGCTGCTATTCTATCTGGTAGCAATGGGAAGATAGAAGCTGTTCAGACGTCC
AAGACTACATTGTATAGTAAGAAGGCCATTGTTTTGGCAGCTGGTTGTTGGAGTGGGACTTTGCTGCGTGATTTGCTCATGGAAGAAAAAACGGTTTTGGATGTTCCCAT
AATGCCAAGAAAGGGTCACTTGCTTGTCATCGAGAATTTTAACTCCCTTCATGTTAATCATGGTTTGATGGAGGTGGGCTATGTTAATCACCAAGCCTTAACTTCGGCTA
AAGATCTTGAGCAGACCTCATCCATTTCAATGACTGCTACTATGGATGTTCAAGGCAACCTAGTACTTGGGAGTAGTCGTGAGTTTGCTGGTTTCAACACTGAAGTGAAT
GAATCCATAATTACTCGTATATGGGAAAGAGCTTCAGAATTCTTTCCTACGATGAAAGAAGTGTCTTTTTCAGATATCAAAAGCAACAGTAAAGTGAGAATAGGTCTACG
ACCATACAGTGAGTATCATGCACTGGGGACTGCAGAAATGATTGCTAATATGGTGCTAGGAAGTCTTGTGAAAGTTAATCCTGCTACCTTTTCGGTACAAGGACGATGTT
AA
mRNA sequenceShow/hide mRNA sequence
ATGATTTCTTCTTCTTCTTCACTGCGTCCTTGCCCAAGCCCAAGTCCCTCTATCAGGACCAGCATAGCTTCATGTTCTAGTAGACATTTCTGCAACTTCGGATTCAATCC
AAAATGGCCGCTGATTAATCCTCACCAGTGGAGAAAATCGGTCGATAGGAGCCGTGGTCATGATTATAGGTTCGGACCGGTGTGCTTCTGTGCTTTGAAAGATGTGAAAT
CTTCATCTTCAACCTCTCGTAATGGCAATGCTTTTGAGTTCGATGTCGTGATCATTGGAGCGGGGATTATCGGGTTAACTATAGCGCGGCAGTTTCTTCTCGGGTCGGAT
TTGTCCGTCGCCGTCGTGGATAAGGCCGTGCCTTGCTCTGGAGCTACCGGTGCAGGGCAGGGCTATTTATGGATGGCGCACAAATCCCCTGGCAGTGACATTTGGGAACT
TGCTTTGAGAAGCCACAGGCTGTGGGAGGGTCTGGCTGAGTCCCTGCGTGATCAAGGCTTGAATCCATCGGAAGAATTGGGTTGGAAAAAGACAGGGAGCCTATTAATTG
GTAAAACACCTGAAGAGCTTGACATGTTAAAGAGGAAGGTAAAAGTACTAACTGGGGCCGGACTGGAAGCTGAATACTTGTCTGGTGCTGATCTGCTTTCGATGGAACCA
GCTCTGCTGATTGGGGACAGCTGTGGGGCTGCCTTTCTCCCCAATGACTGCCAATTGGATGCTCATCGTACTGCTGCCTTTATTGAAAAGGCAAACAGACATTTTAAAGG
AAGATATGCAGAGTTTTTTCATGACCCCGTTACAGGTTTATTGAGATACTCGCAACATCTATCAAAAAATTCTTTCTACAGCATAACCCTCGGAGTCCGTCAAACAGGAA
GCAGTAAGGATGTAATTTCTGGAGTGTGCGATGCAACTGTTGAGCCAGTTTGCCTTGATGCTGCTATTCTATCTGGTAGCAATGGGAAGATAGAAGCTGTTCAGACGTCC
AAGACTACATTGTATAGTAAGAAGGCCATTGTTTTGGCAGCTGGTTGTTGGAGTGGGACTTTGCTGCGTGATTTGCTCATGGAAGAAAAAACGGTTTTGGATGTTCCCAT
AATGCCAAGAAAGGGTCACTTGCTTGTCATCGAGAATTTTAACTCCCTTCATGTTAATCATGGTTTGATGGAGGTGGGCTATGTTAATCACCAAGCCTTAACTTCGGCTA
AAGATCTTGAGCAGACCTCATCCATTTCAATGACTGCTACTATGGATGTTCAAGGCAACCTAGTACTTGGGAGTAGTCGTGAGTTTGCTGGTTTCAACACTGAAGTGAAT
GAATCCATAATTACTCGTATATGGGAAAGAGCTTCAGAATTCTTTCCTACGATGAAAGAAGTGTCTTTTTCAGATATCAAAAGCAACAGTAAAGTGAGAATAGGTCTACG
ACCATACAGTGAGTATCATGCACTGGGGACTGCAGAAATGATTGCTAATATGGTGCTAGGAAGTCTTGTGAAAGTTAATCCTGCTACCTTTTCGGTACAAGGACGATGTT
AA
Protein sequenceShow/hide protein sequence
MISSSSSLRPCPSPSPSIRTSIASCSSRHFCNFGFNPKWPLINPHQWRKSVDRSRGHDYRFGPVCFCALKDVKSSSSTSRNGNAFEFDVVIIGAGIIGLTIARQFLLGSD
LSVAVVDKAVPCSGATGAGQGYLWMAHKSPGSDIWELALRSHRLWEGLAESLRDQGLNPSEELGWKKTGSLLIGKTPEELDMLKRKVKVLTGAGLEAEYLSGADLLSMEP
ALLIGDSCGAAFLPNDCQLDAHRTAAFIEKANRHFKGRYAEFFHDPVTGLLRYSQHLSKNSFYSITLGVRQTGSSKDVISGVCDATVEPVCLDAAILSGSNGKIEAVQTS
KTTLYSKKAIVLAAGCWSGTLLRDLLMEEKTVLDVPIMPRKGHLLVIENFNSLHVNHGLMEVGYVNHQALTSAKDLEQTSSISMTATMDVQGNLVLGSSREFAGFNTEVN
ESIITRIWERASEFFPTMKEVSFSDIKSNSKVRIGLRPYSEYHALGTAEMIANMVLGSLVKVNPATFSVQGRC