; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020732 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020732
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSURF1-like protein
Genome locationChr05:1934748..1938093
RNA-Seq ExpressionHG10020732
SyntenyHG10020732
Gene Ontology termsGO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR002994 - Surfeit locus 1/Shy1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572973.1 Surfeit locus protein 1, partial [Cucurbita argyrosperma subsp. sororia]9.7e-17088.18Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK
        MASSSFAKSI KFRP  SLSRH +  TPLPSSS SFSSAA VSSA DP S+SLSQAQQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK

Query:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE
        RLLMEPVN+N+LLPL DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PIPGLPDSVQSPVLVNRGW P TWKEKALEVD Q SE
Subjt:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE

Query:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP
        QSSD  PS+VQESERSSWWKFWSK T++LENE++PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_004137509.1 surfeit locus protein 1 [Cucumis sativus]3.2e-18195.07Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL
        MASSS AKSITKFRPCFSLS H STPLPSSSSSFSSAAVVSS PDP+S+SLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL

Query:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS
        LMEPVNINNLL L DKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+PIPGLPDSVQSPVLVNRGWAP TWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS

Query:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        SDIVPSLVQ  ERSSWWKFWSKKTESLENEITPITP+EVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
Subjt:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_008465733.1 PREDICTED: surfeit locus protein 1 [Cucumis melo]1.5e-17893.33Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL
        MASSS AKSITKFRPCFSLS H STPLPSSSSSFSSAAVVSS PDP+S+SLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL

Query:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS
        LMEPVNINNLL L DKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+P+PGLPDSVQSPVLVNRGWAP TWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS

Query:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        S  VPSLVQE ERSSWWKFWSKKTESLENEITPITP+EVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_023541124.1 surfeit locus protein 1-like isoform X2 [Cucurbita pepo subsp. pepo]4.3e-17089.05Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK
        MASSSFAKSI KFRP  SLSRH +  TPLPSSSSSFSSAA VSSA DP S+SLSQAQQKQR+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK

Query:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE
        RLLMEPVNIN+LLPL DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PI GLPDSVQSPVLVNRGW P TWKEKALEVD Q SE
Subjt:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE

Query:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP
        QSSDI PS+VQESERSSWWKFWSK T++LENE++PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_038895168.1 surfeit locus protein 1-like [Benincasa hispida]3.3e-17893.62Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL
        MASSS AKSITKFRP FSLS HCSTPLP SSSSFSSAAVVSSAPDP+S+ LSQAQQKQRESR SKWLLFLPGALTFGLGTWQI RRQ+KIEMLDYRQKRL
Subjt:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL

Query:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS
        LM+PVNINNLLPL DKLDDLEFRRVICKGVFDEKKS YVGPRSRSISGVTENGHYVITPL+PIPGLPDSVQSPVLVNRGWAP TWKEKALEVDQQ SEQS
Subjt:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS

Query:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        SDIVPSLVQE+ RSSWWKFWSKKTESLENE TPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMAFKRL QKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

TrEMBL top hitse value%identityAlignment
A0A0A0LVW3 SURF1-like protein1.6e-18195.07Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL
        MASSS AKSITKFRPCFSLS H STPLPSSSSSFSSAAVVSS PDP+S+SLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL

Query:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS
        LMEPVNINNLL L DKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+PIPGLPDSVQSPVLVNRGWAP TWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS

Query:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        SDIVPSLVQ  ERSSWWKFWSKKTESLENEITPITP+EVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
Subjt:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A1S3CPJ6 SURF1-like protein7.2e-17993.33Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL
        MASSS AKSITKFRPCFSLS H STPLPSSSSSFSSAAVVSS PDP+S+SLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL

Query:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS
        LMEPVNINNLL L DKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+P+PGLPDSVQSPVLVNRGWAP TWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS

Query:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        S  VPSLVQE ERSSWWKFWSKKTESLENEITPITP+EVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A5D3BJ55 SURF1-like protein7.2e-17993.33Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL
        MASSS AKSITKFRPCFSLS H STPLPSSSSSFSSAAVVSS PDP+S+SLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRL

Query:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS
        LMEPVNINNLL L DKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+P+PGLPDSVQSPVLVNRGWAP TWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQS

Query:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        S  VPSLVQE ERSSWWKFWSKKTESLENEITPITP+EVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A6J1GUD0 SURF1-like protein1.2e-16887.9Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK
        MASSSFAKSI KFRP   LSRH +  TPLPSSS SFSSAA VSSA DP S+SLSQAQQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEK E+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK

Query:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE
        RLLMEPVNIN+LLPL DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PIPGLPDSVQSPVLVNRGW P TWKEKALEVD Q SE
Subjt:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE

Query:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP
        QSSD  PS+VQESERSSWWKFWSK T++LENE++PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A6J1JYJ0 SURF1-like protein3.4e-16887.64Show/hide
Query:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK
        MASSSFAKSI KFRP  SL+RH +  TPLPSSS SFSSAA VSSA DP S+SLSQAQQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSRHCS--TPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQK

Query:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE
        RLLMEPVN+N+LLPL DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PIPGLPDSVQSPVLVNRGW P TWKEKALEVD Q  E
Subjt:  RLLMEPVNINNLLPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSE

Query:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPI-TPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPY
        QSSDI PS+VQESERSSWWKFWSK T++LENE++PI TP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PY
Subjt:  QSSDIVPSLVQESERSSWWKFWSKKTESLENEITPI-TPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

SwissProt top hitse value%identityAlignment
P09925 Surfeit locus protein 12.0e-2126.14Show/hide
Query:  STSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGD-KLDDLEFRRVICKGVFDEKKSIYVGPRS---
        S++   A  K  +    +W L L  A  FGLGTWQ+ RR+ K++++   + R++ EP+     LP    +L +LE+R V  +G FD  K +Y+ PR+   
Subjt:  STSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGD-KLDDLEFRRVICKGVFDEKKSIYVGPRS---

Query:  -------RSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITP
                     TE+G +V+TP          +   +LVNRG+ P   ++K     +Q  +   +                                  
Subjt:  -------RSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIVPSLVQESERSSWWKFWSKKTESLENEITPITP

Query:  IEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFK
        ++++G+VR +E    FVP N P    W+Y D+ A+A+ +G   D I+++    +  P    PI       +R+     +H+ Y LTWY L AA +++ F+
Subjt:  IEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFK

Query:  RLRQKT
        +  ++T
Subjt:  RLRQKT

Q15526 Surfeit locus protein 18.3e-2328.34Show/hide
Query:  STSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGD-KLDDLEFRRVICKGVFDEKKSIYVGPRSR--
        S++   +  K  +    +W+L L     FGLGTWQ+ RR+ K+ ++   + R+L EPV     LP    +L +LE+R V  +G FD  K +Y+ PR+   
Subjt:  STSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGD-KLDDLEFRRVICKGVFDEKKSIYVGPRSR--

Query:  ---------SISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIVPSLVQESERSSWWKFWSKKTESLENEITPIT
                  IS  T++G YV+TP          +   +LVNRG+ P                    + P   Q+ +              +E E     
Subjt:  ---------SISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIVPSLVQESERSSWWKFWSKKTESLENEITPIT

Query:  PIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAF
         +++IG+VR +E    FVP N+P    W Y D+ A+AR +G   + I+++   ++  P    PI       +R+     +HL Y +TWY LSAA +++ F
Subjt:  PIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAF

Query:  KRLRQKT
        K+  + T
Subjt:  KRLRQKT

Q9LP74 Surfeit locus protein 1-like4.1e-5439.38Show/hide
Query:  SSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDDLEFRRVICKG
        SSS+ S+    S   +  S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N        LD L FRRV+CKG
Subjt:  SSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE--------VDQQSSEQSSDIVPSLVQESERSSWWKFWS
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE           + S +++ ++      S++S   KFW 
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE--------VDQQSSEQSSDIVPSLVQESERSSWWKFWS

Query:  KKTESL--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNY
        K    +  E++++    +EV+GVVR SE P I+   N P S  WFY+DVP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P D+  Y
Subjt:  KKTESL--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNY

Query:  TLTWYSLSAAVTFMAFKRLRQKTSR
        T+ W+  S      A   L ++ ++
Subjt:  TLTWYSLSAAVTFMAFKRLRQKTSR

Q9QXU2 Surfeit locus protein 11.0e-2026.63Show/hide
Query:  CSTPLPSSS---SSFSSAAVVSSAPDPH----STSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGD
        C+ P P  +   S F  +        PH    ST+ + A + + +S L  +LLF+P A  FGLGTWQ+ RR+ K++++   + R++ EP+     LP   
Subjt:  CSTPLPSSS---SSFSSAAVVSSAPDPH----STSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGD

Query:  -KLDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIV
         +L +LE+R V  +G FD  K +Y+ PR+                TE+G YV+TP          +   +LVNRG+ P        +V+ ++ +Q     
Subjt:  -KLDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIV

Query:  PSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVN
                                     +  ++++G+VR +E    FVP N+P    W+Y D+ A+A+ +G   D I+++    +  P    PI     
Subjt:  PSLVQESERSSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVN

Query:  TLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT
          +R+     +H+ Y +TWY L AA +++ F++  ++T
Subjt:  TLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT

Q9SE51 Surfeit locus protein 17.3e-10459.34Show/hide
Query:  SRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDD
        SRH S    SSSSS       S+A    S+S +  Q+ +R S+ S+ LLFLPGA+TFGLG+WQI RR+EK + L+Y+Q+RL MEP+ +N   PL   L+ 
Subjt:  SRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDD

Query:  LEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE------VDQQSSEQSSDIVPSLVQESER
        LEFRRV CKGVFDE++SIY+GPRSRSISG+TENG +VITPL+PIPG  DS+QSP+LVNRGW P +W+EK+ E      +  QS++  S   PS    +E 
Subjt:  LEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE------VDQQSSEQSSDIVPSLVQESER

Query:  SSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP
         SWWKFWSK     +  I+ + P+EV+GV+R  E PSIFVP+NDP + QWFYVDVPA+AR+ GLPE+TIYVED++E+V+ S PYP+PKD+NTLIRS VMP
Subjt:  SSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP

Query:  QDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        QDHLNY++TWYSLSAAVTFMA+KRL+ K  RR
Subjt:  QDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

Arabidopsis top hitse value%identityAlignment
AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein2.9e-5539.38Show/hide
Query:  SSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDDLEFRRVICKG
        SSS+ S+    S   +  S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N        LD L FRRV+CKG
Subjt:  SSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE--------VDQQSSEQSSDIVPSLVQESERSSWWKFWS
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE           + S +++ ++      S++S   KFW 
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE--------VDQQSSEQSSDIVPSLVQESERSSWWKFWS

Query:  KKTESL--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNY
        K    +  E++++    +EV+GVVR SE P I+   N P S  WFY+DVP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P D+  Y
Subjt:  KKTESL--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNY

Query:  TLTWYSLSAAVTFMAFKRLRQKTSR
        T+ W+  S      A   L ++ ++
Subjt:  TLTWYSLSAAVTFMAFKRLRQKTSR

AT1G48510.2 Surfeit locus 1 cytochrome c oxidase biogenesis protein1.0e-5241.16Show/hide
Query:  SSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDDLEFRRVICKG
        SSS+ S+    S   +  S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N        LD L FRRV+CKG
Subjt:  SSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE--------VDQQSSEQSSDIVPSLVQESERSSWWKFWS
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE           + S +++ ++      S++S   KFW 
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE--------VDQQSSEQSSDIVPSLVQESERSSWWKFWS

Query:  KKTESL--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP
        K    +  E++++    +EV+GVVR SE P I+   N P S  WFY+DVP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS+ +P
Subjt:  KKTESL--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP

AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein5.2e-10559.34Show/hide
Query:  SRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDD
        SRH S    SSSSS       S+A    S+S +  Q+ +R S+ S+ LLFLPGA+TFGLG+WQI RR+EK + L+Y+Q+RL MEP+ +N   PL   L+ 
Subjt:  SRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNLLPLGDKLDD

Query:  LEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE------VDQQSSEQSSDIVPSLVQESER
        LEFRRV CKGVFDE++SIY+GPRSRSISG+TENG +VITPL+PIPG  DS+QSP+LVNRGW P +W+EK+ E      +  QS++  S   PS    +E 
Subjt:  LEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALE------VDQQSSEQSSDIVPSLVQESER

Query:  SSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP
         SWWKFWSK     +  I+ + P+EV+GV+R  E PSIFVP+NDP + QWFYVDVPA+AR+ GLPE+TIYVED++E+V+ S PYP+PKD+NTLIRS VMP
Subjt:  SSWWKFWSKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP

Query:  QDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        QDHLNY++TWYSLSAAVTFMA+KRL+ K  RR
Subjt:  QDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTTCTTCCTTCGCTAAATCCATAACAAAATTTCGCCCTTGTTTTTCCCTTTCTAGGCACTGTTCGACGCCTTTACCTTCATCTTCTTCCTCGTTCAGCTCTGC
CGCTGTAGTTTCTTCTGCTCCTGATCCCCACTCAACTTCCCTTTCGCAAGCTCAACAGAAACAAAGAGAGTCGAGATTGTCAAAATGGCTCCTGTTTCTACCTGGTGCTC
TCACGTTTGGTCTTGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGATAGAAATGCTAGATTACCGGCAGAAGCGATTGTTAATGGAACCTGTGAACATAAACAACTTA
CTGCCATTGGGAGACAAGCTGGATGATCTGGAGTTCAGGAGGGTGATCTGTAAAGGAGTTTTTGATGAGAAAAAGTCAATTTATGTTGGTCCACGTTCGAGAAGCATTTC
AGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGGTGCCAATTCCTGGCCTTCCTGATAGTGTGCAGTCACCAGTTCTGGTTAATAGAGGCTGGGCTCCATGCA
CTTGGAAGGAGAAGGCTTTAGAAGTAGATCAACAAAGTAGTGAACAGTCTTCAGATATTGTACCTTCCTTGGTTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGG
TCAAAAAAGACTGAGAGTCTAGAGAATGAAATTACTCCCATTACTCCAATTGAAGTCATCGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGA
TCCTGGCTCTAGGCAGTGGTTTTATGTTGATGTTCCAGCAATCGCACGCTCTTCTGGTCTACCTGAGGATACTATTTATGTTGAAGACATAAATGAGAATGTGAACCCAA
GTGATCCTTATCCCATTCCGAAGGATGTTAATACCTTGATACGGAGTTCCGTTATGCCACAGGATCATCTGAATTACACGTTAACATGGTACTCTCTCTCAGCTGCTGTA
ACCTTTATGGCTTTCAAAAGACTCAGGCAAAAAACAAGTCGAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTTCTTCCTTCGCTAAATCCATAACAAAATTTCGCCCTTGTTTTTCCCTTTCTAGGCACTGTTCGACGCCTTTACCTTCATCTTCTTCCTCGTTCAGCTCTGC
CGCTGTAGTTTCTTCTGCTCCTGATCCCCACTCAACTTCCCTTTCGCAAGCTCAACAGAAACAAAGAGAGTCGAGATTGTCAAAATGGCTCCTGTTTCTACCTGGTGCTC
TCACGTTTGGTCTTGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGATAGAAATGCTAGATTACCGGCAGAAGCGATTGTTAATGGAACCTGTGAACATAAACAACTTA
CTGCCATTGGGAGACAAGCTGGATGATCTGGAGTTCAGGAGGGTGATCTGTAAAGGAGTTTTTGATGAGAAAAAGTCAATTTATGTTGGTCCACGTTCGAGAAGCATTTC
AGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGGTGCCAATTCCTGGCCTTCCTGATAGTGTGCAGTCACCAGTTCTGGTTAATAGAGGCTGGGCTCCATGCA
CTTGGAAGGAGAAGGCTTTAGAAGTAGATCAACAAAGTAGTGAACAGTCTTCAGATATTGTACCTTCCTTGGTTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGG
TCAAAAAAGACTGAGAGTCTAGAGAATGAAATTACTCCCATTACTCCAATTGAAGTCATCGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGA
TCCTGGCTCTAGGCAGTGGTTTTATGTTGATGTTCCAGCAATCGCACGCTCTTCTGGTCTACCTGAGGATACTATTTATGTTGAAGACATAAATGAGAATGTGAACCCAA
GTGATCCTTATCCCATTCCGAAGGATGTTAATACCTTGATACGGAGTTCCGTTATGCCACAGGATCATCTGAATTACACGTTAACATGGTACTCTCTCTCAGCTGCTGTA
ACCTTTATGGCTTTCAAAAGACTCAGGCAAAAAACAAGTCGAAGATAA
Protein sequenceShow/hide protein sequence
MASSSFAKSITKFRPCFSLSRHCSTPLPSSSSSFSSAAVVSSAPDPHSTSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDYRQKRLLMEPVNINNL
LPLGDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPGLPDSVQSPVLVNRGWAPCTWKEKALEVDQQSSEQSSDIVPSLVQESERSSWWKFW
SKKTESLENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAV
TFMAFKRLRQKTSRR