; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G014780 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G014780
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionSURF1-like protein
Genome locationCicolChr01:27612723..27616353
RNA-Seq ExpressionCcUC01G014780
SyntenyCcUC01G014780
Gene Ontology termsGO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR002994 - Surfeit locus 1/Shy1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572973.1 Surfeit locus protein 1, partial [Cucurbita argyrosperma subsp. sororia]3.8e-16687.21Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK
        MASSSFAKSI KFRP  SLS H +  TPLPSSS SFSS+A VSS  DP SSSLSQAQQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK

Query:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE
        RLLMEPVN+N+LLPL+++LDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGW PRTWKEKALEVD Q SE
Subjt:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE

Query:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP
        QSSD  PS+VQESERSSWWKFWSK T++ ENE++PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS
        IPKDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMA KRLRQK+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS

XP_004137509.1 surfeit locus protein 1 [Cucumis sativus]3.9e-17994.17Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL
        MASSS AKSITKFRPCFSLSGH STPLPSSSSSFSS+AVVSSTPDP+SSSLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL

Query:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS
        LMEPVNINNLL LE++LDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS

Query:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        SDI PSLVQ  ERSSWWKFWSKKTES ENEITPITP+EVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
Subjt:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN
        KDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMAFKRLRQK++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN

XP_008465733.1 PREDICTED: surfeit locus protein 1 [Cucumis melo]1.8e-17692.42Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL
        MASSS AKSITKFRPCFSLSGH STPLPSSSSSFSS+AVVSSTPDP+SSSLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL

Query:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS
        LMEPVNINNLL LE++LDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMP+PGLPDSVQSPVLVNRGWAPRTWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS

Query:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        S   PSLVQE ERSSWWKFWSKKTES ENEITPITP+EVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN
        KDVNTL RSSVMPQDHLNYTLTWY+LSAAVTFMAFKRLRQK++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN

XP_023541124.1 surfeit locus protein 1-like isoform X2 [Cucurbita pepo subsp. pepo]1.7e-16688.08Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK
        MASSSFAKSI KFRP  SLS H +  TPLPSSSSSFSS+A VSS  DP SSSLSQAQQKQR+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK

Query:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE
        RLLMEPVNIN+LLPL+++LDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPI GLPDSVQSPVLVNRGW PRTWKEKALEVD Q SE
Subjt:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE

Query:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP
        QSSDI PS+VQESERSSWWKFWSK T++ ENE++PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS
        IPKDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMA KRLRQK+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS

XP_038895168.1 surfeit locus protein 1-like [Benincasa hispida]2.6e-17592.42Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL
        MASSS AKSITKFRP FSLSGHCSTPLP SSSSFSS+AVVSS PDP+SS LSQAQQKQRESR SKWLLFLPGALTFGLGTWQI RRQ+KIE+LDYRQKRL
Subjt:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL

Query:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS
        LM+PVNINNLLPLE++LDDLEFRRVICKGVFDEKKS YVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQ SEQS
Subjt:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS

Query:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        SDI PSLVQE+ RSSWWKFWSKKTES ENE TPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN
        KDVNTLIRSSVMPQDHLNYTLTWY LSAAVTFMAFKRL QK++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN

TrEMBL top hitse value%identityAlignment
A0A0A0LVW3 SURF1-like protein1.9e-17994.17Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL
        MASSS AKSITKFRPCFSLSGH STPLPSSSSSFSS+AVVSSTPDP+SSSLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL

Query:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS
        LMEPVNINNLL LE++LDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS

Query:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        SDI PSLVQ  ERSSWWKFWSKKTES ENEITPITP+EVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
Subjt:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN
        KDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMAFKRLRQK++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN

A0A1S3CPJ6 SURF1-like protein8.8e-17792.42Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL
        MASSS AKSITKFRPCFSLSGH STPLPSSSSSFSS+AVVSSTPDP+SSSLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL

Query:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS
        LMEPVNINNLL LE++LDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMP+PGLPDSVQSPVLVNRGWAPRTWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS

Query:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        S   PSLVQE ERSSWWKFWSKKTES ENEITPITP+EVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN
        KDVNTL RSSVMPQDHLNYTLTWY+LSAAVTFMAFKRLRQK++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN

A0A5D3BJ55 SURF1-like protein8.8e-17792.42Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL
        MASSS AKSITKFRPCFSLSGH STPLPSSSSSFSS+AVVSSTPDP+SSSLSQ QQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYR+KRL
Subjt:  MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRL

Query:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS
        LMEPVNINNLL LE++LDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMP+PGLPDSVQSPVLVNRGWAPRTWKEKALEV+QQ SEQS
Subjt:  LMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQS

Query:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP
        S   PSLVQE ERSSWWKFWSKKTES ENEITPITP+EVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAIARSSGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN
        KDVNTL RSSVMPQDHLNYTLTWY+LSAAVTFMAFKRLRQK++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKSN

A0A6J1GUD0 SURF1-like protein4.5e-16586.92Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK
        MASSSFAKSI KFRP   LS H +  TPLPSSS SFSS+A VSS  DP SSSLSQAQQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEK E+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK

Query:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE
        RLLMEPVNIN+LLPL+++LDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGW PRTWKEKALEVD Q SE
Subjt:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE

Query:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP
        QSSD  PS+VQESERSSWWKFWSK T++ ENE++PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS
        IPKDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMA KRLRQK+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS

A0A6J1JYJ0 SURF1-like protein1.3e-16486.67Show/hide
Query:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK
        MASSSFAKSI KFRP  SL+ H +  TPLPSSS SFSS+A VSS  DP SSSLSQAQQK+R+S+LSKWLLFLPGALTFGLGTWQIFRRQEKIE+LDYRQK
Subjt:  MASSSFAKSITKFRPCFSLSGHCS--TPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQK

Query:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE
        RLLMEPVN+N+LLPL+++LDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGW PRTWKEKALEVD Q  E
Subjt:  RLLMEPVNINNLLPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSE

Query:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPI-TPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPY
        QSSDI PS+VQESERSSWWKFWSK T++ ENE++PI TP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVPAIAR+SGLPED IYVEDINENVNPS+PY
Subjt:  QSSDIGPSLVQESERSSWWKFWSKKTESQENEITPI-TPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS
        PIPKDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMA KRLRQK+
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS

SwissProt top hitse value%identityAlignment
P09925 Surfeit locus protein 13.7e-2327.12Show/hide
Query:  SSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLE-NELDDLEFRRVICKGVFDEKKSIYVGPRS---
        SS+   A  K  +    +W L L  A  FGLGTWQ+ RR+ K++L+   + R++ EP+     LP +  EL +LE+R V  +G FD  K +Y+ PR+   
Subjt:  SSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLE-NELDDLEFRRVICKGVFDEKKSIYVGPRS---

Query:  -------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITP
                     TE+G +V+TP          +   +LVNRG+ PR       +V+ ++ ++   +G                                
Subjt:  -------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTESQENEITPITP

Query:  IEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFK
        ++++G+VR +E    FVP N P    W+Y D+ A+A+ +G   D I+++    +  P    PI       +R+     +H+ Y LTWY L AA +++ F+
Subjt:  IEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAFK

Query:  RLRQKS
        +  +++
Subjt:  RLRQKS

Q15526 Surfeit locus protein 19.8e-2429.14Show/hide
Query:  SSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLE-NELDDLEFRRVICKGVFDEKKSIYVGPRSR--
        SS+   +  K  +    +W+L L     FGLGTWQ+ RR+ K+ L+   + R+L EPV     LP +  EL +LE+R V  +G FD  K +Y+ PR+   
Subjt:  SSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLE-NELDDLEFRRVICKGVFDEKKSIYVGPRSR--

Query:  ---------SISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTESQENEITPIT
                  IS  T++G YV+TP          +   +LVNRG+ PR                   + P   Q+ +                       
Subjt:  ---------SISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTESQENEITPIT

Query:  PIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAF
         +++IG+VR +E    FVP N+P    W Y D+ A+AR +G   + I+++   ++  P    PI       +R+     +HL Y +TWY LSAA +++ F
Subjt:  PIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAVTFMAF

Query:  KR
        K+
Subjt:  KR

Q9LP74 Surfeit locus protein 1-like4.1e-5441.67Show/hide
Query:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG
        SSS+ S+    S T +  S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N       +LD L FRRV+CKG
Subjt:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALE---VDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTES
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE        +        + +  S++S   KFW K    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALE---VDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTES

Query:  Q--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY
           E++++    +EV+GVVR SE P I+   N P S  WFY+DVP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P D+  YT+ W+
Subjt:  Q--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY

Q9QXU2 Surfeit locus protein 12.4e-2227Show/hide
Query:  CSTPLPSSS---SSFSSSAVVSSTPDPH---SSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLE-N
        C+ P P  +   S F  S        PH   SS+   A  K  +    +W L    A  FGLGTWQ+ RR+ K++L+   + R++ EP+     LP +  
Subjt:  CSTPLPSSS---SSFSSSAVVSSTPDPH---SSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLE-N

Query:  ELDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGP
        EL +LE+R V  +G FD  K +Y+ PR+                TE+G YV+TP          +   +LVNRG+ PR       +V+ ++ +Q   +G 
Subjt:  ELDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGP

Query:  SLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNT
                                       ++++G+VR +E    FVP N+P    W+Y D+ A+A+ +G   D I+++    +  P    PI      
Subjt:  SLVQESERSSWWKFWSKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNT

Query:  LIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS
         +R+     +H+ Y +TWY L AA +++ F++  +++
Subjt:  LIRSSVMPQDHLNYTLTWYTLSAAVTFMAFKRLRQKS

Q9SE51 Surfeit locus protein 15.6e-10460.95Show/hide
Query:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG
        + SS SSSA + S     SSS +  Q+ +R S+ S+ LLFLPGA+TFGLG+WQI RR+EK + L+Y+Q+RL MEP+ +N   PL+  L+ LEFRRV CKG
Subjt:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQES---ERSSWWKFWSKKTES
        VFDE++SIY+GPRSRSISG+TENG +VITPLMPIPG  DS+QSP+LVNRGW PR+W+EK+    Q+S+E       S   +S   E  SWWKFWSK    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQES---ERSSWWKFWSKKTES

Query:  QENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTL
         +  I+ + P+EV+GV+R  E PSIFVP+NDP + QWFYVDVPA+AR+ GLPE+TIYVED++E+V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWY+L
Subjt:  QENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTL

Query:  SAAVTFMAFKRLRQK
        SAAVTFMA+KRL+ K
Subjt:  SAAVTFMAFKRLRQK

Arabidopsis top hitse value%identityAlignment
AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein2.9e-5541.67Show/hide
Query:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG
        SSS+ S+    S T +  S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N       +LD L FRRV+CKG
Subjt:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALE---VDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTES
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE        +        + +  S++S   KFW K    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALE---VDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTES

Query:  Q--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY
           E++++    +EV+GVVR SE P I+   N P S  WFY+DVP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P D+  YT+ W+
Subjt:  Q--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY

AT1G48510.2 Surfeit locus 1 cytochrome c oxidase biogenesis protein1.8e-5241.87Show/hide
Query:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG
        SSS+ S+    S T +  S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N       +LD L FRRV+CKG
Subjt:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALE---VDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTES
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE        +        + +  S++S   KFW K    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALE---VDQQSSEQSSDIGPSLVQESERSSWWKFWSKKTES

Query:  Q--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP
           E++++    +EV+GVVR SE P I+   N P S  WFY+DVP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS+ +P
Subjt:  Q--ENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP

AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein4.0e-10560.95Show/hide
Query:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG
        + SS SSSA + S     SSS +  Q+ +R S+ S+ LLFLPGA+TFGLG+WQI RR+EK + L+Y+Q+RL MEP+ +N   PL+  L+ LEFRRV CKG
Subjt:  SSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNLLPLENELDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQES---ERSSWWKFWSKKTES
        VFDE++SIY+GPRSRSISG+TENG +VITPLMPIPG  DS+QSP+LVNRGW PR+W+EK+    Q+S+E       S   +S   E  SWWKFWSK    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQES---ERSSWWKFWSKKTES

Query:  QENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTL
         +  I+ + P+EV+GV+R  E PSIFVP+NDP + QWFYVDVPA+AR+ GLPE+TIYVED++E+V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWY+L
Subjt:  QENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTL

Query:  SAAVTFMAFKRLRQK
        SAAVTFMA+KRL+ K
Subjt:  SAAVTFMAFKRLRQK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTTCCTTCGCTAAATCTATAACAAAATTTCGCCCTTGTTTTTCCCTTTCTGGGCACTGTTCCACGCCTTTACCTTCATCTTCTTCCTCGTTCAGCTCTTC
CGCTGTAGTTTCTTCTACTCCCGATCCCCACTCATCTTCCCTTTCGCAAGCTCAACAGAAACAAAGAGAGTCGAGATTGTCAAAATGGCTCCTGTTTCTACCTGGTGCTC
TCACGTTTGGCCTTGGAACATGGCAGATTTTCAGGAGGCAAGAGAAGATAGAATTGTTAGATTACCGGCAGAAGCGATTGTTAATGGAACCTGTGAACATAAACAACTTA
TTGCCATTGGAAAACGAGCTCGATGATCTGGAGTTCAGGAGAGTGATCTGTAAAGGAGTTTTTGATGAGAAAAAATCAATTTATGTTGGTCCACGCTCGAGAAGCATTTC
AGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGATGCCAATTCCTGGCCTTCCTGATAGTGTGCAGTCGCCAGTTCTGGTTAATAGAGGCTGGGCTCCACGCA
CTTGGAAGGAGAAGGCTTTAGAAGTAGATCAACAAAGTAGTGAACAGTCTTCAGATATTGGACCTTCCTTGGTTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGG
TCAAAAAAGACTGAGAGTCAAGAGAATGAAATCACTCCCATCACTCCAATAGAAGTCATCGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGA
TCCTGGCTCTAGGCAGTGGTTTTATGTTGATGTTCCAGCAATTGCACGCTCTTCTGGTCTACCTGAGGATACTATTTATGTTGAAGACATAAATGAGAATGTGAACCCAA
GTGACCCGTATCCCATTCCAAAGGATGTTAATACCTTGATACGGAGTTCCGTTATGCCACAGGATCATCTGAATTACACGTTAACATGGTACACTCTCTCAGCTGCTGTA
ACCTTTATGGCTTTCAAAAGACTGAGGCAAAAATCAAATTGA
mRNA sequenceShow/hide mRNA sequence
AAATAAAGAAGACCAATAAGGAAATATTAAAGAATGCATAAACCCTTTAACCCTCAGAATTTGAGTTTTTGGACCCCAACAGAGAAGAACAGAGCAGCCAGGTGAAGGCC
AAGACTGAGAAACTCTCCAAATGTCAGCATCAAGAAGATGGCTTCTTCTTCCTTCGCTAAATCTATAACAAAATTTCGCCCTTGTTTTTCCCTTTCTGGGCACTGTTCCA
CGCCTTTACCTTCATCTTCTTCCTCGTTCAGCTCTTCCGCTGTAGTTTCTTCTACTCCCGATCCCCACTCATCTTCCCTTTCGCAAGCTCAACAGAAACAAAGAGAGTCG
AGATTGTCAAAATGGCTCCTGTTTCTACCTGGTGCTCTCACGTTTGGCCTTGGAACATGGCAGATTTTCAGGAGGCAAGAGAAGATAGAATTGTTAGATTACCGGCAGAA
GCGATTGTTAATGGAACCTGTGAACATAAACAACTTATTGCCATTGGAAAACGAGCTCGATGATCTGGAGTTCAGGAGAGTGATCTGTAAAGGAGTTTTTGATGAGAAAA
AATCAATTTATGTTGGTCCACGCTCGAGAAGCATTTCAGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGATGCCAATTCCTGGCCTTCCTGATAGTGTGCAG
TCGCCAGTTCTGGTTAATAGAGGCTGGGCTCCACGCACTTGGAAGGAGAAGGCTTTAGAAGTAGATCAACAAAGTAGTGAACAGTCTTCAGATATTGGACCTTCCTTGGT
TCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGGTCAAAAAAGACTGAGAGTCAAGAGAATGAAATCACTCCCATCACTCCAATAGAAGTCATCGGAGTAGTTCGCA
CAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGATCCTGGCTCTAGGCAGTGGTTTTATGTTGATGTTCCAGCAATTGCACGCTCTTCTGGTCTACCTGAGGATACT
ATTTATGTTGAAGACATAAATGAGAATGTGAACCCAAGTGACCCGTATCCCATTCCAAAGGATGTTAATACCTTGATACGGAGTTCCGTTATGCCACAGGATCATCTGAA
TTACACGTTAACATGGTACACTCTCTCAGCTGCTGTAACCTTTATGGCTTTCAAAAGACTGAGGCAAAAATCAAATTGAAGATAACGAAAACTCTTGCTTGACCTTCTAT
AGAGCAGGAATGAACCCGGAACTCATAAATTGTTTCTGTGAGTTATTTTTGTTGGTATTGATTTTCATAATGCTTTTCACGAGAGAAGCCCGAATACCTCATTTCCTAAG
GACTTTTTGAACTCGAAAATGCTCCAAAATCCTCAATCCCTTTAGTGATACAGCATGTGGCTAGATAACAAGTTTTATGCAACCATAAAAACTGAGCAAACATTTTGATA
TTTTCATAACTAATAAATTTGTTACTCAATATGGAACATTCATGTTTCTGGTGTTTGAATCGTGAATTTCTTG
Protein sequenceShow/hide protein sequence
MASSSFAKSITKFRPCFSLSGHCSTPLPSSSSSFSSSAVVSSTPDPHSSSLSQAQQKQRESRLSKWLLFLPGALTFGLGTWQIFRRQEKIELLDYRQKRLLMEPVNINNL
LPLENELDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVDQQSSEQSSDIGPSLVQESERSSWWKFW
SKKTESQENEITPITPIEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYTLSAAV
TFMAFKRLRQKSN