CuGenDBv2

Sed0009200 (gene) of Chayote v1 genome

Gene ID: Sed0009200
Organism: Sechium edule (Chayote v1)
Description: SURF1-like protein
Genome location: LG01:12980634..12986325
RNA-Seq Expression: Sed0009200
Synteny: Sed0009200
Gene Ontology terms: GO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR002994 - Surfeit locus 1/Shy1
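The genome location above is written as `scaffold:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the string can be split and the span measured as follows. The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


scaffold, start, end = parse_location("LG01:12980634..12986325")
print(scaffold, start, end, span_length(start, end))
```

Under that assumption the gene spans 5692 bp on LG01, which is longer than the mRNA listed further down, as expected for a gene with introns.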


Homology
GenBank top hits | e value | %identity
KAG6572973.1 Surfeit locus protein 1, partial [Cucurbita argyrosperma subsp. sororia] | 4.3e-151 | 80.69
Query:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK
        MASSS AKSI KFR  + LS H +  T LPSSS SFSSAAAVS          ++ QQK RDS  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYRQK
Subjt:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK

Query:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE
        RLL EPVN++ L PL+DKLDDLEFRRV+CKGVFDE KSI+VGPRSRSISGVTENGHYVITPLMPIP LPDSVQSPVLVNRGWVPRTWKEKALE D Q SE
Subjt:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE

Query:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYP
        +SS     +VQE+E+SSWWKFWSK T+NLENEV+PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IAR+  +PEDAIYVEDINE+VNPSNPYP
Subjt:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_004137509.1 surfeit locus protein 1 [Cucumis sativus] | 1.0e-152 | 82.03
Query:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL
        MASSS AKSITKFR    LSGH ST LPSSSSSFSSAA VS          ++ QQK R+S  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYR+KRL
Subjt:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL

Query:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS
        L EPVNI+ L  LEDKLDDLEFRRV+CKGVFDE KSIYVGPRSRSISGVTENGHYVITPLMPIP LPDSVQSPVLVNRGW PRTWKEKALE ++Q SE+S
Subjt:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS

Query:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP
        S     LVQ  E+SSWWKFWSKKT++LENE+TPITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IARS  +PED IYVEDINE+VNPS+PYPIP
Subjt:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_008465733.1 PREDICTED: surfeit locus protein 1 [Cucumis melo] | 1.9e-151 | 80.87
Query:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL
        MASSS AKSITKFR    LSGH ST LPSSSSSFSSAA VS          ++ QQK R+S  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYR+KRL
Subjt:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL

Query:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS
        L EPVNI+ L  LEDKLDDLEFRRV+CKGVFDE KSIYVGPRSRSISGVTENGHYVITPLMP+P LPDSVQSPVLVNRGW PRTWKEKALE ++Q SE+S
Subjt:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS

Query:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP
        S     LVQE E+SSWWKFWSKKT++LENE+TPITP+EVIGV+RTSEKPSIFVPANDP S QWFYVDVP IARS  +PED  YVEDINE+VNPS+PYPIP
Subjt:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_023541124.1 surfeit locus protein 1-like isoform X2 [Cucurbita pepo subsp. pepo] | 1.9e-151 | 81.27
Query:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK
        MASSS AKSI KFR F+ LS H +  T LPSSSSSFSSAAAVS          ++ QQK RDS  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYRQK
Subjt:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK

Query:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE
        RLL EPVNI+ L PL+DKLDDLEFRRV+CKGVFDE KSI+VGPRSRSISGVTENGHYVITPLMPI  LPDSVQSPVLVNRGWVPRTWKEKALE D Q SE
Subjt:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE

Query:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYP
        +SS     +VQE+E+SSWWKFWSK T+NLENEV+PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IAR+  +PEDAIYVEDINE+VNPSNPYP
Subjt:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

XP_038895168.1 surfeit locus protein 1-like [Benincasa hispida] | 7.1e-154 | 82.03
Query:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL
        MASSS AKSITKFR    LSGHCST LP SSSSFSSAA VS          ++ QQK R+S WSKWLLFLPGALTFGLGTWQI RRQDKIE+LDYRQKRL
Subjt:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL

Query:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS
        L +PVNI+ L PLEDKLDDLEFRRV+CKGVFDE KS YVGPRSRSISGVTENGHYVITPLMPIP LPDSVQSPVLVNRGW PRTWKEKALE D+Q SE+S
Subjt:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS

Query:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP
        S     LVQE  +SSWWKFWSKKT++LENE TPITPIEVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IARS  +PED  YVEDINE+VNPS+PYPIP
Subjt:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTLIRSSVMPQDHLNYTLTWY+LSAAVTFMAFKRL QKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
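
In the alignment blocks above, the unlabeled middle line appears to follow the usual BLAST-style convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch under that assumption (the helper name `identity_from_midline` is hypothetical, and the query and middle lines must first be trimmed to the same length), the percent identity of a single block can be estimated by counting letters in the middle line:

```python
def identity_from_midline(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a residue letter."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Toy block: 7 of 8 columns are identical.
print(identity_from_midline("MASSSAAK", "MASSS AK"))
```

The %identity reported for each hit is computed over the whole alignment, so a value from one block will generally differ from it.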

TrEMBL top hits | e value | %identity
A0A0A0LVW3 SURF1-like protein | 5.0e-153 | 82.03
Query:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL
        MASSS AKSITKFR    LSGH ST LPSSSSSFSSAA VS          ++ QQK R+S  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYR+KRL
Subjt:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL

Query:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS
        L EPVNI+ L  LEDKLDDLEFRRV+CKGVFDE KSIYVGPRSRSISGVTENGHYVITPLMPIP LPDSVQSPVLVNRGW PRTWKEKALE ++Q SE+S
Subjt:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS

Query:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP
        S     LVQ  E+SSWWKFWSKKT++LENE+TPITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IARS  +PED IYVEDINE+VNPS+PYPIP
Subjt:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A1S3CPJ6 SURF1-like protein | 9.4e-152 | 80.87
Query:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL
        MASSS AKSITKFR    LSGH ST LPSSSSSFSSAA VS          ++ QQK R+S  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYR+KRL
Subjt:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL

Query:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS
        L EPVNI+ L  LEDKLDDLEFRRV+CKGVFDE KSIYVGPRSRSISGVTENGHYVITPLMP+P LPDSVQSPVLVNRGW PRTWKEKALE ++Q SE+S
Subjt:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS

Query:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP
        S     LVQE E+SSWWKFWSKKT++LENE+TPITP+EVIGV+RTSEKPSIFVPANDP S QWFYVDVP IARS  +PED  YVEDINE+VNPS+PYPIP
Subjt:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A5D3BJ55 SURF1-like protein | 9.4e-152 | 80.87
Query:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL
        MASSS AKSITKFR    LSGH ST LPSSSSSFSSAA VS          ++ QQK R+S  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYR+KRL
Subjt:  MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRL

Query:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS
        L EPVNI+ L  LEDKLDDLEFRRV+CKGVFDE KSIYVGPRSRSISGVTENGHYVITPLMP+P LPDSVQSPVLVNRGW PRTWKEKALE ++Q SE+S
Subjt:  LTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKS

Query:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP
        S     LVQE E+SSWWKFWSKKT++LENE+TPITP+EVIGV+RTSEKPSIFVPANDP S QWFYVDVP IARS  +PED  YVEDINE+VNPS+PYPIP
Subjt:  S----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A6J1GUD0 SURF1-like protein | 2.3e-150 | 80.69
Query:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK
        MASSS AKSI KFR  + LS H +  T LPSSS SFSSAAAVS          ++ QQK RDS  SKWLLFLPGALTFGLGTWQIFRRQ+K E+LDYRQK
Subjt:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK

Query:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE
        RLL EPVNI+ L PL+DKLDDLEFRRV+CKGVFDE KSI+VGPRSRSISGVTENGHYVITPLMPIP LPDSVQSPVLVNRGWVPRTWKEKALE D Q SE
Subjt:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE

Query:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYP
        +SS     +VQE+E+SSWWKFWSK T+NLENEV+PITP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IAR+  +PEDAIYVEDINE+VNPSNPYP
Subjt:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

A0A6J1JYJ0 SURF1-like protein | 8.8e-150 | 80.17
Query:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK
        MASSS AKSI KFR F+ L+ H +  T LPSSS SFSSAAAVS          ++ QQK RDS  SKWLLFLPGALTFGLGTWQIFRRQ+KIE+LDYRQK
Subjt:  MASSSAAKSITKFRAFLPLSGHCS--TTLPSSSSSFSSAAAVS----------AEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQK

Query:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE
        RLL EPVN++ L PL+DKLDDLEFRRV+CKGVFDE KSI+VGPRSRSISGVTENGHYVITPLMPIP LPDSVQSPVLVNRGWVPRTWKEKALE D Q  E
Subjt:  RLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSE

Query:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPI-TPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPY
        +SS     +VQE+E+SSWWKFWSK T+NLENEV+PI TP+EVIGVVRTSEKPSIFVPANDPGS QWFYVDVP IAR+  +PEDAIYVEDINE+VNPSNPY
Subjt:  KSS----PLVQETEKSSWWKFWSKKTQNLENEVTPI-TPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR
        PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT+R+
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR

SwissProt top hits | e value | %identity
P09925 Surfeit locus protein 1 | 6.1e-23 | 28.34
Query:  SSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGP
        SS +  AA  AE      D  + +W L L  A  FGLGTWQ+ RR+ K++L+   + R++ EP+ +    P+E  L +LE+R V  +G FD +K +Y+ P
Subjt:  SSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGP

Query:  RS----------RSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPIT
        R+                TE+G +V+TP          +   +LVNRG+VPR              +K +P   ET +                    + 
Subjt:  RS----------RSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPIT

Query:  PIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAF
         ++++G+VR +E    FVP N P    W+Y D+  +A+      D I+++       P    PI       +R+     +H+ Y LTWY L AA +++ F
Subjt:  PIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAF

Query:  KRLRQKT
        ++  ++T
Subjt:  KRLRQKT

Q15526 Surfeit locus protein 1 | 1.7e-25 | 31.8
Query:  SSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSR
        SSAA  SA    K  D  + +W+L L     FGLGTWQ+ RR+ K+ L+   + R+L EPV +    P+E  L +LE+R V  +G FD +K +Y+ PR+ 
Subjt:  SSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGPRSR

Query:  -----------SISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPITPI
                    IS  T++G YV+TP          +   +LVNRG+VPR              +K +P   ET +             +E EV      
Subjt:  -----------SISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPITPI

Query:  EVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKR
        ++IG+VR +E    FVP N+P    W Y D+  +AR      + I+++   +   P    PI       +R+     +HL Y +TWY LSAA +++ FK+
Subjt:  EVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKR

Query:  LRQKT
          + T
Subjt:  LRQKT

Q9LP74 Surfeit locus protein 1-like | 1.6e-52 | 38.2
Query:  LSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCK
        +S   ++ LP++S + +  + + +      +    S  L +L G  T+GLG    F  Q ++E LD R++ L  +P+ ++        LD L FRRV+CK
Subjt:  LSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCK

Query:  GVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALE--------ADRQVSEKSSPLVQETEKSSWWKFWSKKT
        G+FDE +SIYVGP+ RS+S  +E G YVITPL+PIP+ P+S++SP+LVNRGWVP  WKE +LE        A  + S K++ L+  +++S   KFW K  
Subjt:  GVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALE--------ADRQVSEKSSPLVQETEKSSWWKFWSKKT

Query:  QNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLT
          +  E++V+    +EV+GVVR SE P I+   N P S  WFY+DVP++A ++   ED +Y+E    D++ S  YP+P+DV  L RS  +P D+  YT+ 
Subjt:  QNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLT

Query:  WYSLSAAVTFMAFKRLRQKTSR
        W+  S      A   L ++ ++
Subjt:  WYSLSAAVTFMAFKRLRQKTSR

Q9QXU2 Surfeit locus protein 1 | 1.6e-23 | 29.87
Query:  SSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGP
        SS +  AA  AE+     DS    +LLF+P A  FGLGTWQ+ RR+ K++L+   + R++ EP+ +    P+E  L +LE+R V  +G FD +K +Y+ P
Subjt:  SSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCKGVFDENKSIYVGP

Query:  RS----------RSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPIT
        R+                TE+G YV+TP          +   +LVNRG+VPR              +K +P   ET +                    + 
Subjt:  RS----------RSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPIT

Query:  PIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNP-YPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA
         ++++G+VR +E    FVP N+P    W+Y D+  +A+      D I+   I+ D N + P  PI       +R+     +H+ Y +TWY L AA +++ 
Subjt:  PIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNP-YPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMA

Query:  FKRLRQKT
        F++  ++T
Subjt:  FKRLRQKT

Q9SE51 Surfeit locus protein 1 | 8.3e-105 | 58.81
Query:  SGHCSTTLPSSSSS---FSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVL
        S H S    SSSSS     S ++ SA  Q+  R S WS+ LLFLPGA+TFGLG+WQI RR++K + L+Y+Q+RL  EP+ ++   PL+  L+ LEFRRV 
Subjt:  SGHCSTTLPSSSSS---FSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVL

Query:  CKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEK---ALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNL
        CKGVFDE +SIY+GPRSRSISG+TENG +VITPLMPIP   DS+QSP+LVNRGWVPR+W+EK   + EA+   ++ +       E  SWWKFWSK     
Subjt:  CKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEK---ALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNL

Query:  ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS
        +  ++ + P+EV+GV+R  E PSIFVP+NDP +GQWFYVDVP +AR++ +PE+ IYVED++E V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWYSLS
Subjt:  ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS

Query:  AAVTFMAFKRLRQKTSRR
        AAVTFMA+KRL+ K  RR
Subjt:  AAVTFMAFKRLRQKTSRR

Arabidopsis top hits | e value | %identity
AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein | 1.2e-53 | 38.2
Query:  LSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCK
        +S   ++ LP++S + +  + + +      +    S  L +L G  T+GLG    F  Q ++E LD R++ L  +P+ ++        LD L FRRV+CK
Subjt:  LSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCK

Query:  GVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALE--------ADRQVSEKSSPLVQETEKSSWWKFWSKKT
        G+FDE +SIYVGP+ RS+S  +E G YVITPL+PIP+ P+S++SP+LVNRGWVP  WKE +LE        A  + S K++ L+  +++S   KFW K  
Subjt:  GVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALE--------ADRQVSEKSSPLVQETEKSSWWKFWSKKT

Query:  QNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLT
          +  E++V+    +EV+GVVR SE P I+   N P S  WFY+DVP++A ++   ED +Y+E    D++ S  YP+P+DV  L RS  +P D+  YT+ 
Subjt:  QNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLT

Query:  WYSLSAAVTFMAFKRLRQKTSR
        W+  S      A   L ++ ++
Subjt:  WYSLSAAVTFMAFKRLRQKTSR

AT1G48510.2 Surfeit locus 1 cytochrome c oxidase biogenesis protein | 4.2e-51 | 39.86
Query:  LSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCK
        +S   ++ LP++S + +  + + +      +    S  L +L G  T+GLG    F  Q ++E LD R++ L  +P+ ++        LD L FRRV+CK
Subjt:  LSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVLCK

Query:  GVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALE--------ADRQVSEKSSPLVQETEKSSWWKFWSKKT
        G+FDE +SIYVGP+ RS+S  +E G YVITPL+PIP+ P+S++SP+LVNRGWVP  WKE +LE        A  + S K++ L+  +++S   KFW K  
Subjt:  GVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALE--------ADRQVSEKSSPLVQETEKSSWWKFWSKKT

Query:  QNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMP
          +  E++V+    +EV+GVVR SE P I+   N P S  WFY+DVP++A ++   ED +Y+E    D++ S  YP+P+DV  L RS+ +P
Subjt:  QNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMP

AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein | 5.9e-106 | 58.81
Query:  SGHCSTTLPSSSSS---FSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVL
        S H S    SSSSS     S ++ SA  Q+  R S WS+ LLFLPGA+TFGLG+WQI RR++K + L+Y+Q+RL  EP+ ++   PL+  L+ LEFRRV 
Subjt:  SGHCSTTLPSSSSS---FSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDLEFRRVL

Query:  CKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEK---ALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNL
        CKGVFDE +SIY+GPRSRSISG+TENG +VITPLMPIP   DS+QSP+LVNRGWVPR+W+EK   + EA+   ++ +       E  SWWKFWSK     
Subjt:  CKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEK---ALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNL

Query:  ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS
        +  ++ + P+EV+GV+R  E PSIFVP+NDP +GQWFYVDVP +AR++ +PE+ IYVED++E V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWYSLS
Subjt:  ENEVTPITPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS

Query:  AAVTFMAFKRLRQKTSRR
        AAVTFMA+KRL+ K  RR
Subjt:  AAVTFMAFKRLRQKTSRR


Sequences
CDS sequence
ATGGCATCTTCTTCCGCTGCTAAATCCATAACGAAATTTCGAGCTTTTCTTCCCCTTTCTGGGCACTGTTCGACGACTTTACCTTCATCTTCTTCATCCTTCAGTTCTGC
CGCTGCAGTTTCTGCTGAAGAACAGCAGAAACATAGGGATTCGGGATGGTCAAAATGGCTCCTTTTTTTACCTGGGGCTCTCACGTTTGGCCTTGGGACGTGGCAGATTT
TCAGGAGGCAAGATAAGATAGAATTGTTAGATTACAGGCAGAAGCGATTGTTAACGGAACCGGTGAACATAGACGGTTTATTTCCTTTGGAAGACAAGTTAGATGATTTG
GAGTTCAGGAGAGTGCTCTGTAAAGGAGTTTTTGATGAGAATAAATCAATCTATGTTGGTCCACGTTCAAGAAGCATTTCAGGAGTGACTGAAAATGGCCATTATGTTAT
TACCCCATTGATGCCAATTCCCAGCCTACCTGATAGTGTGCAGTCACCAGTTCTGGTCAACAGAGGATGGGTTCCTCGCACTTGGAAGGAGAAGGCTTTAGAAGCAGATC
GACAAGTTAGTGAAAAGTCTTCCCCATTGGTTCAAGAGACTGAGAAAAGCTCTTGGTGGAAGTTCTGGTCAAAAAAGACCCAGAATCTAGAGAATGAAGTCACTCCCATT
ACTCCAATAGAAGTTATTGGAGTTGTTCGAACAAGTGAAAAGCCTAGCATATTTGTACCAGCAAATGATCCTGGCTCTGGGCAATGGTTTTATGTTGATGTTCCAGAAAT
CGCACGCTCTTTGGATATGCCTGAGGATGCTATTTATGTGGAAGACATAAATGAGGATGTGAACCCAAGTAACCCATATCCGATTCCCAAGGATGTTAATACGTTGATAC
GAAGTTCCGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTACTCCCTCTCAGCTGCTGTGACCTTCATGGCTTTCAAAAGACTGAGGCAAAAAACCAGTCGA
AGATAA
mRNA sequence
GAACACAATGCACAAAACCCTTAACCCCAAGAATGAGTTTTCGGCCGCAATAGAGCAAAACAGAGCGGCCAGCTGAAGAACAAGATTCAAAACTTTTCAAATGTCAGCAT
CAAGAACATGGCATCTTCTTCCGCTGCTAAATCCATAACGAAATTTCGAGCTTTTCTTCCCCTTTCTGGGCACTGTTCGACGACTTTACCTTCATCTTCTTCATCCTTCA
GTTCTGCCGCTGCAGTTTCTGCTGAAGAACAGCAGAAACATAGGGATTCGGGATGGTCAAAATGGCTCCTTTTTTTACCTGGGGCTCTCACGTTTGGCCTTGGGACGTGG
CAGATTTTCAGGAGGCAAGATAAGATAGAATTGTTAGATTACAGGCAGAAGCGATTGTTAACGGAACCGGTGAACATAGACGGTTTATTTCCTTTGGAAGACAAGTTAGA
TGATTTGGAGTTCAGGAGAGTGCTCTGTAAAGGAGTTTTTGATGAGAATAAATCAATCTATGTTGGTCCACGTTCAAGAAGCATTTCAGGAGTGACTGAAAATGGCCATT
ATGTTATTACCCCATTGATGCCAATTCCCAGCCTACCTGATAGTGTGCAGTCACCAGTTCTGGTCAACAGAGGATGGGTTCCTCGCACTTGGAAGGAGAAGGCTTTAGAA
GCAGATCGACAAGTTAGTGAAAAGTCTTCCCCATTGGTTCAAGAGACTGAGAAAAGCTCTTGGTGGAAGTTCTGGTCAAAAAAGACCCAGAATCTAGAGAATGAAGTCAC
TCCCATTACTCCAATAGAAGTTATTGGAGTTGTTCGAACAAGTGAAAAGCCTAGCATATTTGTACCAGCAAATGATCCTGGCTCTGGGCAATGGTTTTATGTTGATGTTC
CAGAAATCGCACGCTCTTTGGATATGCCTGAGGATGCTATTTATGTGGAAGACATAAATGAGGATGTGAACCCAAGTAACCCATATCCGATTCCCAAGGATGTTAATACG
TTGATACGAAGTTCCGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTACTCCCTCTCAGCTGCTGTGACCTTCATGGCTTTCAAAAGACTGAGGCAAAAAAC
CAGTCGAAGATAACGAAAACTATAACGTAATCTTCTATAGAGCAGGAACAAACCCGGAATTCCTAAATTGTATGTGAGAGTGTTTTTTTGGTATTGTTTTTTCATAATGC
TTTTCACTAGAGAAGCCTGAATATCTCATTTCCTTGAACTTGAAAATCCTAAGAAACCCTCAATTCCCTTTAGCCATACAAGCATGTTGAGCAAGATATCAAGTTTTTTG
CAACCATTAAACTGAACATACATTTTGATATTTCCACTGTCATTTTCTGCTGTCTAAATTGTGAATCTTGGGCATAGCCAATAATGGTGGTCTGGGTCTTTCAATTTCTT
TGTCTGTTTTGTTTGTTTTTTTCTCTTAAAATCTTTCATTGTCAAGGTTATTAGGCACTGTTCAAAGGTGAAAGTGACAAAATATGTATCTTGTTAGGTGGGTATCAATG
GATGTGTATGAATATTGAAAACTTATTTGGTTGG
Protein sequence
MASSSAAKSITKFRAFLPLSGHCSTTLPSSSSSFSSAAAVSAEEQQKHRDSGWSKWLLFLPGALTFGLGTWQIFRRQDKIELLDYRQKRLLTEPVNIDGLFPLEDKLDDL
EFRRVLCKGVFDENKSIYVGPRSRSISGVTENGHYVITPLMPIPSLPDSVQSPVLVNRGWVPRTWKEKALEADRQVSEKSSPLVQETEKSSWWKFWSKKTQNLENEVTPI
TPIEVIGVVRTSEKPSIFVPANDPGSGQWFYVDVPEIARSLDMPEDAIYVEDINEDVNPSNPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSR
R
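
The protein sequence should be the translation of the CDS. A minimal sketch for checking this, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene: `translate` is a hypothetical helper, and only the first eight codons and the last two codons of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


print(translate("ATGGCATCTTCTTCCGCTGCTAAA"))  # MASSSAAK
print(translate("AGATAA"))                    # R*
```

Both fragments agree with the start (`MASSSAAK`) and end (`R`, then a stop codon) of the protein sequence listed above. Pasting the full CDS into `translate` and stripping the trailing `*` allows the same check over the whole sequence.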