; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g1161 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g1161
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSURF1-like protein
Genome locationMC03:17778308..17782200
RNA-Seq ExpressionMC03g1161
SyntenyMC03g1161
Gene Ontology termsGO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR002994 - Surfeit locus 1/Shy1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572973.1 Surfeit locus protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.31e-20384.77Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP-----SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQ
        MAS  FAKSI KFRP +SLS H +TP     SS+S FSSAAAVSSA+DPQSSSLSQAQQK+R+S+ SKWLLFLPGALTFGLGTWQIFRRQEKIE+LD+RQ
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP-----SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQ

Query:  KRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSS
        KRLLMEPVN+NSL PL+DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PIP +PDSVQSPVLVNRGWVPR+WKEKALEVD Q S
Subjt:  KRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSS

Query:  EQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPY
        EQSSD A S +QESERSSWWKFWSK  +NL+N+V+PITP EVIGVVRTSEKPSIFVPANDP SSQWFYVDVP IAR SGLPED IYVEDINE+VNPS+PY
Subjt:  EQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPY

Query:  PIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        PIPKDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT R+
Subjt:  PIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

XP_004137509.1 surfeit locus protein 1 [Cucumis sativus]3.65e-20585.8Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL
        MAS   AKSITKFRPC SLSGH STP  SS S FSSAA VSS  DP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEKIEMLD+R+KRL
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL

Query:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS
        LMEPVNIN+L  LEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+PIP +PDSVQSPVLVNRGW PR+WKEKALEV+QQ SEQS
Subjt:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS

Query:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP
        SDI  S +Q  ERSSWWKFWSK+ E+L+N++ PITP EVIGVVRTSEKPSIFVPANDP S QWFYVDVP IAR+SGLPED IYVEDINE+VNPSDPYPIP
Subjt:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP

Query:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT+RR
Subjt:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

XP_008465733.1 PREDICTED: surfeit locus protein 1 [Cucumis melo]2.11e-20484.93Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL
        MAS   AKSITKFRPC SLSGH STP  SS S FSSAA VSS  DP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEKIEMLD+R+KRL
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL

Query:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS
        LMEPVNIN+L  LEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+P+P +PDSVQSPVLVNRGW PR+WKEKALEV+QQ SEQS
Subjt:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS

Query:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP
        S    S +QE ERSSWWKFWSK+ E+L+N++ PITP EVIGV+RTSEKPSIFVPANDP S QWFYVDVP IAR+SGLPED  YVEDINE+VNPSDPYPIP
Subjt:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP

Query:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT+RR
Subjt:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

XP_022136969.1 surfeit locus protein 1 [Momordica charantia]3.87e-243100Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLM
        MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLM
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLM

Query:  EPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSD
        EPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSD
Subjt:  EPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSD

Query:  IASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD
        IASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD
Subjt:  IASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD

Query:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
Subjt:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

XP_023541124.1 surfeit locus protein 1-like isoform X2 [Cucurbita pepo subsp. pepo]1.31e-20385.3Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP----SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQK
        MAS  FAKSI KFRP +SLS H +TP    SS S FSSAAAVSSA+DPQSSSLSQAQQKQR+S+ SKWLLFLPGALTFGLGTWQIFRRQEKIE+LD+RQK
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP----SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQK

Query:  RLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSE
        RLLMEPVNINSL PL+DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PI  +PDSVQSPVLVNRGWVPR+WKEKALEVD Q SE
Subjt:  RLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSE

Query:  QSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYP
        QSSDI  S +QESERSSWWKFWSK  +NL+N+V+PITP EVIGVVRTSEKPSIFVPANDP SSQWFYVDVP IAR SGLPED IYVEDINE+VNPS+PYP
Subjt:  QSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYP

Query:  IPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        IPKDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT R+
Subjt:  IPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

TrEMBL top hitse value%identityAlignment
A0A0A0LVW3 SURF1-like protein1.76e-20585.8Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL
        MAS   AKSITKFRPC SLSGH STP  SS S FSSAA VSS  DP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEKIEMLD+R+KRL
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL

Query:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS
        LMEPVNIN+L  LEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+PIP +PDSVQSPVLVNRGW PR+WKEKALEV+QQ SEQS
Subjt:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS

Query:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP
        SDI  S +Q  ERSSWWKFWSK+ E+L+N++ PITP EVIGVVRTSEKPSIFVPANDP S QWFYVDVP IAR+SGLPED IYVEDINE+VNPSDPYPIP
Subjt:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP

Query:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        KDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT+RR
Subjt:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

A0A1S3CPJ6 SURF1-like protein1.02e-20484.93Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL
        MAS   AKSITKFRPC SLSGH STP  SS S FSSAA VSS  DP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEKIEMLD+R+KRL
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL

Query:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS
        LMEPVNIN+L  LEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+P+P +PDSVQSPVLVNRGW PR+WKEKALEV+QQ SEQS
Subjt:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS

Query:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP
        S    S +QE ERSSWWKFWSK+ E+L+N++ PITP EVIGV+RTSEKPSIFVPANDP S QWFYVDVP IAR+SGLPED  YVEDINE+VNPSDPYPIP
Subjt:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP

Query:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT+RR
Subjt:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

A0A5D3BJ55 SURF1-like protein1.02e-20484.93Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL
        MAS   AKSITKFRPC SLSGH STP  SS S FSSAA VSS  DP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEKIEMLD+R+KRL
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP--SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRL

Query:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS
        LMEPVNIN+L  LEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+P+P +PDSVQSPVLVNRGW PR+WKEKALEV+QQ SEQS
Subjt:  LMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQS

Query:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP
        S    S +QE ERSSWWKFWSK+ E+L+N++ PITP EVIGV+RTSEKPSIFVPANDP S QWFYVDVP IAR+SGLPED  YVEDINE+VNPSDPYPIP
Subjt:  SDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIP

Query:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT+RR
Subjt:  KDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

A0A6J1C5U4 SURF1-like protein1.87e-243100Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLM
        MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLM
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLM

Query:  EPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSD
        EPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSD
Subjt:  EPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSD

Query:  IASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD
        IASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD
Subjt:  IASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD

Query:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
Subjt:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

A0A6J1GUD0 SURF1-like protein4.26e-20284.48Show/hide
Query:  MASPYFAKSITKFRPCLSLSGHYSTP-----SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQ
        MAS  FAKSI KFRP + LS H +TP     SS+S FSSAAAVSSA+DPQSSSLSQAQQK+R+S+ SKWLLFLPGALTFGLGTWQIFRRQEK E+LD+RQ
Subjt:  MASPYFAKSITKFRPCLSLSGHYSTP-----SSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQ

Query:  KRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSS
        KRLLMEPVNINSL PL+DKLDDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPL+PIP +PDSVQSPVLVNRGWVPR+WKEKALEVD Q S
Subjt:  KRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSS

Query:  EQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPY
        EQSSD A S +QESERSSWWKFWSK  +NL+N+V+PITP EVIGVVRTSEKPSIFVPANDP SSQWFYVDVP IAR SGLPED IYVEDINE+VNPS+PY
Subjt:  EQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPY

Query:  PIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        PIPKDVNTL RSSVMPQDHLNYTLTWYSLSAAVTFMA KRLRQKT R+
Subjt:  PIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

SwissProt top hitse value%identityAlignment
P09925 Surfeit locus protein 12.4e-2226.56Show/hide
Query:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRS----
        SS+   A  K  +  + +W L L  A  FGLGTWQ+ RR+ K++++   + R++ EP+ + +  P+E  L +LE+R V  +G FD  K +Y+ PR+    
Subjt:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRS----

Query:  ------RSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPT
                    TE+G +V+TP          +   +LVNRG+VPR       +V+ ++ ++   +                                  
Subjt:  ------RSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPT

Query:  EVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKR
        +++G+VR +E    FVP N P  + W+Y D+  +A+ +G   D I+++       P  P      +   TR + +  +H+ Y LTWY L AA +++ F++
Subjt:  EVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKR

Query:  LRQKT
          ++T
Subjt:  LRQKT

Q15526 Surfeit locus protein 12.2e-2328.76Show/hide
Query:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSR---
        SS+   +  K  +  + +W+L L     FGLGTWQ+ RR+ K+ ++   + R+L EPV + +  P+E  L +LE+R V  +G FD  K +Y+ PR+    
Subjt:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSR---

Query:  --------SISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITP
                 IS  T++G YV+TP     +  D +   +LVNRG+VPR   +K     +Q  +   ++                                 
Subjt:  --------SISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITP

Query:  TEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFK
         ++IG+VR +E    FVP N+P  + W Y D+  +AR +G   + I+++   +   P  P      +   TR + +  +HL Y +TWY LSAA +++ FK
Subjt:  TEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFK

Query:  RLRQKT
        +  + T
Subjt:  RLRQKT

Q9LP74 Surfeit locus protein 1-like3.7e-5543Show/hide
Query:  SYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKG
        S S  S+  A S  S+ +S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N+       LD L FRRV+CKG
Subjt:  SYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALE---VDQQSSEQSSDIASSSIQESERSSWWKFWSKRAEN
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIPN P+S++SP+LVNRGWVP  WKE +LE        +       ++ +  S++S   KFW K    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALE---VDQQSSEQSSDIASSSIQESERSSWWKFWSKRAEN

Query:  L--KNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWY
        +  ++ V+     EV+GVVR SE P I+   N P S  WFY+DVP +A A G  ED +Y+E    D++ S  YP+P+DV  LTRS  +P D+  YT+ W+
Subjt:  L--KNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWY

Q9QXU2 Surfeit locus protein 12.8e-2326.89Show/hide
Query:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRS----
        SS+   A  K  +  + +W L    A  FGLGTWQ+ RR+ K++++   + R++ EP+ + +  P+E  L +LE+R V  +G FD  K +Y+ PR+    
Subjt:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRS----

Query:  ------RSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPT
                    TE+G YV+TP          +   +LVNRG+VPR       +V+ ++ +Q   +                                  
Subjt:  ------RSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSKRAENLKNDVAPITPT

Query:  EVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKR
        +++G+VR +E    FVP N+P  S W+Y D+  +A+ +G   D I+++       P  P      +   TR + +  +H+ Y +TWY L AA +++ F++
Subjt:  EVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKR

Query:  LRQKT
          ++T
Subjt:  LRQKT

Q9SE51 Surfeit locus protein 18.6e-10556.56Show/hide
Query:  YFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVN
        Y+  + T      SL   + +    ++  S+++ S+A   QSSS +  Q+ +R S+WS+ LLFLPGA+TFGLG+WQI RR+EK + L+++Q+RL MEP+ 
Subjt:  YFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVN

Query:  INSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASS
        +N   PL+  L+ LEFRRV CKGVFDE++SIY+GPRSRSISG+TENG +VITPL+PIP   DS+QSP+LVNRGWVPRSW+EK+     Q S ++  IA+ 
Subjt:  INSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASS

Query:  SIQ----ESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD
        S +     +E  SWWKFWSK     K  ++ + P EV+GV+R  E PSIFVP+NDP + QWFYVDVP +ARA GLPE+ IYVED++E V+ S PYP+PKD
Subjt:  SIQ----ESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD

Query:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        +NTL RS VMPQDHLNY++TWYSLSAAVTFMA+KRL+ K  RR
Subjt:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR

Arabidopsis top hitse value%identityAlignment
AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein2.6e-5643Show/hide
Query:  SYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKG
        S S  S+  A S  S+ +S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N+       LD L FRRV+CKG
Subjt:  SYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALE---VDQQSSEQSSDIASSSIQESERSSWWKFWSKRAEN
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIPN P+S++SP+LVNRGWVP  WKE +LE        +       ++ +  S++S   KFW K    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALE---VDQQSSEQSSDIASSSIQESERSSWWKFWSKRAEN

Query:  L--KNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWY
        +  ++ V+     EV+GVVR SE P I+   N P S  WFY+DVP +A A G  ED +Y+E    D++ S  YP+P+DV  LTRS  +P D+  YT+ W+
Subjt:  L--KNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWY

AT1G48510.2 Surfeit locus 1 cytochrome c oxidase biogenesis protein1.6e-5343.25Show/hide
Query:  SYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKG
        S S  S+  A S  S+ +S  LS A    ++ R S  L +L G  T+GLG    F  Q ++E LD R++ L M+P+ +N+       LD L FRRV+CKG
Subjt:  SYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFPLEDKLDDLEFRRVICKG

Query:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALE---VDQQSSEQSSDIASSSIQESERSSWWKFWSKRAEN
        +FDE++SIYVGP+ RS+S  +E G YVITPL+PIPN P+S++SP+LVNRGWVP  WKE +LE        +       ++ +  S++S   KFW K    
Subjt:  VFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALE---VDQQSSEQSSDIASSSIQESERSSWWKFWSKRAEN

Query:  L--KNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMP
        +  ++ V+     EV+GVVR SE P I+   N P S  WFY+DVP +A A G  ED +Y+E    D++ S  YP+P+DV  LTRS+ +P
Subjt:  L--KNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMP

AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein6.1e-10656.56Show/hide
Query:  YFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVN
        Y+  + T      SL   + +    ++  S+++ S+A   QSSS +  Q+ +R S+WS+ LLFLPGA+TFGLG+WQI RR+EK + L+++Q+RL MEP+ 
Subjt:  YFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVN

Query:  INSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASS
        +N   PL+  L+ LEFRRV CKGVFDE++SIY+GPRSRSISG+TENG +VITPL+PIP   DS+QSP+LVNRGWVPRSW+EK+     Q S ++  IA+ 
Subjt:  INSLFPLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASS

Query:  SIQ----ESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD
        S +     +E  SWWKFWSK     K  ++ + P EV+GV+R  E PSIFVP+NDP + QWFYVDVP +ARA GLPE+ IYVED++E V+ S PYP+PKD
Subjt:  SIQ----ESERSSWWKFWSKRAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKD

Query:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR
        +NTL RS VMPQDHLNY++TWYSLSAAVTFMA+KRL+ K  RR
Subjt:  VNTLTRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTNRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCACCTTATTTTGCTAAATCCATAACCAAATTTCGTCCTTGTCTTTCCCTTTCCGGGCACTACTCCACGCCTTCATCTTATTCTTTGTTCAGCTCTGCCGCTGC
AGTTTCTTCTGCCTCGGATCCCCAGTCATCTTCCCTTTCGCAAGCTCAACAGAAACAAAGAGAGTCGAGATGGTCAAAATGGCTCCTGTTTCTACCTGGGGCTCTCACGT
TTGGCCTCGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGATAGAAATGCTAGATTTCAGGCAGAAGCGATTGTTAATGGAACCCGTGAACATAAACAGCTTGTTTCCA
TTGGAAGACAAGCTAGATGATCTGGAGTTCAGAAGGGTAATCTGTAAAGGAGTTTTTGATGAGAAAAAATCAATCTATGTCGGTCCACGTTCAAGAAGCATTTCAGGAGT
GACTGAAAATGGCCATTATGTTATTACCCCATTGGTGCCCATTCCCAACATACCTGATAGTGTGCAGTCACCGGTTTTGGTTAATAGAGGATGGGTTCCGCGTAGTTGGA
AGGAGAAGGCTTTGGAAGTAGATCAACAAAGTAGTGAACAGTCTTCAGATATTGCATCCTCCTCGATTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGGTCCAAA
AGGGCTGAGAATCTAAAGAATGATGTCGCTCCCATTACTCCAACAGAAGTAATTGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTTCCAGCAAATGATCCCCG
CTCTAGCCAGTGGTTTTATGTTGACGTTCCAGGGATTGCCCGTGCGTCTGGTCTTCCTGAGGATGTTATTTATGTGGAAGACATAAATGAGGATGTGAACCCGAGTGATC
CATACCCCATTCCAAAGGATGTTAATACCTTGACACGGAGTTCTGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTACTCTCTCTCGGCTGCCGTAACCTTC
ATGGCTTTCAAAAGACTCAGGCAAAAAACAAACCGAAGATAG
mRNA sequenceShow/hide mRNA sequence
ATTGGTATGATCTGATTTTTCTTTTTTCTTTTATTTTTATTGAGTATGCAATATTACAATGATTTATTAAATTGTGATTTGATTTTTAATTATAAAAGTGGAATACAATT
TATTTTATTGAGCAAGTTGAATTTCTCTTTGAACAAGAGGCAGCGGCTGTATGAAATATGAAGGCAAATTCATAAACCCTTAACCCCCAGGAAGAGTTTGCAAGCTCAGC
GGGGCAGGGCAGCCAGCTGAAGAAAACGACTGAAACTCTCCAAATGTCAGCATTAAGAACATGGCATCACCTTATTTTGCTAAATCCATAACCAAATTTCGTCCTTGTCT
TTCCCTTTCCGGGCACTACTCCACGCCTTCATCTTATTCTTTGTTCAGCTCTGCCGCTGCAGTTTCTTCTGCCTCGGATCCCCAGTCATCTTCCCTTTCGCAAGCTCAAC
AGAAACAAAGAGAGTCGAGATGGTCAAAATGGCTCCTGTTTCTACCTGGGGCTCTCACGTTTGGCCTCGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGATAGAAATG
CTAGATTTCAGGCAGAAGCGATTGTTAATGGAACCCGTGAACATAAACAGCTTGTTTCCATTGGAAGACAAGCTAGATGATCTGGAGTTCAGAAGGGTAATCTGTAAAGG
AGTTTTTGATGAGAAAAAATCAATCTATGTCGGTCCACGTTCAAGAAGCATTTCAGGAGTGACTGAAAATGGCCATTATGTTATTACCCCATTGGTGCCCATTCCCAACA
TACCTGATAGTGTGCAGTCACCGGTTTTGGTTAATAGAGGATGGGTTCCGCGTAGTTGGAAGGAGAAGGCTTTGGAAGTAGATCAACAAAGTAGTGAACAGTCTTCAGAT
ATTGCATCCTCCTCGATTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGGTCCAAAAGGGCTGAGAATCTAAAGAATGATGTCGCTCCCATTACTCCAACAGAAGT
AATTGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTTCCAGCAAATGATCCCCGCTCTAGCCAGTGGTTTTATGTTGACGTTCCAGGGATTGCCCGTGCGTCTG
GTCTTCCTGAGGATGTTATTTATGTGGAAGACATAAATGAGGATGTGAACCCGAGTGATCCATACCCCATTCCAAAGGATGTTAATACCTTGACACGGAGTTCTGTTATG
CCACAGGATCATCTTAATTACACATTAACATGGTACTCTCTCTCGGCTGCCGTAACCTTCATGGCTTTCAAAAGACTCAGGCAAAAAACAAACCGAAGATAGCGAAAACT
ATACCATAATATTCTATAGAGCAGGAGCGAACTGGAACTCTTAAATTGTATGTGAGAGTTATTTTTGTTGGTATTGTTTTTCATAATGCTGCCTCAGACTCTTTGAACTA
AGAAAAGATAACAAGTTTTCTACATTTTGAATAAATCTGTTCTTCATTTTGGAAAATTCTGTGTGGATGGTTGGTCAGACCTTCATAATCTGTCTACTTGTTTGTTTTCC
TTAAAAGCTTAAAAGCTTTCATTATCAAGTTTATTAGGGGCATTGTCAAAGGTGAAAGTGACATAAAACATATATCTTGTTAGGTGGGCAACAATGGATGTGTATGAACA
TTGAGAACTCATTTGGTTGGTTGTATGTTTTCAACATCTAATTGGATTTTAACCCTTTATCCACTCAAATAAGTCCTTTGGCATTTTTTGAAAAGAAAGGTCACATCTCC
ATGCCAATGGATGGTTCCCACCTCCA
Protein sequenceShow/hide protein sequence
MASPYFAKSITKFRPCLSLSGHYSTPSSYSLFSSAAAVSSASDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKIEMLDFRQKRLLMEPVNINSLFP
LEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLVPIPNIPDSVQSPVLVNRGWVPRSWKEKALEVDQQSSEQSSDIASSSIQESERSSWWKFWSK
RAENLKNDVAPITPTEVIGVVRTSEKPSIFVPANDPRSSQWFYVDVPGIARASGLPEDVIYVEDINEDVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVTF
MAFKRLRQKTNRR