; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027409 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027409
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSURF1-like protein
Genome locationtig00153054:1198967..1204567
RNA-Seq ExpressionSgr027409
SyntenySgr027409
Gene Ontology termsGO:0005743 - mitochondrial inner membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR002994 - Surfeit locus 1/Shy1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572973.1 Surfeit locus protein 1, partial [Cucurbita argyrosperma subsp. sororia]2.7e-15886.05Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK
        MAS SFAKS+ KFRP +SLSR  ATP+  PSSS SFSSAAAVSSA DPQSSSLSQAQQK+R+S+ SKWLLFLPGALTFGLGTWQIFRRQEK+E+LD+RQK
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK

Query:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE
        RLLMEPVN+NS+ PL DK DDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGWVPR+WK+KALEVD QG E
Subjt:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE

Query:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYP
        QSSD APS+VQESERSSWWKFWSK T NLENEV+PITP+EVIGVVRTSEKPSIFVPANDP SSQWFYVDVPAIAR SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWY----VVTFFHI
        IPKDVNTLIRSSVMPQDHLNYTLTWY     VTF  I
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWY----VVTFFHI

KAG7012154.1 Surfeit locus protein 1 [Cucurbita argyrosperma subsp. argyrosperma]9.2e-15986.53Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK
        MAS SFAKS+ KFRP +SLSR  ATP+  PSSS SFSSAAAVSSA DPQSSSLSQAQQK+R+S+ SKWLLFLPGALTFGLGTWQIFRRQEK+E+LD+RQK
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK

Query:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE
        RLLMEPVN+NS+ PL DK DDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGWVPR+WK+KALEVD QG E
Subjt:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE

Query:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYP
        QSSD APS+VQESERSSWWKFWSK T NLENEV+PITP+EVIGVVRTSEKPSIFVPANDP SSQWFYVDVPAIAR SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWYVVTFFHIS
        IPKDVNTLIRSSVMPQDHLNYTLTWYV   F  S
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWYVVTFFHIS

XP_004137509.1 surfeit locus protein 1 [Cucumis sativus]1.2e-15887.65Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL
        MAS S AKS+TKFRPC SLS   +TP PSSSSSFSSAA VSS PDP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEK+EMLD+R+KRL
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL

Query:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS
        LMEPVNIN++  L DK DDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGW PR+WK+KALEV++QG EQS
Subjt:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS

Query:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP
        SDI PSLVQ  ERSSWWKFWSKKT +LENE+TPITP+EVIGVVRTSEKPSIFVPANDP S QWFYVDVPAIAR+SGLPEDTIYVEDINENVNPSDPYPIP
Subjt:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWY
        KDVNTLIRSSVMPQDHLNYTLTWY
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWY

XP_022136969.1 surfeit locus protein 1 [Momordica charantia]2.7e-15888.89Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL
        MASP FAKS+TKFRPCLSLS   +T  PSS S FSSAAAVSSA DPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEK+EMLDFRQKRL
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL

Query:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS
        LMEPVNINS+FPL DK DDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+PIP IPDSVQSPVLVNRGWVPRSWK+KALEVD+Q  EQS
Subjt:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS

Query:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP
        SDIA S +QESERSSWWKFWSK+  NL+N+V PITP EVIGVVRTSEKPSIFVPANDP SSQWFYVDVP IARASGLPED IYVEDINE+VNPSDPYPIP
Subjt:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWY
        KDVNTL RSSVMPQDHLNYTLTWY
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWY

XP_038895168.1 surfeit locus protein 1-like [Benincasa hispida]2.4e-15986.85Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL
        MAS S AKS+TKFRP  SLS  C+TP P SSSSFSSAA VSSAPDP SS LSQAQQKQRESRWSKWLLFLPGALTFGLGTWQI RRQ+K+EMLD+RQKRL
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL

Query:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS
        LM+PVNIN++ PL DK DDLEFRRVICKGVFDEKKS YVGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGW PR+WK+KALEVD+QG EQS
Subjt:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS

Query:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP
        SDI PSLVQE+ RSSWWKFWSKKT +LENE TPITPIEVIGVVRTSEKPSIFVPANDP S QWFYVDVPAIAR+SGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWYVVT
        KDVNTLIRSSVMPQDHLNYTLTWY ++
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWYVVT

TrEMBL top hitse value%identityAlignment
A0A0A0LVW3 SURF1-like protein5.8e-15987.65Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL
        MAS S AKS+TKFRPC SLS   +TP PSSSSSFSSAA VSS PDP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEK+EMLD+R+KRL
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL

Query:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS
        LMEPVNIN++  L DK DDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGW PR+WK+KALEV++QG EQS
Subjt:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS

Query:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP
        SDI PSLVQ  ERSSWWKFWSKKT +LENE+TPITP+EVIGVVRTSEKPSIFVPANDP S QWFYVDVPAIAR+SGLPEDTIYVEDINENVNPSDPYPIP
Subjt:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWY
        KDVNTLIRSSVMPQDHLNYTLTWY
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWY

A0A5D3BJ55 SURF1-like protein7.1e-15786.11Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL
        MAS S AKS+TKFRPC SLS   +TP PSSSSSFSSAA VSS PDP SSSLSQ QQKQRESR SKWLLFLPGALTFGLGTWQIFRRQEK+EMLD+R+KRL
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL

Query:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS
        LMEPVNIN++  L DK DDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMP+PG+PDSVQSPVLVNRGW PR+WK+KALEV++QG EQS
Subjt:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS

Query:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP
        S   PSLVQE ERSSWWKFWSKKT +LENE+TPITP+EVIGV+RTSEKPSIFVPANDP S QWFYVDVPAIAR+SGLPEDT YVEDINENVNPSDPYPIP
Subjt:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWY
        KDVNTL RSSVMPQDHLNYTLTWY
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWY

A0A6J1C5U4 SURF1-like protein1.3e-15888.89Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL
        MASP FAKS+TKFRPCLSLS   +T  PSS S FSSAAAVSSA DPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEK+EMLDFRQKRL
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRL

Query:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS
        LMEPVNINS+FPL DK DDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPL+PIP IPDSVQSPVLVNRGWVPRSWK+KALEVD+Q  EQS
Subjt:  LMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQS

Query:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP
        SDIA S +QESERSSWWKFWSK+  NL+N+V PITP EVIGVVRTSEKPSIFVPANDP SSQWFYVDVP IARASGLPED IYVEDINE+VNPSDPYPIP
Subjt:  SDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIP

Query:  KDVNTLIRSSVMPQDHLNYTLTWY
        KDVNTL RSSVMPQDHLNYTLTWY
Subjt:  KDVNTLIRSSVMPQDHLNYTLTWY

A0A6J1GUD0 SURF1-like protein1.9e-15786.05Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK
        MAS SFAKS+ KFRP + LSR  ATP+  PSSS SFSSAAAVSSA DPQSSSLSQAQQK+R+S+ SKWLLFLPGALTFGLGTWQIFRRQEK E+LD+RQK
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK

Query:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE
        RLLMEPVNINS+ PL DK DDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGWVPR+WK+KALEVD QG E
Subjt:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE

Query:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYP
        QSSD APS+VQESERSSWWKFWSK T NLENEV+PITP+EVIGVVRTSEKPSIFVPANDP SSQWFYVDVPAIAR SGLPED IYVEDINENVNPS+PYP
Subjt:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYP

Query:  IPKDVNTLIRSSVMPQDHLNYTLTWY----VVTFFHI
        IPKDVNTLIRSSVMPQDHLNYTLTWY     VTF  I
Subjt:  IPKDVNTLIRSSVMPQDHLNYTLTWY----VVTFFHI

A0A6J1JYJ0 SURF1-like protein3.2e-15785.8Show/hide
Query:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK
        MAS SFAKS+ KFRP +SL+R  ATP+  PSSS SFSSAAAVSSA DPQSSSLSQAQQK+R+S+ SKWLLFLPGALTFGLGTWQIFRRQEK+E+LD+RQK
Subjt:  MASPSFAKSVTKFRPCLSLSRLCATPS--PSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQK

Query:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE
        RLLMEPVN+NS+ PL DK DDLEFRRVICKGVFDEKKSI+VGPRSRSISGVTENGHYVITPLMPIPG+PDSVQSPVLVNRGWVPR+WK+KALEVD QG E
Subjt:  RLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIE

Query:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPI-TPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPY
        QSSDIAPS+VQESERSSWWKFWSK T NLENEV+PI TP+EVIGVVRTSEKPSIFVPANDP SSQWFYVDVPAIAR SGLPED IYVEDINENVNPS+PY
Subjt:  QSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPI-TPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPY

Query:  PIPKDVNTLIRSSVMPQDHLNYTLTWY----VVTFFHI
        PIPKDVNTLIRSSVMPQDHLNYTLTWY     VTF  I
Subjt:  PIPKDVNTLIRSSVMPQDHLNYTLTWY----VVTFFHI

SwissProt top hitse value%identityAlignment
P09925 Surfeit locus protein 11.3e-1926.83Show/hide
Query:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRS----
        SS+   A  K  +  + +W L L  A  FGLGTWQ+ RR+ K++++   + R++ EP+ + +  P+  K  +LE+R V  +G FD  K +Y+ PR+    
Subjt:  SSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRS----

Query:  ------RSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPI
                    TE+G +V+TP          +   +LVNRG+VPR                   + P   Q+ +                     +  +
Subjt:  ------RSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPI

Query:  EVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY
        +++G+VR +E    FVP N P  + W+Y D+ A+A+ +G   D I+++    +  P    PI       +R+     +H+ Y LTWY
Subjt:  EVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY

Q15526 Surfeit locus protein 17.4e-1828.62Show/hide
Query:  LSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQ-QKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKS
        L R  A+ +  S    S    V+  P    SS ++A   K  +  + +W+L L     FGLGTWQ+ RR+ K+ ++   + R+L EPV + +  P+  K 
Subjt:  LSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQ-QKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKS

Query:  DDLEFRRVICKGVFDEKKSIYVGPRSR-----------SISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPS
         +LE+R V  +G FD  K +Y+ PR+             IS  T++G YV+TP          +   +LVNRG+VPR                   + P 
Subjt:  DDLEFRRVICKGVFDEKKSIYVGPRSR-----------SISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPS

Query:  LVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTL
          Q+ +              +E EV      ++IG+VR +E    FVP N+P  + W Y D+ A+AR +G   + I+++   ++  P    PI       
Subjt:  LVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTL

Query:  IRSSVMPQDHLNYTLTWY
        +R+     +HL Y +TWY
Subjt:  IRSSVMPQDHLNYTLTWY

Q9LP74 Surfeit locus protein 1-like2.3e-5643.55Show/hide
Query:  RLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDL
        RL +     SSS+ S+  A S   + +S  LS A    ++ R S  L +L G  T+GLG    F  Q +VE LD R++ L M+P+ +N+   L    D L
Subjt:  RLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDL

Query:  EFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQG----IEQSSDIAPSLVQESERSSW
         FRRV+CKG+FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGWVP  WK+ +LE    G      + S  A  L+  S++S  
Subjt:  EFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQG----IEQSSDIAPSLVQESERSSW

Query:  WKFWSKKTNNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQ
         KFW K  N +  E++V+    +EV+GVVR SE P I+   N P+S  WFY+DVP +A A G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P 
Subjt:  WKFWSKKTNNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQ

Query:  DHLNYTLTWY
        D+  YT+ W+
Subjt:  DHLNYTLTWY

Q9QXU2 Surfeit locus protein 14.3e-1826.73Show/hide
Query:  CATPSP-----SSSSSFSSAAAVSSAPDPQSSSLSQ-AQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDK
        CA P+P      S   FS  + +   P    SS ++ A  K  +  + +W L    A  FGLGTWQ+ RR+ K++++   + R++ EP+ + +  P+  K
Subjt:  CATPSP-----SSSSSFSSAAAVSSAPDPQSSSLSQ-AQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDK

Query:  SDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPS
          +LE+R V  +G FD  K +Y+ PR+                TE+G YV+TP          +   +LVNRG+VPR                   + P 
Subjt:  SDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPS

Query:  LVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTL
          Q+ +                     +  ++++G+VR +E    FVP N+P  S W+Y D+ A+A+ +G   D I+++    +  P    PI       
Subjt:  LVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTL

Query:  IRSSVMPQDHLNYTLTWY
        +R+     +H+ Y +TWY
Subjt:  IRSSVMPQDHLNYTLTWY

Q9SE51 Surfeit locus protein 11.9e-9856.29Show/hide
Query:  SPSFAKSVTKFRPCLSLSRLCATP-----------SPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVE
        S    +S TK   C + + + A+P           S  + SS SS+AA+ S    QSSS +  Q+ +R S+WS+ LLFLPGA+TFGLG+WQI RR+EK +
Subjt:  SPSFAKSVTKFRPCLSLSRLCATP-----------SPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVE

Query:  MLDFRQKRLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALE
         L+++Q+RL MEP+ +N   PL    + LEFRRV CKGVFDE++SIY+GPRSRSISG+TENG +VITPLMPIPG  DS+QSP+LVNRGWVPRSW++K+ E
Subjt:  MLDFRQKRLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALE

Query:  -VDKQGIEQSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINEN
          + + I   S  A S    +E  SWWKFWSK     +  ++ + P+EV+GV+R  E PSIFVP+NDP++ QWFYVDVPA+ARA GLPE+TIYVED++E+
Subjt:  -VDKQGIEQSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINEN

Query:  VNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY
        V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWY
Subjt:  VNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY

Arabidopsis top hitse value%identityAlignment
AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein1.7e-5743.55Show/hide
Query:  RLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDL
        RL +     SSS+ S+  A S   + +S  LS A    ++ R S  L +L G  T+GLG    F  Q +VE LD R++ L M+P+ +N+   L    D L
Subjt:  RLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDL

Query:  EFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQG----IEQSSDIAPSLVQESERSSW
         FRRV+CKG+FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGWVP  WK+ +LE    G      + S  A  L+  S++S  
Subjt:  EFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQG----IEQSSDIAPSLVQESERSSW

Query:  WKFWSKKTNNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQ
         KFW K  N +  E++V+    +EV+GVVR SE P I+   N P+S  WFY+DVP +A A G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P 
Subjt:  WKFWSKKTNNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQ

Query:  DHLNYTLTWY
        D+  YT+ W+
Subjt:  DHLNYTLTWY

AT1G48510.2 Surfeit locus 1 cytochrome c oxidase biogenesis protein7.7e-5543.81Show/hide
Query:  RLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDL
        RL +     SSS+ S+  A S   + +S  LS A    ++ R S  L +L G  T+GLG    F  Q +VE LD R++ L M+P+ +N+   L    D L
Subjt:  RLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSIFPLGDKSDDL

Query:  EFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQG----IEQSSDIAPSLVQESERSSW
         FRRV+CKG+FDE++SIYVGP+ RS+S  +E G YVITPL+PIP  P+S++SP+LVNRGWVP  WK+ +LE    G      + S  A  L+  S++S  
Subjt:  EFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQG----IEQSSDIAPSLVQESERSSW

Query:  WKFWSKKTNNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP
         KFW K  N +  E++V+    +EV+GVVR SE P I+   N P+S  WFY+DVP +A A G  EDT+Y+E    +++ S  YP+P+DV  L RS+ +P
Subjt:  WKFWSKKTNNL--ENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMP

AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein1.3e-9956.29Show/hide
Query:  SPSFAKSVTKFRPCLSLSRLCATP-----------SPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVE
        S    +S TK   C + + + A+P           S  + SS SS+AA+ S    QSSS +  Q+ +R S+WS+ LLFLPGA+TFGLG+WQI RR+EK +
Subjt:  SPSFAKSVTKFRPCLSLSRLCATP-----------SPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVE

Query:  MLDFRQKRLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALE
         L+++Q+RL MEP+ +N   PL    + LEFRRV CKGVFDE++SIY+GPRSRSISG+TENG +VITPLMPIPG  DS+QSP+LVNRGWVPRSW++K+ E
Subjt:  MLDFRQKRLLMEPVNINSIFPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALE

Query:  -VDKQGIEQSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINEN
          + + I   S  A S    +E  SWWKFWSK     +  ++ + P+EV+GV+R  E PSIFVP+NDP++ QWFYVDVPA+ARA GLPE+TIYVED++E+
Subjt:  -VDKQGIEQSSDIAPSLVQESERSSWWKFWSKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINEN

Query:  VNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY
        V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWY
Subjt:  VNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCACCTTCTTTCGCTAAATCCGTAACAAAATTTCGTCCTTGTCTTTCCCTTTCCCGCCTCTGCGCGACGCCTTCACCTTCGTCTTCTTCCTCGTTCAGCTCTGC
CGCTGCAGTTTCCTCTGCTCCCGATCCTCAGTCATCTTCCCTTTCACAAGCTCAACAGAAACAAAGAGAGTCGAGATGGTCAAAATGGCTCCTGTTTCTACCTGGTGCTC
TCACGTTTGGCCTTGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGGTAGAAATGCTAGATTTCAGGCAGAAGCGATTGTTAATGGAACCTGTCAACATAAACAGCATA
TTTCCTTTGGGAGACAAGTCAGATGATCTGGAGTTCAGGAGGGTAATATGTAAAGGAGTTTTCGATGAGAAAAAATCAATCTATGTTGGTCCACGCTCGAGAAGCATTTC
CGGAGTGACTGAAAATGGCCATTATGTTATTACCCCATTGATGCCCATTCCCGGCATACCTGATAGTGTGCAGTCACCGGTTCTGGTTAATAGAGGATGGGTTCCACGCA
GCTGGAAGGATAAGGCTTTAGAAGTAGATAAACAAGGTATTGAGCAGTCTTCAGATATTGCGCCCTCCTTGGTTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGG
TCAAAAAAGACTAATAATCTAGAGAATGAAGTCACTCCCATTACTCCAATAGAAGTTATTGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTTCCAGCAAATGA
TCCCGCCTCTAGCCAGTGGTTCTATGTTGATGTTCCTGCAATTGCACGTGCTTCTGGTCTTCCTGAGGATACTATTTATGTGGAAGACATAAATGAGAATGTGAACCCAA
GTGACCCATACCCCATTCCAAAGGATGTTAATACCTTGATACGAAGTTCTGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTATGTCGTTACCTTTTTTCAT
ATTTCGTTTCCATGTAAAGCTCACTACTTAGATGCATATAAGTATAACCATGAAGTATTCCTAAGCATGAAGTCATTTTCTGAAGAACAAAAAATGACGAGAGAATTAAG
ATGCTGCATTCTATCTAATTCCCATTTGGTCGACTGCCAAATGTTGGTTGATAGCACAGCACCTGTTCGTGCTGCGAAGAAAGTACTCACTCTCTGCTGCTGTAACCTTC
ATGGCATTCAAAAGACTGAGGCAGAAAACGAGTCAAAGATAGCAAAAACAATAACAGAGCGCCGGCTGCCGACGCTTCTTCCTCAGTTTGCGCCGGCGGAAGCACTCTCC
CGTTGGCGAACAACATTCGCCCATTCCTCTGTTGATCTACCACGCCTTCCAGATCCACCGCGTTCCCGTAAAAAGCGCCGTCTCCGAATCTCCGTTCCACCTCGAGAAAT
TCGCCCAGAGATGCCGGCTTCACGCCATCGTCCCTACACAGCCGCCACCATCTTCGCTTGCGATCTGCAACCAGTGCCGGCGTCGTTGTCGTTTTTCTCTTCGGCTTCTT
GGTCCTGCGGGAAGGGCCGGCGACGGCGGCGTTGGCAGGGGTGGTGGCGTGTTGGTCTTGGTTCTGGGAGGGCACTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTCACCTTCTTTCGCTAAATCCGTAACAAAATTTCGTCCTTGTCTTTCCCTTTCCCGCCTCTGCGCGACGCCTTCACCTTCGTCTTCTTCCTCGTTCAGCTCTGC
CGCTGCAGTTTCCTCTGCTCCCGATCCTCAGTCATCTTCCCTTTCACAAGCTCAACAGAAACAAAGAGAGTCGAGATGGTCAAAATGGCTCCTGTTTCTACCTGGTGCTC
TCACGTTTGGCCTTGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGGTAGAAATGCTAGATTTCAGGCAGAAGCGATTGTTAATGGAACCTGTCAACATAAACAGCATA
TTTCCTTTGGGAGACAAGTCAGATGATCTGGAGTTCAGGAGGGTAATATGTAAAGGAGTTTTCGATGAGAAAAAATCAATCTATGTTGGTCCACGCTCGAGAAGCATTTC
CGGAGTGACTGAAAATGGCCATTATGTTATTACCCCATTGATGCCCATTCCCGGCATACCTGATAGTGTGCAGTCACCGGTTCTGGTTAATAGAGGATGGGTTCCACGCA
GCTGGAAGGATAAGGCTTTAGAAGTAGATAAACAAGGTATTGAGCAGTCTTCAGATATTGCGCCCTCCTTGGTTCAAGAGAGTGAGAGAAGCTCTTGGTGGAAGTTCTGG
TCAAAAAAGACTAATAATCTAGAGAATGAAGTCACTCCCATTACTCCAATAGAAGTTATTGGAGTAGTTCGCACAAGTGAGAAGCCTAGCATATTTGTTCCAGCAAATGA
TCCCGCCTCTAGCCAGTGGTTCTATGTTGATGTTCCTGCAATTGCACGTGCTTCTGGTCTTCCTGAGGATACTATTTATGTGGAAGACATAAATGAGAATGTGAACCCAA
GTGACCCATACCCCATTCCAAAGGATGTTAATACCTTGATACGAAGTTCTGTTATGCCACAGGATCATCTTAATTACACATTAACATGGTATGTCGTTACCTTTTTTCAT
ATTTCGTTTCCATGTAAAGCTCACTACTTAGATGCATATAAGTATAACCATGAAGTATTCCTAAGCATGAAGTCATTTTCTGAAGAACAAAAAATGACGAGAGAATTAAG
ATGCTGCATTCTATCTAATTCCCATTTGGTCGACTGCCAAATGTTGGTTGATAGCACAGCACCTGTTCGTGCTGCGAAGAAAGTACTCACTCTCTGCTGCTGTAACCTTC
ATGGCATTCAAAAGACTGAGGCAGAAAACGAGTCAAAGATAGCAAAAACAATAACAGAGCGCCGGCTGCCGACGCTTCTTCCTCAGTTTGCGCCGGCGGAAGCACTCTCC
CGTTGGCGAACAACATTCGCCCATTCCTCTGTTGATCTACCACGCCTTCCAGATCCACCGCGTTCCCGTAAAAAGCGCCGTCTCCGAATCTCCGTTCCACCTCGAGAAAT
TCGCCCAGAGATGCCGGCTTCACGCCATCGTCCCTACACAGCCGCCACCATCTTCGCTTGCGATCTGCAACCAGTGCCGGCGTCGTTGTCGTTTTTCTCTTCGGCTTCTT
GGTCCTGCGGGAAGGGCCGGCGACGGCGGCGTTGGCAGGGGTGGTGGCGTGTTGGTCTTGGTTCTGGGAGGGCACTCTGA
Protein sequenceShow/hide protein sequence
MASPSFAKSVTKFRPCLSLSRLCATPSPSSSSSFSSAAAVSSAPDPQSSSLSQAQQKQRESRWSKWLLFLPGALTFGLGTWQIFRRQEKVEMLDFRQKRLLMEPVNINSI
FPLGDKSDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGIPDSVQSPVLVNRGWVPRSWKDKALEVDKQGIEQSSDIAPSLVQESERSSWWKFW
SKKTNNLENEVTPITPIEVIGVVRTSEKPSIFVPANDPASSQWFYVDVPAIARASGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYVVTFFH
ISFPCKAHYLDAYKYNHEVFLSMKSFSEEQKMTRELRCCILSNSHLVDCQMLVDSTAPVRAAKKVLTLCCCNLHGIQKTEAENESKIAKTITERRLPTLLPQFAPAEALS
RWRTTFAHSSVDLPRLPDPPRSRKKRRLRISVPPREIRPEMPASRHRPYTAATIFACDLQPVPASLSFFSSASWSCGKGRRRRRWQGWWRVGLGSGRAL