; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0016580 (gene) of Chayote v1 genome

Gene IDSed0016580
OrganismSechium edule (Chayote v1)
DescriptionS1 motif domain-containing protein
Genome locationLG01:68261842..68276627
RNA-Seq ExpressionSed0016580
SyntenySed0016580
Gene Ontology termsGO:0034337 - RNA folding (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042780.1 protein MLP1 [Cucumis melo var. makuwa]0.0e+0080.83Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFF P+DL RPRR +VRNLCFNGR +KFSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL KEMKLKA NKP   D+KKPSQA+ K   SPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSL+MSN  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDE EKLSGD  VD VENIEN+AS GS+SDR+D F LSKKP+IGGDKTS E END          TVD K++N I  LYI++RP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN
          SEET   SS+ EN  DID  ++GLQ  +PSDI   E+PAA SE F+DILDSTI++  +A+LLGKP+R D  S ET K +R+EAS P  D++GA+ET +
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPSTY+QNLGTIGS   G+Q FAST+
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK

Query:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE
         DSEIDVK G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVLANRKSRKLVFS+RPKEREELVEKKRSLM  LQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGRL+SA+VDTEWADV+SLI ELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI

Query:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

XP_008437231.1 PREDICTED: uncharacterized protein LOC103482723 isoform X1 [Cucumis melo]0.0e+0080.83Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFF P+DL RPRR +VRNLCFNGR +KFSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL KEMKLKA NKP   D+KKPSQA+ K   SPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSL+MSN  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDE EKLSGD  VD VENIEN+AS GS+SDR+D F LSKKP+IGGDKTS E END          TVD K++N I  LYI++RP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN
          SEET   SS+ EN  DID  ++GLQ  +PSDI   E+PAA SE F+DILDSTI++  +A+LLGKP+R D  S ET K +R+EAS P  D++GA+ET +
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPSTY+QNLGTIGS   GSQ FAST+
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK

Query:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE
         DSEIDVK G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKV VVLANRKSRKLVFS+RPKEREELVEKKRSLM  LQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGRL+SA+VDTEWADV+SLI ELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI

Query:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

XP_011654764.1 uncharacterized protein LOC101217667 [Cucumis sativus]0.0e+0080.46Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFFTP+DL RPRR +VRNLCFNGRP+KFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNLVRPQL KEMKLKA NKP   D+KKPSQA+ K   SPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSL+MSN STKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDE EKLSGD  VD VENIEN AS   TSDR+D F LSKKP+IGGD+T  E          SES  VD K++N I  LYI+KRP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQP-RKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETE
          SEET   SS+ EN  DID  A+GLQ   +PSDI   E+PAA SE FSDILD TI+   +A+LLGKP+R D  S ET K +R+E S P  DV+GA ETE
Subjt:  -TSEETVEASSSKENRIDID--ALGLQP-RKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETE

Query:  NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAST
        NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPS Y+QNLGTIGS D GSQ FAST
Subjt:  NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAST

Query:  KPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFV
        +PDSEIDVK G ELTPDMKLEDLLQIY++EK+KFLSSFVGQKIKVNVVLANRKSRKL+FSIRPKER++LV+KKRSLM  LQVGDVVKCCIKKIAYFGIFV
Subjt:  KPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFV

Query:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEG
        EIEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGRL+S ++DTEWADV+SL+ ELQNIEG
Subjt:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEG

Query:  IESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        IE+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  IESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

XP_022159776.1 uncharacterized protein LOC111026097 isoform X1 [Momordica charantia]0.0e+0080.75Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFFT +DL RPRR +VRNLC NGRP++FSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL K+MKLKA  KP  SD+KKPSQA+AK A SPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT +E I++KEK SGDE VDI +NI+++AS GS+SD VD F LSKKP+I GD+    +END  NL  SES T+D KKEN I+ LYI+KRP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASI--PGIDVDGAVET
          S ET EA+SS ENR DID  AL LQP +PS++   EDPAA SEPFSDILD ++KL   A LLGKP+R D  SNETLK S +EA +  P  DV+ AVET
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASI--PGIDVDGAVET

Query:  ENFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAS
        +NFSAIPAL+EHELADWT+AEDLAK GDRA+VEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPS Y+QNLGTIGS DSGSQNF+S
Subjt:  ENFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAS

Query:  TKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIF
        T+PD E DVK+G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVL NRKSRKL+FSIRPKEREELVEKKRSLMA LQVGDVVKCCIKKIAYFGIF
Subjt:  TKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIF

Query:  VEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIE
        VEIEGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV D D MDGRLE A+VDTEWADV+SL  ELQNIE
Subjt:  VEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIE

Query:  GIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        G+E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ +Q
Subjt:  GIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

XP_038874720.1 uncharacterized protein LOC120067263 [Benincasa hispida]0.0e+0081.24Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFFTP+DL RPRR +VRNLCFNGR +KFSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKK NPD SYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNLVRPQL KEMKLKA +KP A DLKKPSQ + K A SPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL+MSNASTKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDEKEKLSG+  VDIVENIEN+ S GSTSD  D F LSKKP+IGGDKT F           +ES TVD K END++ +YI+KRP N   
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDIDALGLQPRKPSDIEDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETENFSAIP
          SEET   SS+ EN  DID   +       ++D A  SE FSDILDST+K+  +A+LLGKP+R D  S ET+K SR+EAS P  DV GAVETENFSAIP
Subjt:  -TSEETVEASSSKENRIDIDALGLQPRKPSDIEDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETENFSAIP

Query:  ALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTKPDSEI
        ALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPS Y+QNLGTIGS D  SQ FAST+ DSEI
Subjt:  ALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTKPDSEI

Query:  DVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVP
        DVK G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVLANRKSRKL+FS+RPKEREELVEKKRSLMA LQVGDVVKCCIKKIAYFGIFVEIEGVP
Subjt:  DVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVP

Query:  ALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGIESVSK
        ALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGR +SA+VDTEW+DV+SL+ ELQN +GIE+VSK
Subjt:  ALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGIESVSK

Query:  GRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        GRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  GRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

TrEMBL top hitse value%identityAlignment
A0A0A0KKL1 S1 motif domain-containing protein0.0e+0080.46Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFFTP+DL RPRR +VRNLCFNGRP+KFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNLVRPQL KEMKLKA NKP   D+KKPSQA+ K   SPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSL+MSN STKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDE EKLSGD  VD VENIEN AS   TSDR+D F LSKKP+IGGD+T  E          SES  VD K++N I  LYI+KRP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQP-RKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETE
          SEET   SS+ EN  DID  A+GLQ   +PSDI   E+PAA SE FSDILD TI+   +A+LLGKP+R D  S ET K +R+E S P  DV+GA ETE
Subjt:  -TSEETVEASSSKENRIDID--ALGLQP-RKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETE

Query:  NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAST
        NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPS Y+QNLGTIGS D GSQ FAST
Subjt:  NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAST

Query:  KPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFV
        +PDSEIDVK G ELTPDMKLEDLLQIY++EK+KFLSSFVGQKIKVNVVLANRKSRKL+FSIRPKER++LV+KKRSLM  LQVGDVVKCCIKKIAYFGIFV
Subjt:  KPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFV

Query:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEG
        EIEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGRL+S ++DTEWADV+SL+ ELQNIEG
Subjt:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEG

Query:  IESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        IE+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  IESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

A0A1S3AU41 uncharacterized protein LOC103482723 isoform X10.0e+0080.83Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFF P+DL RPRR +VRNLCFNGR +KFSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL KEMKLKA NKP   D+KKPSQA+ K   SPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSL+MSN  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDE EKLSGD  VD VENIEN+AS GS+SDR+D F LSKKP+IGGDKTS E END          TVD K++N I  LYI++RP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN
          SEET   SS+ EN  DID  ++GLQ  +PSDI   E+PAA SE F+DILDSTI++  +A+LLGKP+R D  S ET K +R+EAS P  D++GA+ET +
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPSTY+QNLGTIGS   GSQ FAST+
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK

Query:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE
         DSEIDVK G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKV VVLANRKSRKLVFS+RPKEREELVEKKRSLM  LQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGRL+SA+VDTEWADV+SLI ELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI

Query:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

A0A5A7THK9 Protein MLP10.0e+0080.83Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFF P+DL RPRR +VRNLCFNGR +KFSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL KEMKLKA NKP   D+KKPSQA+ K   SPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSL+MSN  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT NEVIDE EKLSGD  VD VENIEN+AS GS+SDR+D F LSKKP+IGGDKTS E END          TVD K++N I  LYI++RP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN
          SEET   SS+ EN  DID  ++GLQ  +PSDI   E+PAA SE F+DILDSTI++  +A+LLGKP+R D  S ET K +R+EAS P  D++GA+ET +
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASIPGIDVDGAVETEN

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPSTY+QNLGTIGS   G+Q FAST+
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTK

Query:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE
         DSEIDVK G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVLANRKSRKLVFS+RPKEREELVEKKRSLM  LQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVVGD D MDGRL+SA+VDTEWADV+SLI ELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEGI

Query:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++
Subjt:  ESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

A0A6J1E0R7 uncharacterized protein LOC111026097 isoform X20.0e+0079.79Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFFT +DL RPRR +VRNLC NGRP++FSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL K+MKLKA  KP  SD+KKPSQA+AK A SPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT +E I++KEK SGDE VDI +NI+++AS GS+SD VD F LSKKP+I GD+    +END  NL  SES T+D KKEN I+ LYI+KRP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKRDDPSNETLKFSRDEASI--PGIDVDGAVETE
          S ET EA+SS ENR DID  AL LQP +PS++   EDPAA SEPFSDILD ++KL   A LLGKP+           S +EA +  P  DV+ AVET+
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKRDDPSNETLKFSRDEASI--PGIDVDGAVETE

Query:  NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAST
        NFSAIPAL+EHELADWT+AEDLAK GDRA+VEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPS Y+QNLGTIGS DSGSQNF+ST
Subjt:  NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAST

Query:  KPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFV
        +PD E DVK+G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVL NRKSRKL+FSIRPKEREELVEKKRSLMA LQVGDVVKCCIKKIAYFGIFV
Subjt:  KPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFV

Query:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEG
        EIEGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV D D MDGRLE A+VDTEWADV+SL  ELQNIEG
Subjt:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIEG

Query:  IESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        +E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ +Q
Subjt:  IESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

A0A6J1E4Z1 uncharacterized protein LOC111026097 isoform X10.0e+0080.75Show/hide
Query:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK
        MDGR LTA SSFFT +DL RPRR +VRNLC NGRP++FSVLASKEEAELDRWDQMELKFGRLIGEDPKLT+AKIMSKKMNPDASYLEVEKSFYQKKGKS 
Subjt:  MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSK

Query:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL
        EVEE+SLDGLNL+RPQL K+MKLKA  KP  SD+KKPSQA+AK A SPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKEKYSDMTLL
Subjt:  EVEEISLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLL

Query:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--
        RKPEPMT +E I++KEK SGDE VDI +NI+++AS GS+SD VD F LSKKP+I GD+    +END  NL  SES T+D KKEN I+ LYI+KRP N+  
Subjt:  RKPEPMTPNEVIDEKEKLSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNM--

Query:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASI--PGIDVDGAVET
          S ET EA+SS ENR DID  AL LQP +PS++   EDPAA SEPFSDILD ++KL   A LLGKP+R D  SNETLK S +EA +  P  DV+ AVET
Subjt:  -TSEETVEASSSKENRIDID--ALGLQPRKPSDI---EDPAASSEPFSDILDSTIKLFNRASLLGKPKR-DDPSNETLKFSRDEASI--PGIDVDGAVET

Query:  ENFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAS
        +NFSAIPAL+EHELADWT+AEDLAK GDRA+VEVISSS RGFVVSFGS+VGFIPYRNLAAKWKFLAFESWLRQKGLDPS Y+QNLGTIGS DSGSQNF+S
Subjt:  ENFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFAS

Query:  TKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIF
        T+PD E DVK+G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVL NRKSRKL+FSIRPKEREELVEKKRSLMA LQVGDVVKCCIKKIAYFGIF
Subjt:  TKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIF

Query:  VEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIE
        VEIEGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV D D MDGRLE A+VDTEWADV+SL  ELQNIE
Subjt:  VEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRLESADVDTEWADVDSLIMELQNIE

Query:  GIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ
        G+E+VSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ +Q
Subjt:  GIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQ

SwissProt top hitse value%identityAlignment
Q4L6I1 30S ribosomal protein S14.1e-1432.62Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK
        S F GQ I++ V   + ++ +++ S +  E+ E   KK SL+  L  GDV+K  + ++  FG FV+I GV  L+H +E+S +   +P     +GQ V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK

Query:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRL
        V  ++   ERI LS+K   P P  E+++    + D ++G++
Subjt:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDGRL

Q5HFU7 30S ribosomal protein S12.7e-1332.37Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK
        S F GQ I++ V   + ++ +++ S +  E+EE   KK  L+  L  GDV+   + ++  FG F++I GV  L+H +E+S +    P     IGQ V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK

Query:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDG
        +  +D   ERI LS+K   P P  E ++    + D ++G
Subjt:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDG

Q7A5J0 30S ribosomal protein S12.7e-1332.37Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK
        S F GQ I++ V   + ++ +++ S +  E+EE   KK  L+  L  GDV+   + ++  FG F++I GV  L+H +E+S +    P     IGQ V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK

Query:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDG
        +  +D   ERI LS+K   P P  E ++    + D ++G
Subjt:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDG

Q99U14 30S ribosomal protein S12.7e-1332.37Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK
        S F GQ I++ V   + ++ +++ S +  E+EE   KK  L+  L  GDV+   + ++  FG F++I GV  L+H +E+S +    P     IGQ V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAK

Query:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDG
        +  +D   ERI LS+K   P P  E ++    + D ++G
Subjt:  VHQLDFSLERIFLSLKQITPDPLTETLESVVGDQDYMDG

Q9JZ44 30S ribosomal protein S13.7e-1533.33Show/hide
Query:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASY
        + D   +K  S F G++I+  V+  ++K   +V S R      L E++++L+  LQ G V+K  +K I  +G FV++ G+  L+H T+++W    +P+  
Subjt:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASY

Query:  FKIGQVVEAKVHQLDFSLERIFLSLKQITPDP
         ++GQ VEAKV + D   +R+ L +KQ+  DP
Subjt:  FKIGQVVEAKVHQLDFSLERIFLSLKQITPDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein7.5e-18951.79Show/hide
Query:  MDGRTLTASSSFFTPVD------LFRPRRVSVRNLCFN--GRPTKFSVLASK-EEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKS
        MD   L++S+S   P        L  P RV VR    N   +  KF V ASK EE +L+ WDQMEL FGRL+GEDPKLT+AKI+++K++P+AS++++EKS
Subjt:  MDGRTLTASSSFFTPVD------LFRPRRVSVRNLCFN--GRPTKFSVLASK-EEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKS

Query:  FYQKKGKSKEVEEI--------------SLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKP----AGSPK-GRVPNVILRKPTIYNEDDVEDKP
        FY+ KGK  EVEEI              SLDGL LV+P L   +K +   +P    +KKPS  + KP      +PK  R+PNVILRKP+ +   + +D+ 
Subjt:  FYQKKGKSKEVEEI--------------SLDGLNLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKP----AGSPK-GRVPNVILRKPTIYNEDDVEDKP

Query:  SRIRMKPNLSLRMSNASTKEKYSDMTLLRKPEPMTPNEVIDEKEK-LSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDE-NDHNNL
        S++R+KPNL+L+M N    E++SDMTLLRKPEP+  + V +E++K LS D  ++     E +   G+ S    ++ L +KP+      + E+E  D   +
Subjt:  SRIRMKPNLSLRMSNASTKEKYSDMTLLRKPEPMTPNEVIDEKEK-LSGDENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDE-NDHNNL

Query:  ANSESGTVDEKKENDIKGLYIVKRPSNMTSEETVEASSSKENRIDIDALGLQPRKPSDIEDPAASSEPFSDILDSTI-KLFNRASLLGKPKRDDPSNETL
         +SE      +K      L  +++   +     +E+S  + N I  +   +Q       E+   +S+P   I    I +    ASL GKP+R DPS+   
Subjt:  ANSESGTVDEKKENDIKGLYIVKRPSNMTSEETVEASSSKENRIDIDALGLQPRKPSDIEDPAASSEPFSDILDSTI-KLFNRASLLGKPKRDDPSNETL

Query:  KFSRDEASIPGIDVDGAVETE------NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQK
             E S+P I     V  E           P     E  DW KAE L K+  RADVE+ISSS RGF VS+GS++GF+PYRNLAAKWKFLAFESWLR+K
Subjt:  KFSRDEASIPGIDVDGAVETE------NFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQK

Query:  GLDPSTYRQNLGTIGSIDSGSQN-FASTKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEK
        G+DPS YRQNLG IG  D  S++    +  DSE+      E++ DMKLEDLL +YDREK KFLSSFVGQKIKVNVV+ANR SRKL+FS+RP+E EE VEK
Subjt:  GLDPSTYRQNLGTIGSIDSGSQN-FASTKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEK

Query:  KRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV-GDQDY
        KR+LMAKL+VGDVVKCCIKKI YFGIF E+EGVPAL+HQ+EVSWDATL+PASYFKIGQ+VEAKVHQLDF+LERIFLSLK+ITPDPLTE LESVV GD D 
Subjt:  KRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV-GDQDY

Query:  MDGRLESADVDTEWADVDSLIMELQNIEGIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYS
        + GRL++A++D EW DV+SLI EL+ +EGI+SVSK RFFLSPGLAPTFQVYMA M+ENQYKLLAR+GN+VQE+ ++   S
Subjt:  MDGRLESADVDTEWADVDSLIMELQNIEGIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYS

AT3G23700.1 Nucleic acid-binding proteins superfamily4.5e-1625.22Show/hide
Query:  ADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTKPDSEIDVKSGEE
        +DW  A+   KSGD  + EV   +  G ++ F S+VGF+PY  L                                    S + +  +P   I       
Subjt:  ADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVGFIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTKPDSEIDVKSGEE

Query:  LTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIE------GVPA
                           +   + VG K+ V VV A+ ++RKL+ S    E+  L  K       + VGDV    +  +  +G F+ +        +  
Subjt:  LTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIE------GVPA

Query:  LIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV-GDQDYMDGRLESADVDT--EWADVDSLIMELQNIEGIESV
        L+H +EVSWD   +     + G  V   V  +D    RI LS+KQ+  DPL ETL+ V+  D       L S + DT      +++++ EL   +GIE+V
Subjt:  LIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVV-GDQDYMDGRLESADVDT--EWADVDSLIMELQNIEGIESV

Query:  SKGR-FFLSPGLAPTFQVYMASM--YENQYKLLARSGNKVQEVNI
           R  F    ++   Q+++++    + ++ LLAR+G +VQE+++
Subjt:  SKGR-FFLSPGLAPTFQVYMASM--YENQYKLLARSGNKVQEVNI

AT5G30510.1 ribosomal protein S11.8e-0424.79Show/hide
Query:  QKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLD
        ++I +  V  + +  KLV S R    +          A+L +G VV   ++ +  +G F++I G+  L+H +++S D   + A+  + G  ++  +   D
Subjt:  QKIKVNVVLANRKSRKLVFSIRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLD

Query:  FSLERIFLSLKQITPDP
            R+ LS K++ P P
Subjt:  FSLERIFLSLKQITPDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGTCGAACTCTAACGGCCTCCTCCTCGTTCTTCACTCCCGTTGATTTATTCCGACCCAGAAGAGTATCCGTTAGAAATCTGTGTTTTAATGGCAGACCCACTAA
GTTTTCGGTTCTCGCTTCCAAAGAAGAGGCTGAGCTCGACCGCTGGGACCAAATGGAGCTCAAGTTCGGTCGCTTGATTGGGGAAGACCCCAAGTTAACAATGGCCAAGA
TAATGAGCAAAAAAATGAATCCGGATGCTTCATATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGTAAATCCAAGGAGGTAGAGGAAATTTCTCTTGATGGTTTG
AATTTGGTTAGACCTCAGTTAAATAAGGAAATGAAGTTGAAAGCTGTCAATAAGCCAACTGCATCAGATTTAAAGAAACCAAGCCAAGCAATTGCAAAGCCAGCAGGTAG
TCCAAAAGGCAGGGTTCCCAATGTTATCTTGAGGAAACCAACGATTTATAATGAGGACGATGTCGAAGACAAGCCATCGAGGATAAGAATGAAGCCGAATTTATCATTAA
GAATGAGCAATGCATCAACAAAGGAGAAATATAGTGATATGACACTGTTGAGGAAGCCGGAACCAATGACTCCAAATGAAGTTATTGACGAGAAGGAAAAGCTCTCTGGT
GATGAGAATGTTGATATTGTGGAGAATATTGAAAATAAAGCTAGTAATGGATCAACAAGTGACCGAGTTGATCGGTTTGCTCTTTCTAAGAAGCCAGATATAGGAGGTGA
CAAAACAAGTTTTGAAGATGAAAATGATCATAACAATCTTGCTAATTCAGAAAGTGGTACTGTTGATGAAAAGAAAGAGAATGACATCAAAGGCTTATACATTGTAAAAA
GGCCTTCAAATATGACGTCGGAAGAAACTGTAGAAGCCTCATCATCAAAAGAAAACAGGATAGATATTGATGCTCTAGGGTTACAGCCTCGTAAGCCGAGTGATATTGAG
GACCCAGCAGCTTCTAGTGAACCATTTAGTGATATTTTGGATTCAACTATAAAGTTGTTCAACAGAGCTTCATTATTGGGTAAACCAAAAAGAGATGATCCTTCGAATGA
AACATTAAAATTCAGTAGAGACGAGGCCTCCATCCCTGGAATTGACGTTGACGGTGCCGTTGAGACAGAGAACTTCTCTGCTATCCCTGCTTTGGAGGAACATGAACTTG
CTGACTGGACAAAAGCAGAGGATCTGGCAAAGTCGGGAGATAGAGCTGACGTGGAAGTAATAAGCTCGAGTGCCCGAGGTTTTGTTGTATCGTTCGGCTCCATAGTAGGG
TTTATTCCATATCGTAATCTTGCTGCCAAGTGGAAGTTCTTAGCTTTTGAGTCATGGCTAAGACAGAAAGGTTTAGATCCATCAACATACAGGCAAAACTTAGGAACTAT
TGGAAGTATTGATAGTGGAAGCCAAAATTTTGCCAGTACTAAGCCAGATTCAGAAATTGATGTAAAAAGTGGGGAAGAACTTACACCTGATATGAAATTGGAGGATCTAC
TTCAGATTTATGATAGAGAGAAACTCAAATTTTTGTCATCATTTGTTGGCCAGAAAATCAAAGTGAATGTGGTGTTGGCTAACAGAAAATCAAGGAAGCTGGTATTTTCC
ATAAGGCCAAAAGAAAGGGAAGAATTGGTCGAGAAAAAGAGGAGTTTAATGGCCAAGCTGCAAGTAGGTGATGTTGTCAAATGTTGCATCAAGAAGATTGCTTATTTTGG
TATTTTTGTCGAGATTGAAGGAGTGCCTGCTTTGATTCATCAGACAGAGGTTTCTTGGGATGCTACTTTAAACCCTGCATCATACTTTAAGATTGGTCAGGTTGTCGAGG
CAAAAGTTCACCAGTTGGATTTTTCACTCGAACGCATATTCTTATCGCTGAAGCAGATTACACCCGATCCGCTAACTGAAACATTAGAGTCTGTTGTTGGTGATCAAGAT
TACATGGATGGAAGATTAGAATCCGCCGATGTAGATACCGAGTGGGCTGATGTAGATTCTCTCATCATGGAACTGCAAAATATTGAAGGTATTGAGTCTGTATCCAAAGG
GCGATTTTTCCTTAGCCCTGGCTTGGCTCCGACCTTTCAGGTTTATATGGCTTCCATGTACGAAAATCAGTACAAATTGCTTGCTCGATCCGGAAACAAAGTACAAGAGG
TAAATATTCAGAAACATTATTCTTTTGCTATTGCAAGTCACTCATATTCTTATCAAGCTTGGGACTGTAAGTCAGATTGTACAATGGCATAA
mRNA sequenceShow/hide mRNA sequence
AAACGCCACGGTCTCGAAGTTCTGTATTTTCTATCGCCTTCTTCATACTTCCACAGTTTCTGCCAAAAAAAAAAAAAAAAAACCTCCATAGCTCCATAGCTCAATCTCAT
ATCCATAATCTCCTAAATTAAACCTTCCCCAAAACTCCAAAATCTCTAATTTCATTGAGTTGAATGACTCCAATCACTGTGAAGCAAGTGTAATCCACTCTGCAATTCCT
TATCCCAAGAAATGGACGGTCGAACTCTAACGGCCTCCTCCTCGTTCTTCACTCCCGTTGATTTATTCCGACCCAGAAGAGTATCCGTTAGAAATCTGTGTTTTAATGGC
AGACCCACTAAGTTTTCGGTTCTCGCTTCCAAAGAAGAGGCTGAGCTCGACCGCTGGGACCAAATGGAGCTCAAGTTCGGTCGCTTGATTGGGGAAGACCCCAAGTTAAC
AATGGCCAAGATAATGAGCAAAAAAATGAATCCGGATGCTTCATATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGTAAATCCAAGGAGGTAGAGGAAATTTCTC
TTGATGGTTTGAATTTGGTTAGACCTCAGTTAAATAAGGAAATGAAGTTGAAAGCTGTCAATAAGCCAACTGCATCAGATTTAAAGAAACCAAGCCAAGCAATTGCAAAG
CCAGCAGGTAGTCCAAAAGGCAGGGTTCCCAATGTTATCTTGAGGAAACCAACGATTTATAATGAGGACGATGTCGAAGACAAGCCATCGAGGATAAGAATGAAGCCGAA
TTTATCATTAAGAATGAGCAATGCATCAACAAAGGAGAAATATAGTGATATGACACTGTTGAGGAAGCCGGAACCAATGACTCCAAATGAAGTTATTGACGAGAAGGAAA
AGCTCTCTGGTGATGAGAATGTTGATATTGTGGAGAATATTGAAAATAAAGCTAGTAATGGATCAACAAGTGACCGAGTTGATCGGTTTGCTCTTTCTAAGAAGCCAGAT
ATAGGAGGTGACAAAACAAGTTTTGAAGATGAAAATGATCATAACAATCTTGCTAATTCAGAAAGTGGTACTGTTGATGAAAAGAAAGAGAATGACATCAAAGGCTTATA
CATTGTAAAAAGGCCTTCAAATATGACGTCGGAAGAAACTGTAGAAGCCTCATCATCAAAAGAAAACAGGATAGATATTGATGCTCTAGGGTTACAGCCTCGTAAGCCGA
GTGATATTGAGGACCCAGCAGCTTCTAGTGAACCATTTAGTGATATTTTGGATTCAACTATAAAGTTGTTCAACAGAGCTTCATTATTGGGTAAACCAAAAAGAGATGAT
CCTTCGAATGAAACATTAAAATTCAGTAGAGACGAGGCCTCCATCCCTGGAATTGACGTTGACGGTGCCGTTGAGACAGAGAACTTCTCTGCTATCCCTGCTTTGGAGGA
ACATGAACTTGCTGACTGGACAAAAGCAGAGGATCTGGCAAAGTCGGGAGATAGAGCTGACGTGGAAGTAATAAGCTCGAGTGCCCGAGGTTTTGTTGTATCGTTCGGCT
CCATAGTAGGGTTTATTCCATATCGTAATCTTGCTGCCAAGTGGAAGTTCTTAGCTTTTGAGTCATGGCTAAGACAGAAAGGTTTAGATCCATCAACATACAGGCAAAAC
TTAGGAACTATTGGAAGTATTGATAGTGGAAGCCAAAATTTTGCCAGTACTAAGCCAGATTCAGAAATTGATGTAAAAAGTGGGGAAGAACTTACACCTGATATGAAATT
GGAGGATCTACTTCAGATTTATGATAGAGAGAAACTCAAATTTTTGTCATCATTTGTTGGCCAGAAAATCAAAGTGAATGTGGTGTTGGCTAACAGAAAATCAAGGAAGC
TGGTATTTTCCATAAGGCCAAAAGAAAGGGAAGAATTGGTCGAGAAAAAGAGGAGTTTAATGGCCAAGCTGCAAGTAGGTGATGTTGTCAAATGTTGCATCAAGAAGATT
GCTTATTTTGGTATTTTTGTCGAGATTGAAGGAGTGCCTGCTTTGATTCATCAGACAGAGGTTTCTTGGGATGCTACTTTAAACCCTGCATCATACTTTAAGATTGGTCA
GGTTGTCGAGGCAAAAGTTCACCAGTTGGATTTTTCACTCGAACGCATATTCTTATCGCTGAAGCAGATTACACCCGATCCGCTAACTGAAACATTAGAGTCTGTTGTTG
GTGATCAAGATTACATGGATGGAAGATTAGAATCCGCCGATGTAGATACCGAGTGGGCTGATGTAGATTCTCTCATCATGGAACTGCAAAATATTGAAGGTATTGAGTCT
GTATCCAAAGGGCGATTTTTCCTTAGCCCTGGCTTGGCTCCGACCTTTCAGGTTTATATGGCTTCCATGTACGAAAATCAGTACAAATTGCTTGCTCGATCCGGAAACAA
AGTACAAGAGGTAAATATTCAGAAACATTATTCTTTTGCTATTGCAAGTCACTCATATTCTTATCAAGCTTGGGACTGTAAGTCAGATTGTACAATGGCATAAGTTATAG
AACAAGAGGTAAATATCCCTGGTTTCTGGATTTTATAATTTGTACATGTTAGTATGTAGTGTAGAATGTGTTCTCCGACATACTGACAATACAATTTCCTACGAGGTACT
TGGTTACTACATGTAGTAGAGTAAGATGTTTATTCTGTGAGAATAGTTGAGGTGTTGTGCAGGCTTGACTAAGTTGTAGGTTAGCTTGTGGATTATAGAGCCTGTGAGAT
TAACAAACTTGTAGCCTTGTGGATTAAACATAAGTTTGACTTCTCAACTCTAGATTACATAAGCTTGACTTATTAGACAAACACCCATATCTTCCTATAGTAAACCGACT
TAACCCTTTCAATTAAGCAACTCTATTAACCATCATGAAGTGACCTAGTGATCAAGAGGGTCTTTGAGAATTTAGGGGGTGTTTGGGCCACCAAGAATAAGGGGGAGAAT
GGGAATGGGGATGATTCCCCGTGTTGGTGGGCCACCAGGAATCATTCCTCCCATAATCACGTTTTCCAAAAATCACTTTTTTCATTCTCCCCCTAGCTACTTTTAATTCT
CTCTCCAACAATTTTTCATACATCACATCACTATTTTTCACTCACATCATATTCATTATTTTATTTTTCCCACAATCATTACCTTTATCTCAAACACAAATAACCATTCT
CCTCATGATCATTACCTTCACCCCAAACACAAATAATCATTACCCCCATAATCATTCTCCTCCTCATTACCATTCCCCTTGCCCCAAACACACCCTAAGAGATGATGAAT
TTAATCAATAGTGACCACCTACTTAAGAATTAATTTTCTACGAGTTTCCTTAGCACTAAAATGTTGTCGGGTGAGGCTATATGTTCTGTGTCATTAGTCGAGGTGCACAT
AAACTGGCCTGGACAGTCACAAATATATAAAAAAAAAATATCAGAAACTCTATTCGGACGTATACATCTACTTTTTTAAGACCATTCTAAGCTATTTGCCTATTAAAACT
TGTATAATCTCGTGATCTTTATTTTTGTACCTAATGAAGAATGGTATATTGAAACATGAAATATATATATATATATATATATATATATATATATAT
Protein sequenceShow/hide protein sequence
MDGRTLTASSSFFTPVDLFRPRRVSVRNLCFNGRPTKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTMAKIMSKKMNPDASYLEVEKSFYQKKGKSKEVEEISLDGL
NLVRPQLNKEMKLKAVNKPTASDLKKPSQAIAKPAGSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLRMSNASTKEKYSDMTLLRKPEPMTPNEVIDEKEKLSG
DENVDIVENIENKASNGSTSDRVDRFALSKKPDIGGDKTSFEDENDHNNLANSESGTVDEKKENDIKGLYIVKRPSNMTSEETVEASSSKENRIDIDALGLQPRKPSDIE
DPAASSEPFSDILDSTIKLFNRASLLGKPKRDDPSNETLKFSRDEASIPGIDVDGAVETENFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSARGFVVSFGSIVG
FIPYRNLAAKWKFLAFESWLRQKGLDPSTYRQNLGTIGSIDSGSQNFASTKPDSEIDVKSGEELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLVFS
IRPKEREELVEKKRSLMAKLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLTETLESVVGDQD
YMDGRLESADVDTEWADVDSLIMELQNIEGIESVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSFAIASHSYSYQAWDCKSDCTMA