; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10009159 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10009159
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionS1 motif domain-containing protein
Genome locationChr06:3098922..3104224
RNA-Seq ExpressionHG10009159
SyntenyHG10009159
Gene Ontology termsGO:0034337 - RNA folding (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domainsIPR003029 - S1 domain
IPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042780.1 protein MLP1 [Cucumis melo var. makuwa]0.0e+0088.79Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRN+CFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASKGS+SD++DDFTLSKKPEIGGDKT LE+EN          DTVD KEKN I+DLYIL++PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF
        +S+ETE+GSSTNE+ KDIDYS IGLQLHEPSDIDY +NPAALSESF+DILDSTIEVS KATLLGKPRRVDHSS+E  KL+REEASTPETD+NGA+ET  F
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGS+ GG+QAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        D EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSA+VDT+WADVESLIKELQNTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE

Query:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

KAE8647864.1 hypothetical protein Csa_000384 [Cucumis sativus]0.0e+0090.84Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR AVRN+CFNGRPSKFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSLKMSNVSTKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASK  TSD++DDFTLSKKPEIGGD+TRLE          SESD VD KEKN I+DLYILK+PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGL-QLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEF
        +S+ETE+GSSTNE+ KDIDYS IGL QLHEPSDIDY +NPAALSESFSDILD TIE S KATLLGKPRRVDHSSKE  KL+REE STPETDVNGA ETE 
Subjt:  MSKETELGSSTNEDRKDIDYSPIGL-QLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEF

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR

Query:  PDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE
        PD EIDVKDGGELTPDMKLEDLLQIY++EKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKER++LV+KKRSLM TLQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDS ++DT+WADVESL+KELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGI

Query:  EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE
        EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE
Subjt:  EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE

XP_008437231.1 PREDICTED: uncharacterized protein LOC103482723 isoform X1 [Cucumis melo]0.0e+0088.79Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRN+CFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASKGS+SD++DDFTLSKKPEIGGDKT LE+EN          DTVD KEKN I+DLYIL++PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF
        +S+ETE+GSSTNE+ KDIDYS IGLQLHEPSDIDY +NPAALSESF+DILDSTIEVS KATLLGKPRRVDHSS+E  KL+REEASTPETD+NGA+ET  F
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGSS GGSQAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        D EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKV VVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHD MDGRLDSA+VDT+WADVESLIKELQNTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE

Query:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

XP_011654764.1 uncharacterized protein LOC101217667 [Cucumis sativus]0.0e+0089.06Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR AVRN+CFNGRPSKFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSLKMSNVSTKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASK  TSD++DDFTLSKKPEIGGD+TRLE          SESD VD KEKN I+DLYILK+PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGL-QLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEF
        +S+ETE+GSSTNE+ KDIDYS IGL QLHEPSDIDY +NPAALSESFSDILD TIE S KATLLGKPRRVDHSSKE  KL+REE STPETDVNGA ETE 
Subjt:  MSKETELGSSTNEDRKDIDYSPIGL-QLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEF

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR

Query:  PDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE
        PD EIDVKDGGELTPDMKLEDLLQIY++EKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKER++LV+KKRSLM TLQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDS ++DT+WADVESL+KELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGI

Query:  EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

XP_038874720.1 uncharacterized protein LOC120067263 [Benincasa hispida]0.0e+0087.86Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR  VRN+CFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKK NPD SYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKE+KLKAA+KPPAPDLKKPSQ VGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSN STKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDEKEKLSG+GYVDIVENIEN  SKGSTSD  D FTLSKKPEIGGDKTR           ++ES+TVDGK +ND+ED+YILK+PLN MSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF
        MS+ETE+GSSTNE+ KDIDYS           IDY  + A LSESFSDILDST++VS KATLLGKPRRVDHSSKE +KLSREEASTPETDV+GAVETE F
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPS+YKQNLGTIGSSDG SQ FASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        D EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFS+RPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE
        EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGR DSA+VDT+W+DVESL+KELQNT+GIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE

Query:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

TrEMBL top hitse value%identityAlignment
A0A0A0KKL1 S1 motif domain-containing protein0.0e+0089.06Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFTPIDLLRPRR AVRN+CFNGRPSKFSVL+SKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNLVRPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPT YNEDDVEDKPSRIRMKPNLSLKMSNVSTKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASK  TSD++DDFTLSKKPEIGGD+TRLE          SESD VD KEKN I+DLYILK+PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGL-QLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEF
        +S+ETE+GSSTNE+ KDIDYS IGL QLHEPSDIDY +NPAALSESFSDILD TIE S KATLLGKPRRVDHSSKE  KL+REE STPETDVNGA ETE 
Subjt:  MSKETELGSSTNEDRKDIDYSPIGL-QLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEF

Query:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
        FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR
Subjt:  FSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTR

Query:  PDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE
        PD EIDVKDGGELTPDMKLEDLLQIY++EKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKER++LV+KKRSLM TLQVGDVVKCCIKKIAYFGIFVE
Subjt:  PDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVE

Query:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGI
        IEGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDS ++DT+WADVESL+KELQN EGI
Subjt:  IEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGI

Query:  EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

A0A1S3AU41 uncharacterized protein LOC103482723 isoform X10.0e+0088.79Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRN+CFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASKGS+SD++DDFTLSKKPEIGGDKT LE+EN          DTVD KEKN I+DLYIL++PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF
        +S+ETE+GSSTNE+ KDIDYS IGLQLHEPSDIDY +NPAALSESF+DILDSTIEVS KATLLGKPRRVDHSS+E  KL+REEASTPETD+NGA+ET  F
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGSS GGSQAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        D EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKV VVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHD MDGRLDSA+VDT+WADVESLIKELQNTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE

Query:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

A0A5A7THK9 Protein MLP10.0e+0088.79Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF PIDLLRPRR AVRN+CFNGR SKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKKE+KLKAANKPP PD+KKPSQ+VGKV VSPKGRVPNVILRKPTIYNEDDVEDKPSR+RMKPNLSLKMSNV TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTSNEVIDE EKLSGDGYVD VENIEN ASKGS+SD++DDFTLSKKPEIGGDKT LE+EN          DTVD KEKN I+DLYIL++PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF
        +S+ETE+GSSTNE+ KDIDYS IGLQLHEPSDIDY +NPAALSESF+DILDSTIEVS KATLLGKPRRVDHSS+E  KL+REEASTPETD+NGA+ET  F
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPS YKQNLGTIGS+ GG+QAFASTR 
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        D EIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKL+FS+RPKEREELVEKKRSLM TLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE
        EGVPALIHQTE+SWD  LNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSA+VDT+WADVESLIKELQNTEGIE
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE

Query:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
        AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ ++    +++ K+++K
Subjt:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

A0A6J1E0R7 uncharacterized protein LOC111026097 isoform X20.0e+0082.43Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFT IDLLRPRRGAVRN+C NGRPS+FSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKK++KLKAA KPP  D+KKPSQ+V K AVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTS+E I++KEK SGD YVDI +NI++ ASKGS+SD VDDFTLSKKPEI GD+ RL NEND KNL YSESDT+DGK++N IE+LYILK+PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF
         S ETE  +S+ E+R+DID   + LQ HEPS++ YD++PAA SE FSDILD ++++S +A LLGKP     S++EA+       +TPETDVN AVET+ F
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFF

Query:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP
        SAIPAL+EHELADWT+AEDLAK GDRA+VEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGS+D GSQ F+STRP
Subjt:  SAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRP

Query:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
        D E DVK+GGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVL NRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI
Subjt:  DPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEI

Query:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE
        EGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVV DHDPMDGRL+ A+VDT+WADVESL KELQN EG+E
Subjt:  EGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIE

Query:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSL
        AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ +Q        K ++
Subjt:  AVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSL

A0A6J1E4Z1 uncharacterized protein LOC111026097 isoform X10.0e+0083.4Show/hide
Query:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFT IDLLRPRRGAVRN+C NGRPS+FSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR
        VEELSLDGLNL+RPQLKK++KLKAA KPP  D+KKPSQ+V K AVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSL M+N  TKE+YSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLR

Query:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG
        KPEPMTS+E I++KEK SGD YVDI +NI++ ASKGS+SD VDDFTLSKKPEI GD+ RL NEND KNL YSESDT+DGK++N IE+LYILK+PLNVMSG
Subjt:  KPEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSG

Query:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEA--STPETDVNGAVETE
         S ETE  +S+ E+R+DID   + LQ HEPS++ YD++PAA SE FSDILD ++++S +A LLGKP R+DH S E LKLS EEA  +TPETDVN AVET+
Subjt:  MSKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEA--STPETDVNGAVETE

Query:  FFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFAST
         FSAIPAL+EHELADWT+AEDLAK GDRA+VEVISSSTRGFVVSFGSL+GFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGS+D GSQ F+ST
Subjt:  FFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFAST

Query:  RPDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFV
        RPD E DVK+GGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVL NRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFV
Subjt:  RPDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFV

Query:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEG
        EIEGVPAL+HQTEVSWDATLNPAS++KIGQVVEAKVHQLDFSLERIFLSLKQITPDPL E LESVV DHDPMDGRL+ A+VDT+WADVESL KELQN EG
Subjt:  EIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEG

Query:  IEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSL
        +EAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQE+ +Q        K ++
Subjt:  IEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSL

SwissProt top hitse value%identityAlignment
Q49XT0 30S ribosomal protein S17.0e-1432.87Show/hide
Query:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV
        I+  S F GQ +K+ V   +  + ++I S +  E  E  EKK  L+ +L  GDV++  + ++  FG FV+I GV  L+H +E+S +   +P     IG+ 
Subjt:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV

Query:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG
        V  K+  +D   ERI LS+K   P P  E+++    + D ++G
Subjt:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG

Q4L6I1 30S ribosomal protein S11.8e-1433.1Show/hide
Query:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV
        I+  S F GQ I++ V   + ++ ++I S +  E+ E   KK SL+ +L  GDV+K  + ++  FG FV+I GV  L+H +E+S +   +P     +GQ 
Subjt:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV

Query:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRL
        V+ KV  ++   ERI LS+K   P P  E+++    + D ++G++
Subjt:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRL

Q5HFU7 30S ribosomal protein S19.1e-1432.87Show/hide
Query:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV
        I+  S F GQ I++ V   + ++ ++I S +  E+EE   KK  L+ +L  GDV+   + ++  FG F++I GV  L+H +E+S +    P     IGQ 
Subjt:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV

Query:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG
        V+ K+  +D   ERI LS+K   P P  E ++    ++D ++G
Subjt:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG

Q6GGT5 30S ribosomal protein S17.0e-1432.87Show/hide
Query:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV
        I+  S F GQ I++ V   + ++ ++I S +  E+EE   KK  L+ +L  GDV+   + ++  FG F++I GV  L+H +E+S +    P     IGQ 
Subjt:  IKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQV

Query:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG
        V+ K+  +D   ERI LS+K   P P  E ++    ++D ++G
Subjt:  VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG

Q9JZ44 30S ribosomal protein S11.3e-1532.58Show/hide
Query:  IYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASY
        + D   +K  S F G++I+  V+  ++K   ++ S R      L E++++L+  LQ G V+K  +K I  +G FV++ G+  L+H T+++W    +P+  
Subjt:  IYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASY

Query:  FKIGQVVEAKVHQLDFSLERIFLSLKQITPDP
         ++GQ VEAKV + D   +R+ L +KQ+  DP
Subjt:  FKIGQVVEAKVHQLDFSLERIFLSLKQITPDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein1.2e-18952.96Show/hide
Query:  PRRGAVRNMCFN--GRPSKFSVLASK-EEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNEVEEL------------
        P R  VR    N   +  KF V ASK EE +L+ WDQMEL FGRL+GEDPKLTLAKI+++K++P+AS++++EKSFY+ KGK  EVEE+            
Subjt:  PRRGAVRNMCFN--GRPSKFSVLASK-EEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNEVEEL------------

Query:  --SLDGLNLVRPQLKKEIKL-KAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLRK
          SLDGL LV+P LK  +K  +   K P+P LKKP   V  VA     R+PNVILRKP+ +   + +D+ S++R+KPNL+LKM N    E++SDMTLLRK
Subjt:  --SLDGLNLVRPQLKKEIKL-KAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLRK

Query:  PEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSGM
        PEP+  + V +E++K   D        +E G  +G T      +TL +KPE      RL+  N  + +  S      G E ++I +  I +KP       
Subjt:  PEPMTSNEVIDEKEKLSGDGYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSGM

Query:  SKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDD---NPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETE
        + E E+  S   +  +I+ + I  ++   S++  ++   N   L    S  +  TI    +A+L GKP+R+D SS E    +  + S    +  G   + 
Subjt:  SKETELGSSTNEDRKDIDYSPIGLQLHEPSDIDYDD---NPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETE

Query:  FFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQA-FAS
             P     E  DW KAE L K+  RADVE+ISSSTRGF VS+GSLIGF+PYRNLAAKWKFLAFESWLR+KG+DPS Y+QNLG IG  D  S++    
Subjt:  FFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQA-FAS

Query:  TRPDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIF
        +  D E+     GE++ DMKLEDLL +YDREK KFLSSFVGQKIKVNVV+ANR SRKLIFS+RP+E EE VEKKR+LMA L+VGDVVKCCIKKI YFGIF
Subjt:  TRPDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIF

Query:  VEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLDSAKVDTQWADVESLIKELQNT
         E+EGVPAL+HQ+EVSWDATL+PASYFKIGQ+VEAKVHQLDF+LERIFLSLK+ITPDPL EALESVV GD+D + GRL +A++D +W DVESLIKEL+  
Subjt:  VEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLDSAKVDTQWADVESLIKELQNT

Query:  EGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSL
        EGI++VSK RFFLSPGLAPTFQVYMA M+ENQYKLLAR+GN+VQE+ ++   S    K ++
Subjt:  EGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSL

AT3G23700.1 Nucleic acid-binding proteins superfamily8.2e-1825.63Show/hide
Query:  ADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRPDPEIDVKDGGE
        +DW  A+   KSGD  + EV   +  G ++ F SL+GF+PY  L+                                             P    K+   
Subjt:  ADWTKAEDLAKSGDRADVEVISSSTRGFVVSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRPDPEIDVKDGGE

Query:  LTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIE------GVPA
          P   + ++ +           + VG K+ V VV A+ ++RKLI S    E+  L  K       + VGDV    +  +  +G F+ +        +  
Subjt:  LTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIE------GVPA

Query:  LIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLDSAKVDT--QWADVESLIKELQNTEGIEAV
        L+H +EVSWD   +     + G  V   V  +D    RI LS+KQ+  DPL E L+ V+  D       L S   DT      +E++++EL   +GIEAV
Subjt:  LIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLDSAKVDT--QWADVESLIKELQNTEGIEAV

Query:  SKGR-FFLSPGLAPTFQVYMASM--YENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK
           R  F    ++   Q+++++    + ++ LLAR+G +VQE+++         KK+L+
Subjt:  SKGR-FFLSPGLAPTFQVYMASM--YENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLK

AT5G30510.1 ribosomal protein S12.3e-0423.93Show/hide
Query:  QKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLD
        ++I +  V  + +  KL+ S R    +          A L +G VV   ++ +  +G F++I G+  L+H +++S D   + A+  + G  ++  +   D
Subjt:  QKIKVNVVLANRKSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLD

Query:  FSLERIFLSLKQITPDP
            R+ LS K++ P P
Subjt:  FSLERIFLSLKQITPDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGTCGCGCTCTAACGGCCTCCTCCTTCTTCACTCCCATTGATTTATTGCGACCCAGAAGAGGAGCTGTTAGAAATATGTGTTTTAATGGCAGACCCAGTAAGTT
TTCGGTTCTTGCTTCCAAAGAAGAGGCTGAACTCGACCGTTGGGACCAAATGGAGCTCAAGTTTGGCCGCTTGATTGGGGAAGACCCCAAATTAACATTGGCCAAGATAA
TGAGCAAAAAAATGAATCCAGATGCTTCTTATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGTAAGTCTAACGAGGTAGAGGAACTTTCTCTTGATGGTTTGAAT
TTGGTCAGACCTCAGTTAAAGAAGGAAATTAAGTTAAAAGCTGCCAATAAGCCACCTGCACCAGATTTAAAGAAACCAAGCCAATCAGTTGGAAAGGTAGCAGTTAGTCC
TAAAGGTAGGGTTCCCAATGTTATTTTGAGAAAACCAACGATTTATAATGAGGACGATGTTGAAGATAAGCCATCGCGGATAAGGATGAAGCCGAATTTATCATTGAAAA
TGAGCAATGTATCGACAAAGGAGCAATATAGTGATATGACACTGTTGAGGAAGCCTGAACCAATGACTTCAAATGAAGTTATTGATGAGAAGGAAAAGCTCTCTGGTGAT
GGGTATGTTGATATTGTTGAGAATATTGAAAATGGGGCTAGTAAGGGATCAACAAGTGACAAAGTTGATGATTTTACTCTTTCTAAGAAGCCAGAAATAGGAGGTGACAA
AACAAGACTTGAAAATGAAAATGATCACAAAAATCTGGCTTATTCAGAAAGTGATACGGTTGATGGTAAAGAGAAGAATGACATTGAAGACTTATACATCCTAAAAAAGC
CTTTAAATGTGATGTCTGGCATGTCTAAAGAAACTGAACTAGGCTCATCAACAAATGAAGACAGGAAAGATATTGATTATTCTCCTATAGGATTGCAGCTACATGAGCCA
AGTGATATAGATTATGATGATAACCCAGCAGCTTTGAGCGAATCATTCAGTGATATTTTGGATTCAACAATAGAGGTGTCCAACAAAGCTACATTATTGGGTAAACCAAG
AAGAGTAGATCATTCTTCTAAGGAAGCACTAAAACTCAGTAGAGAAGAGGCCTCCACCCCTGAAACGGACGTCAATGGTGCCGTTGAGACAGAGTTCTTCTCTGCTATCC
CTGCTTTGGAGGAACACGAACTTGCTGACTGGACTAAAGCAGAGGATCTGGCAAAGTCAGGAGACAGAGCTGACGTGGAAGTAATAAGCTCGAGTACTCGAGGTTTTGTT
GTATCATTCGGCTCCCTAATAGGATTTATTCCATATCGTAATCTTGCTGCCAAGTGGAAGTTCTTAGCTTTTGAGTCATGGTTAAGACAGAAAGGTTTGGATCCATCAAT
ATACAAGCAAAACTTAGGAACTATTGGAAGTAGTGATGGTGGAAGCCAGGCCTTTGCCAGTACTAGGCCGGATCCGGAAATTGATGTAAAAGATGGAGGAGAACTTACAC
CTGATATGAAACTGGAGGATCTTCTTCAAATTTATGATCGAGAGAAAATCAAATTCTTGTCATCATTTGTTGGCCAGAAAATCAAAGTAAATGTGGTGTTGGCGAACAGA
AAATCAAGGAAGCTTATATTTTCCATAAGGCCAAAAGAGAGGGAGGAATTGGTCGAGAAAAAGAGGAGTTTAATGGCGACGCTGCAAGTAGGTGATGTTGTCAAATGCTG
CATTAAGAAGATTGCTTATTTTGGTATCTTTGTTGAGATCGAAGGAGTGCCTGCTTTGATTCATCAGACAGAGGTTTCCTGGGACGCTACTTTAAACCCTGCATCATACT
TTAAAATTGGTCAGGTTGTAGAGGCCAAAGTTCATCAGTTGGATTTTTCACTCGAACGCATATTCTTATCGTTGAAGCAGATTACACCAGACCCGCTAGCTGAAGCATTG
GAGTCTGTGGTTGGTGATCATGATCCCATGGATGGAAGATTAGATTCAGCCAAAGTAGATACCCAGTGGGCTGATGTCGAATCTCTCATCAAGGAACTGCAAAATACTGA
AGGTATTGAGGCTGTATCCAAAGGGCGTTTTTTCCTTAGCCCTGGCTTGGCTCCAACCTTTCAGGTATATATGGCTTCTATGTATGAAAATCAGTACAAATTACTTGCTC
GATCTGGAAACAAAGTACAAGAGGTAAATATTCAGAAACATTACTCTTCTGCCATTGCCAAAAAGTCACTAAAATTCTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGTCGCGCTCTAACGGCCTCCTCCTTCTTCACTCCCATTGATTTATTGCGACCCAGAAGAGGAGCTGTTAGAAATATGTGTTTTAATGGCAGACCCAGTAAGTT
TTCGGTTCTTGCTTCCAAAGAAGAGGCTGAACTCGACCGTTGGGACCAAATGGAGCTCAAGTTTGGCCGCTTGATTGGGGAAGACCCCAAATTAACATTGGCCAAGATAA
TGAGCAAAAAAATGAATCCAGATGCTTCTTATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGTAAGTCTAACGAGGTAGAGGAACTTTCTCTTGATGGTTTGAAT
TTGGTCAGACCTCAGTTAAAGAAGGAAATTAAGTTAAAAGCTGCCAATAAGCCACCTGCACCAGATTTAAAGAAACCAAGCCAATCAGTTGGAAAGGTAGCAGTTAGTCC
TAAAGGTAGGGTTCCCAATGTTATTTTGAGAAAACCAACGATTTATAATGAGGACGATGTTGAAGATAAGCCATCGCGGATAAGGATGAAGCCGAATTTATCATTGAAAA
TGAGCAATGTATCGACAAAGGAGCAATATAGTGATATGACACTGTTGAGGAAGCCTGAACCAATGACTTCAAATGAAGTTATTGATGAGAAGGAAAAGCTCTCTGGTGAT
GGGTATGTTGATATTGTTGAGAATATTGAAAATGGGGCTAGTAAGGGATCAACAAGTGACAAAGTTGATGATTTTACTCTTTCTAAGAAGCCAGAAATAGGAGGTGACAA
AACAAGACTTGAAAATGAAAATGATCACAAAAATCTGGCTTATTCAGAAAGTGATACGGTTGATGGTAAAGAGAAGAATGACATTGAAGACTTATACATCCTAAAAAAGC
CTTTAAATGTGATGTCTGGCATGTCTAAAGAAACTGAACTAGGCTCATCAACAAATGAAGACAGGAAAGATATTGATTATTCTCCTATAGGATTGCAGCTACATGAGCCA
AGTGATATAGATTATGATGATAACCCAGCAGCTTTGAGCGAATCATTCAGTGATATTTTGGATTCAACAATAGAGGTGTCCAACAAAGCTACATTATTGGGTAAACCAAG
AAGAGTAGATCATTCTTCTAAGGAAGCACTAAAACTCAGTAGAGAAGAGGCCTCCACCCCTGAAACGGACGTCAATGGTGCCGTTGAGACAGAGTTCTTCTCTGCTATCC
CTGCTTTGGAGGAACACGAACTTGCTGACTGGACTAAAGCAGAGGATCTGGCAAAGTCAGGAGACAGAGCTGACGTGGAAGTAATAAGCTCGAGTACTCGAGGTTTTGTT
GTATCATTCGGCTCCCTAATAGGATTTATTCCATATCGTAATCTTGCTGCCAAGTGGAAGTTCTTAGCTTTTGAGTCATGGTTAAGACAGAAAGGTTTGGATCCATCAAT
ATACAAGCAAAACTTAGGAACTATTGGAAGTAGTGATGGTGGAAGCCAGGCCTTTGCCAGTACTAGGCCGGATCCGGAAATTGATGTAAAAGATGGAGGAGAACTTACAC
CTGATATGAAACTGGAGGATCTTCTTCAAATTTATGATCGAGAGAAAATCAAATTCTTGTCATCATTTGTTGGCCAGAAAATCAAAGTAAATGTGGTGTTGGCGAACAGA
AAATCAAGGAAGCTTATATTTTCCATAAGGCCAAAAGAGAGGGAGGAATTGGTCGAGAAAAAGAGGAGTTTAATGGCGACGCTGCAAGTAGGTGATGTTGTCAAATGCTG
CATTAAGAAGATTGCTTATTTTGGTATCTTTGTTGAGATCGAAGGAGTGCCTGCTTTGATTCATCAGACAGAGGTTTCCTGGGACGCTACTTTAAACCCTGCATCATACT
TTAAAATTGGTCAGGTTGTAGAGGCCAAAGTTCATCAGTTGGATTTTTCACTCGAACGCATATTCTTATCGTTGAAGCAGATTACACCAGACCCGCTAGCTGAAGCATTG
GAGTCTGTGGTTGGTGATCATGATCCCATGGATGGAAGATTAGATTCAGCCAAAGTAGATACCCAGTGGGCTGATGTCGAATCTCTCATCAAGGAACTGCAAAATACTGA
AGGTATTGAGGCTGTATCCAAAGGGCGTTTTTTCCTTAGCCCTGGCTTGGCTCCAACCTTTCAGGTATATATGGCTTCTATGTATGAAAATCAGTACAAATTACTTGCTC
GATCTGGAAACAAAGTACAAGAGGTAAATATTCAGAAACATTACTCTTCTGCCATTGCCAAAAAGTCACTAAAATTCTTATGA
Protein sequenceShow/hide protein sequence
MDGRALTASSFFTPIDLLRPRRGAVRNMCFNGRPSKFSVLASKEEAELDRWDQMELKFGRLIGEDPKLTLAKIMSKKMNPDASYLEVEKSFYQKKGKSNEVEELSLDGLN
LVRPQLKKEIKLKAANKPPAPDLKKPSQSVGKVAVSPKGRVPNVILRKPTIYNEDDVEDKPSRIRMKPNLSLKMSNVSTKEQYSDMTLLRKPEPMTSNEVIDEKEKLSGD
GYVDIVENIENGASKGSTSDKVDDFTLSKKPEIGGDKTRLENENDHKNLAYSESDTVDGKEKNDIEDLYILKKPLNVMSGMSKETELGSSTNEDRKDIDYSPIGLQLHEP
SDIDYDDNPAALSESFSDILDSTIEVSNKATLLGKPRRVDHSSKEALKLSREEASTPETDVNGAVETEFFSAIPALEEHELADWTKAEDLAKSGDRADVEVISSSTRGFV
VSFGSLIGFIPYRNLAAKWKFLAFESWLRQKGLDPSIYKQNLGTIGSSDGGSQAFASTRPDPEIDVKDGGELTPDMKLEDLLQIYDREKIKFLSSFVGQKIKVNVVLANR
KSRKLIFSIRPKEREELVEKKRSLMATLQVGDVVKCCIKKIAYFGIFVEIEGVPALIHQTEVSWDATLNPASYFKIGQVVEAKVHQLDFSLERIFLSLKQITPDPLAEAL
ESVVGDHDPMDGRLDSAKVDTQWADVESLIKELQNTEGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKVQEVNIQKHYSSAIAKKSLKFL