; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g220940 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g220940
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionS1 motif domain-containing protein
Genome locationCsor_Chr02:8783431..8788685
RNA-Seq ExpressionCsor.00g220940
SyntenyCsor.00g220940
Gene Ontology termsGO:0034337 - RNA folding (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0019843 - rRNA binding (molecular function)
InterPro domainsIPR012340 - Nucleic acid-binding, OB-fold
IPR022967 - RNA-binding domain, S1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606113.1 hypothetical protein SDJN03_03430, partial [Cucurbita argyrosperma subsp. sororia]0.0100Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
        AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
Subjt:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS

Query:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVKNGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRK
        GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVKNGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRK
Subjt:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVKNGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRK

Query:  LIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMD
        LIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMD
Subjt:  LIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMD

Query:  GRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        GRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  GRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

KAG7036057.1 rpsA [Cucurbita argyrosperma subsp. argyrosperma]0.090.12Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTI+KEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLR-------PPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTK
        AVG          P       +      SKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHEL+DWTK
Subjt:  AVGLR-------PPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTK

Query:  AEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDM
        AEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK                               EIDVKNGSELTPDM
Subjt:  AEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDM

Query:  KLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV----
        KLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV    
Subjt:  KLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV----

Query:  -------------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPT
                     VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPT
Subjt:  -------------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPT

Query:  FQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        FQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  FQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

XP_022958651.1 uncharacterized protein LOC111459810 [Cucurbita moschata]0.092.72Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGK NE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKL+AANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGI NNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
        AVGLRPPEPSDMGYIEDTPDLSKSFSDL+DSTIKLSNEATLLGKPKRVDYSSNETLKLGGEET TPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
Subjt:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS

Query:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ
        GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK                               EIDVKNGSELTPDMKLEDLLQ
Subjt:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ

Query:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------
        IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEI+GVPALIHQTEV           
Subjt:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------

Query:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
              VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
Subjt:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS

Query:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

XP_022996046.1 uncharacterized protein LOC111491368 [Cucurbita maxima]0.090.84Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF+PIDLLRPRRV+VRNPCFN RPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGK NE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKP APDLKKPSQAV     SPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENE+DHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGI NNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
        AVGLRPPEPSDMGYIEDTPD SKS SDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGE+T TPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
Subjt:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS

Query:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ
        GDRADVEIISSTTRGFVVSF SIVGFIPYRNLSAKWKFLAFESWLRQK                               EIDVKNGSELTPDMKLEDLLQ
Subjt:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ

Query:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------
        IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV           
Subjt:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------

Query:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
              VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEW DVESLI ELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
Subjt:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS

Query:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        MYENQYKLLARSGNK+QELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

XP_023521237.1 uncharacterized protein LOC111784966 [Cucurbita pepo subsp. pepo]0.092.05Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF+PIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGK NE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKP APDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENT LENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGI NNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
        AVGLRPPEPSDMGYIEDTPD SKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEET TPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
Subjt:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS

Query:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ
        GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK                               EIDVKNGSELTPDMKLEDLLQ
Subjt:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ

Query:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------
        IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFS+RQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV           
Subjt:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------

Query:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
              VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEW DVESLI ELQNTEGIEAV KGRFFLSPGLAPTFQVYMAS
Subjt:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS

Query:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

TrEMBL top hitse value%identityAlignment
A0A0A0KKL1 S1 motif domain-containing protein0.077.18Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF+PIDLLRPRR AVRN CFN RPSKFSVL+SKEEAELD+WDQMELKFGR+IGEDPKLTLAKIMSKKMN  ASYLEVEKSFYQKKGKSNE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKP  PD+KKPSQAV K  VSPKGRVPNVILRKPT Y EDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGE------ENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENEQDHKNLDHSESSTVDD-----KNENV-SAISEETEDA
        KPEPM SNEVIDE EKLSG+      EN+ N ASK  TSDRID FTL KKPEIG + TRLE+E D   +D  E + +DD     +  NV S +SEETE  
Subjt:  KPEPMVSNEVIDEKEKLSGE------ENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENEQDHKNLDHSESSTVDD-----KNENV-SAISEETEDA

Query:  SSSKENGIHNNYFAVGLRP-PEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTP--DVIGAGETENFSALPALE
        SS+ ENG   +Y A+GL+   EPSD+ Y+E+   LS+SFSD+LD TI+ S +ATLLGKP+RVD+SS ET KL  EET TP  DV GA ETENFSA+PALE
Subjt:  SSSKENGIHNNYFAVGLRP-PEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTP--DVIGAGETENFSALPALE

Query:  EHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVK
        EHELADWTKAEDLAKSGDRADVE+ISS+TRGFVVSFGS+VGFIPYRNL+AKWKFLAFESWLRQK                               EIDVK
Subjt:  EHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVK

Query:  NGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALI
        +G ELTPDMKLEDLLQIY++EK+KFLSSFVGQKIKVNVVLANRKSRKLIFSIR KER++LV+KKRSLM TLQVGDVVKCCI KIAYFGIFVEIEGVPALI
Subjt:  NGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALI

Query:  HQTE-----------------VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRF
        HQTE                 VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRL+S E+DTEW DVESL+ ELQN EGIEAVSKGRF
Subjt:  HQTE-----------------VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRF

Query:  FLSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        FLSPGLAPTFQVYMASMYENQYKLLARSGNK+QELMV+TSLDKET+KSVILTCTNRV+
Subjt:  FLSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

A0A1S3AU41 uncharacterized protein LOC103482723 isoform X10.076.88Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF+PIDLLRPRR AVRN CFN R SKFSVLASKEEAELD+WDQMELKFGR+IGEDPKLTLAKIMSKKMN  ASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNL+RPQLKKEMKLKAANKP  PD+KKPSQAV K  VSPKGRVPNVILRKPTIY EDDVEDKPSR+RMKPNLSLKMSNV TKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGE------ENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENEQDHKNLDHSESSTVDD-----KNENV-SAISEETEDA
        KPEPM SNEVIDE EKLSG+      EN+ N+ASKGS+SDRID FTL KKPEIG + T LE+E D   +D  E + +DD     +  NV S +SEETE  
Subjt:  KPEPMVSNEVIDEKEKLSGE------ENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENEQDHKNLDHSESSTVDD-----KNENV-SAISEETEDA

Query:  SSSKENGIHNNYFAVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTP--DVIGAGETENFSALPALEE
        SS+ ENG   +Y ++GL+  EPSD+ Y+E+   LS+SF+D+LDSTI++S +ATLLGKP+RVD+SS ET KL  EE  TP  D+ GA ET +FSA+PALEE
Subjt:  SSSKENGIHNNYFAVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTP--DVIGAGETENFSALPALEE

Query:  HELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKN
        HELADWTKAEDLAKSGDRADVE+ISS+TRGFVVSFGS+VGFIPYRNL+AKWKFLAFESWLRQK                               EIDVK+
Subjt:  HELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKN

Query:  GSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIH
        G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKV VVLANRKSRKL+FS+R KEREELVEKKRSLM TLQVGDVVKCCI KIAYFGIFVEIEGVPALIH
Subjt:  GSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIH

Query:  QTE-----------------VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFF
        QTE                 VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHD MDGRL+SAEVDTEW DVESLI ELQNTEGIEAVSKGRFF
Subjt:  QTE-----------------VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFF

Query:  LSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        LSPGLAPTFQVYMASMYENQYKLLARSGNK+QELMV+TSLDKET+KSVILTCTNRV+
Subjt:  LSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

A0A5A7THK9 Protein MLP10.077.15Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF+PIDLLRPRR AVRN CFN R SKFSVLASKEEAELD+WDQMELKFGR+IGEDPKLTLAKIMSKKMN  ASYLEVEKSFYQKKGKS+E
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNL+RPQLKKEMKLKAANKP  PD+KKPSQAV K  VSPKGRVPNVILRKPTIY EDDVEDKPSR+RMKPNLSLKMSNV TKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGE------ENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENEQDHKNLDHSESSTVDD-----KNENV-SAISEETEDA
        KPEPM SNEVIDE EKLSG+      EN+ N+ASKGS+SDRID FTL KKPEIG + T LE+E D   +D  E + +DD     +  NV S +SEETE  
Subjt:  KPEPMVSNEVIDEKEKLSGE------ENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENEQDHKNLDHSESSTVDD-----KNENV-SAISEETEDA

Query:  SSSKENGIHNNYFAVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTP--DVIGAGETENFSALPALEE
        SS+ ENG   +Y ++GL+  EPSD+ Y+E+   LS+SF+D+LDSTI++S +ATLLGKP+RVD+SS ET KL  EE  TP  D+ GA ET +FSA+PALEE
Subjt:  SSSKENGIHNNYFAVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTP--DVIGAGETENFSALPALEE

Query:  HELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKN
        HELADWTKAEDLAKSGDRADVE+ISS+TRGFVVSFGS+VGFIPYRNL+AKWKFLAFESWLRQK                               EIDVK+
Subjt:  HELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKN

Query:  GSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIH
        G ELTPDMKLEDLLQIYDREK+KFLSSFVGQKIKVNVVLANRKSRKL+FS+R KEREELVEKKRSLM TLQVGDVVKCCI KIAYFGIFVEIEGVPALIH
Subjt:  GSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIH

Query:  QTE-----------------VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFF
        QTE                 VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRL+SAEVDTEW DVESLI ELQNTEGIEAVSKGRFF
Subjt:  QTE-----------------VVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFF

Query:  LSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        LSPGLAPTFQVYMASMYENQYKLLARSGNK+QELMV+TSLDKET+KSVILTCTNRV+
Subjt:  LSPGLAPTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

A0A6J1H2N6 uncharacterized protein LOC1114598100.092.72Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGK NE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKL+AANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGI NNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
        AVGLRPPEPSDMGYIEDTPDLSKSFSDL+DSTIKLSNEATLLGKPKRVDYSSNETLKLGGEET TPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
Subjt:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS

Query:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ
        GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK                               EIDVKNGSELTPDMKLEDLLQ
Subjt:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ

Query:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------
        IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEI+GVPALIHQTEV           
Subjt:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------

Query:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
              VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
Subjt:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS

Query:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

A0A6J1K3L5 uncharacterized protein LOC1114913680.090.84Show/hide
Query:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE
        MDGRALTASSFF+PIDLLRPRRV+VRNPCFN RPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGK NE
Subjt:  MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNE

Query:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
        VEELSLDGLNLVRPQLKKEMKLKAANKP APDLKKPSQAV     SPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
Subjt:  VEELSLDGLNLVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF
        KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENE+DHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGI NNYF
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYF

Query:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
        AVGLRPPEPSDMGYIEDTPD SKS SDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGE+T TPDVIGAGETENFSALPALEEHELADWTKAEDLAKS
Subjt:  AVGLRPPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKS

Query:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ
        GDRADVEIISSTTRGFVVSF SIVGFIPYRNLSAKWKFLAFESWLRQK                               EIDVKNGSELTPDMKLEDLLQ
Subjt:  GDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQK-------------------------------EIDVKNGSELTPDMKLEDLLQ

Query:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------
        IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV           
Subjt:  IYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------

Query:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
              VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEW DVESLI ELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS
Subjt:  ------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMAS

Query:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        MYENQYKLLARSGNK+QELMVQTSLDKETVKSVILTCTNRVQ
Subjt:  MYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

SwissProt top hitse value%identityAlignment
Q4L6I1 30S ribosomal protein S11.1e-0830.5Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK
        S F GQ I++ V   + ++ ++I S +  E+ E   KK SL+ +L  GDV+K  + ++  FG FV+I GV  L+H +E+                 V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK

Query:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRL
        V  ++   ERI LS+K   P P  E+++    + D ++G++
Subjt:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRL

Q5HFU7 30S ribosomal protein S18.9e-0829.5Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK
        S F GQ I++ V   + ++ ++I S +  E+EE   KK  L+ +L  GDV+   + ++  FG F++I GV  L+H +E+                 V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK

Query:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG
        +  +D   ERI LS+K   P P  E ++    ++D ++G
Subjt:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG

Q6GGT5 30S ribosomal protein S16.8e-0829.5Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK
        S F GQ I++ V   + ++ ++I S +  E+EE   KK  L+ +L  GDV+   + ++  FG F++I GV  L+H +E+                 V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK

Query:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG
        +  +D   ERI LS+K   P P  E ++    ++D ++G
Subjt:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG

Q99U14 30S ribosomal protein S18.9e-0829.5Show/hide
Query:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK
        S F GQ I++ V   + ++ ++I S +  E+EE   KK  L+ +L  GDV+   + ++  FG F++I GV  L+H +E+                 V+ K
Subjt:  SSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV-----------------VEAK

Query:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG
        +  +D   ERI LS+K   P P  E ++    ++D ++G
Subjt:  VHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDG

Q9JZ44 30S ribosomal protein S11.8e-0825Show/hide
Query:  LKLGGEETLTPDVI--GAGETENFSALPALEEHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVK
        +K+G   T+T + +  G GET+    L   +    ADW   E+  ++GD     I      G  V   SI  F+P                         
Subjt:  LKLGGEETLTPDVI--GAGETENFSALPALEEHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVK

Query:  NGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALI
         GS             + D   +K  S F G++I+  V+  ++K   ++ S R      L E++++L+  LQ G V+K  +  I  +G FV++ G+  L+
Subjt:  NGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALI

Query:  HQTEV-----------------VEAKVHQLDFSLERIFLSLKQITPDP
        H T++                 VEAKV + D   +R+ L +KQ+  DP
Subjt:  HQTEV-----------------VEAKVHQLDFSLERIFLSLKQITPDP

Arabidopsis top hitse value%identityAlignment
AT1G12800.1 Nucleic acid-binding, OB-fold-like protein2.7e-16950.4Show/hide
Query:  PRRVAVRNPCFN--ARPSKFSVLASK-EEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNEVEEL------------
        P RV VR    N  A+  KF V ASK EE +L++WDQMEL FGR++GEDPKLTLAKI+++K++  AS++++EKSFY+ KGK  EVEE+            
Subjt:  PRRVAVRNPCFN--ARPSKFSVLASK-EEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNEVEEL------------

Query:  --SLDGLNLVRPQLKKEMKL-KAANKPSAPDLKKP-SQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR
          SLDGL LV+P LK  +K  +   K  +P LKKP  +AVA P V    R+PNVILRKP+ +   + +D+ S++R+KPNL+LKM N    E++SDMTLLR
Subjt:  --SLDGLNLVRPQLKKEMKL-KAANKPSAPDLKKP-SQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLR

Query:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENE-QDHKNLDHSE----SSTVDDKNENVSAISEETEDASSSKENG
        KPEP VS    +E + LS +  +     +G T  +   +TL +KPE   +   +E E  D   ++ SE    S    +    +  I +E  D+   + + 
Subjt:  KPEPMVSNEVIDEKEKLSGEENVVNQASKGSTSDRIDGFTLFKKPEIG-ENTRLENE-QDHKNLDHSE----SSTVDDKNENVSAISEETEDASSSKENG

Query:  IHNNYFAVGLR-PPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTK
        I NN     ++   E S      ++  L +  S  +  TI    EA+L GKP+R+D SS E       +    +  G   +      P     E  DW K
Subjt:  IHNNYFAVGLR-PPEPSDMGYIEDTPDLSKSFSDLLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTK

Query:  AEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVK---------------------------------NGSELTP
        AE L K+  RADVE+ISS+TRGF VS+GS++GF+PYRNL+AKWKFLAFESWLR+K +D                                   NG E++ 
Subjt:  AEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLAFESWLRQKEIDVK---------------------------------NGSELTP

Query:  DMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV--
        DMKLEDLL +YDREK KFLSSFVGQKIKVNVV+ANR SRKLIFS+R +E EE VEKKR+LMA L+VGDVVKCCI KI YFGIF E+EGVPAL+HQ+EV  
Subjt:  DMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVPALIHQTEV--

Query:  ---------------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGL
                       VEAKVHQLDF+LERIFLSLK+ITPDPL EALESVV GD+D + GRL++AE+D EWPDVESLI EL+  EGI++VSK RFFLSPGL
Subjt:  ---------------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGL

Query:  APTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ
        APTFQVYMA M+ENQYKLLAR+GN++QEL+V+ SL KE +KS I++CTNRV+
Subjt:  APTFQVYMASMYENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ

AT3G23700.1 Nucleic acid-binding proteins superfamily4.8e-1725.06Show/hide
Query:  LLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFI
        L  S+   SN  +L+   K    S+        +++ +  V+ A  +              +DW  A+   KSGD  + E+      G ++ F S+VGF+
Subjt:  LLDSTIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFI

Query:  PYRNLSAKWKFLAFESWLRQKEIDVKNGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVG
        PY  LS                    + S   P   + ++ +           + VG K+ V VV A+ ++RKLI S    E+  L  K       + VG
Subjt:  PYRNLSAKWKFLAFESWLRQKEIDVKNGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVG

Query:  DVVKCCITKIAYFGIFVEIE------GVPALIHQTEV-----------------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRL
        DV    +  +  +G F+ +        +  L+H +EV                 V   V  +D    RI LS+KQ+  DPL E L+ V+  D       L
Subjt:  DVVKCCITKIAYFGIFVEIE------GVPALIHQTEV-----------------VEAKVHQLDFSLERIFLSLKQITPDPLAEALESVV-GDHDPMDGRL

Query:  ESAEVDT--EWPDVESLINELQNTEGIEAVSKGR-FFLSPGLAPTFQVYMASM--YENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRV
         S   DT    P +E+++ EL   +GIEAV   R  F    ++   Q+++++    + ++ LLAR+G ++QE+ + TSL++  +K  +     RV
Subjt:  ESAEVDT--EWPDVESLINELQNTEGIEAVSKGR-FFLSPGLAPTFQVYMASM--YENQYKLLARSGNKIQELMVQTSLDKETVKSVILTCTNRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGTCGCGCTCTAACGGCCTCCTCCTTCTTCTCACCTATTGATTTATTGCGACCCAGAAGAGTTGCTGTTAGAAATCCGTGCTTTAATGCCAGACCCAGTAAGTT
TTCGGTTCTTGCTTCCAAAGAAGAGGCTGAGCTCGACAAATGGGACCAAATGGAGCTCAAGTTTGGCCGCATGATTGGCGAAGACCCCAAATTAACACTGGCCAAGATAA
TGAGCAAAAAAATGAACACTGGCGCTTCTTATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGTAAGTCCAACGAGGTAGAGGAACTTTCTCTTGATGGTCTGAAT
TTGGTCAGACCTCAGTTAAAGAAGGAAATGAAGTTAAAAGCTGCCAATAAGCCATCAGCACCAGATTTAAAGAAACCAAGCCAAGCAGTTGCAAAGCCAGCAGTTAGTCC
TAAAGGCAGGGTTCCCAATGTTATTTTGAGGAAACCGACAATTTATAAGGAGGATGATGTTGAAGATAAACCGTCGAGAATAAGAATGAAGCCAAATTTATCATTGAAAA
TGAGCAATGTATCAACAAAGGAGAAATATAGCGATATGACGCTGTTGAGGAAGCCAGAACCAATGGTTTCGAATGAAGTTATTGATGAGAAGGAAAAGCTATCTGGTGAA
GAGAATGTTGTAAATCAGGCTAGTAAGGGATCAACAAGTGACCGAATTGATGGGTTTACTCTTTTTAAGAAGCCAGAAATAGGTGAAAACACAAGACTTGAAAATGAACA
GGATCATAAAAATCTTGATCATTCAGAAAGTAGTACAGTTGATGATAAAAACGAAAATGTGTCTGCCATTTCTGAAGAAACTGAAGACGCCTCATCATCAAAGGAAAATG
GGATACATAATAATTATTTTGCTGTAGGATTACGGCCACCCGAGCCAAGTGATATGGGATATATTGAGGACACACCAGATTTGAGCAAATCATTTAGTGATCTTTTGGAT
TCGACAATAAAATTGTCCAACGAAGCTACGTTATTGGGTAAACCAAAAAGGGTAGATTATTCTTCAAATGAAACATTAAAACTCGGTGGAGAAGAGACCTTGACTCCTGA
TGTTATTGGTGCTGGTGAGACAGAGAACTTCTCAGCTCTTCCTGCTTTGGAGGAACATGAACTTGCTGACTGGACTAAAGCAGAAGATCTGGCGAAGTCGGGAGACAGAG
CTGATGTGGAAATAATAAGCTCGACTACCCGAGGTTTTGTTGTATCATTCGGCTCCATCGTAGGATTTATTCCATATCGTAATCTTTCTGCCAAGTGGAAGTTCTTAGCT
TTCGAGTCTTGGCTAAGACAGAAAGAAATTGATGTAAAAAATGGGAGTGAGCTTACACCTGACATGAAATTGGAGGATCTTCTTCAAATTTATGATCGAGAGAAACTCAA
GTTCTTGTCATCATTTGTCGGCCAGAAAATCAAAGTAAATGTGGTGTTGGCTAACAGAAAATCAAGGAAGCTTATATTTTCCATAAGGCAAAAAGAAAGGGAAGAATTGG
TCGAGAAAAAGAGGAGTTTAATGGCTACTCTGCAAGTAGGAGATGTTGTCAAATGTTGCATCACGAAGATCGCTTATTTTGGTATCTTCGTCGAGATCGAAGGAGTGCCT
GCTTTGATTCATCAGACTGAGGTTGTCGAGGCAAAAGTTCATCAGTTGGATTTTTCACTCGAACGCATATTCTTATCGTTGAAGCAGATTACACCAGATCCGCTCGCTGA
AGCATTGGAGTCTGTGGTTGGAGATCACGATCCCATGGATGGAAGATTAGAATCAGCCGAAGTGGACACCGAGTGGCCTGACGTAGAATCTCTCATCAACGAACTGCAAA
ATACTGAAGGTATTGAGGCTGTATCCAAAGGGCGATTTTTCCTTAGCCCTGGCTTGGCTCCAACCTTTCAGGTTTATATGGCTTCTATGTATGAAAATCAGTACAAATTA
CTTGCTCGATCAGGAAACAAAATACAGGAGCTGATGGTTCAGACATCATTGGATAAAGAAACAGTGAAATCTGTCATCTTAACTTGCACCAACAGGGTACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACGGTCGCGCTCTAACGGCCTCCTCCTTCTTCTCACCTATTGATTTATTGCGACCCAGAAGAGTTGCTGTTAGAAATCCGTGCTTTAATGCCAGACCCAGTAAGTT
TTCGGTTCTTGCTTCCAAAGAAGAGGCTGAGCTCGACAAATGGGACCAAATGGAGCTCAAGTTTGGCCGCATGATTGGCGAAGACCCCAAATTAACACTGGCCAAGATAA
TGAGCAAAAAAATGAACACTGGCGCTTCTTATCTTGAAGTTGAGAAATCATTTTACCAGAAGAAGGGTAAGTCCAACGAGGTAGAGGAACTTTCTCTTGATGGTCTGAAT
TTGGTCAGACCTCAGTTAAAGAAGGAAATGAAGTTAAAAGCTGCCAATAAGCCATCAGCACCAGATTTAAAGAAACCAAGCCAAGCAGTTGCAAAGCCAGCAGTTAGTCC
TAAAGGCAGGGTTCCCAATGTTATTTTGAGGAAACCGACAATTTATAAGGAGGATGATGTTGAAGATAAACCGTCGAGAATAAGAATGAAGCCAAATTTATCATTGAAAA
TGAGCAATGTATCAACAAAGGAGAAATATAGCGATATGACGCTGTTGAGGAAGCCAGAACCAATGGTTTCGAATGAAGTTATTGATGAGAAGGAAAAGCTATCTGGTGAA
GAGAATGTTGTAAATCAGGCTAGTAAGGGATCAACAAGTGACCGAATTGATGGGTTTACTCTTTTTAAGAAGCCAGAAATAGGTGAAAACACAAGACTTGAAAATGAACA
GGATCATAAAAATCTTGATCATTCAGAAAGTAGTACAGTTGATGATAAAAACGAAAATGTGTCTGCCATTTCTGAAGAAACTGAAGACGCCTCATCATCAAAGGAAAATG
GGATACATAATAATTATTTTGCTGTAGGATTACGGCCACCCGAGCCAAGTGATATGGGATATATTGAGGACACACCAGATTTGAGCAAATCATTTAGTGATCTTTTGGAT
TCGACAATAAAATTGTCCAACGAAGCTACGTTATTGGGTAAACCAAAAAGGGTAGATTATTCTTCAAATGAAACATTAAAACTCGGTGGAGAAGAGACCTTGACTCCTGA
TGTTATTGGTGCTGGTGAGACAGAGAACTTCTCAGCTCTTCCTGCTTTGGAGGAACATGAACTTGCTGACTGGACTAAAGCAGAAGATCTGGCGAAGTCGGGAGACAGAG
CTGATGTGGAAATAATAAGCTCGACTACCCGAGGTTTTGTTGTATCATTCGGCTCCATCGTAGGATTTATTCCATATCGTAATCTTTCTGCCAAGTGGAAGTTCTTAGCT
TTCGAGTCTTGGCTAAGACAGAAAGAAATTGATGTAAAAAATGGGAGTGAGCTTACACCTGACATGAAATTGGAGGATCTTCTTCAAATTTATGATCGAGAGAAACTCAA
GTTCTTGTCATCATTTGTCGGCCAGAAAATCAAAGTAAATGTGGTGTTGGCTAACAGAAAATCAAGGAAGCTTATATTTTCCATAAGGCAAAAAGAAAGGGAAGAATTGG
TCGAGAAAAAGAGGAGTTTAATGGCTACTCTGCAAGTAGGAGATGTTGTCAAATGTTGCATCACGAAGATCGCTTATTTTGGTATCTTCGTCGAGATCGAAGGAGTGCCT
GCTTTGATTCATCAGACTGAGGTTGTCGAGGCAAAAGTTCATCAGTTGGATTTTTCACTCGAACGCATATTCTTATCGTTGAAGCAGATTACACCAGATCCGCTCGCTGA
AGCATTGGAGTCTGTGGTTGGAGATCACGATCCCATGGATGGAAGATTAGAATCAGCCGAAGTGGACACCGAGTGGCCTGACGTAGAATCTCTCATCAACGAACTGCAAA
ATACTGAAGGTATTGAGGCTGTATCCAAAGGGCGATTTTTCCTTAGCCCTGGCTTGGCTCCAACCTTTCAGGTTTATATGGCTTCTATGTATGAAAATCAGTACAAATTA
CTTGCTCGATCAGGAAACAAAATACAGGAGCTGATGGTTCAGACATCATTGGATAAAGAAACAGTGAAATCTGTCATCTTAACTTGCACCAACAGGGTACAGTAG
Protein sequenceShow/hide protein sequence
MDGRALTASSFFSPIDLLRPRRVAVRNPCFNARPSKFSVLASKEEAELDKWDQMELKFGRMIGEDPKLTLAKIMSKKMNTGASYLEVEKSFYQKKGKSNEVEELSLDGLN
LVRPQLKKEMKLKAANKPSAPDLKKPSQAVAKPAVSPKGRVPNVILRKPTIYKEDDVEDKPSRIRMKPNLSLKMSNVSTKEKYSDMTLLRKPEPMVSNEVIDEKEKLSGE
ENVVNQASKGSTSDRIDGFTLFKKPEIGENTRLENEQDHKNLDHSESSTVDDKNENVSAISEETEDASSSKENGIHNNYFAVGLRPPEPSDMGYIEDTPDLSKSFSDLLD
STIKLSNEATLLGKPKRVDYSSNETLKLGGEETLTPDVIGAGETENFSALPALEEHELADWTKAEDLAKSGDRADVEIISSTTRGFVVSFGSIVGFIPYRNLSAKWKFLA
FESWLRQKEIDVKNGSELTPDMKLEDLLQIYDREKLKFLSSFVGQKIKVNVVLANRKSRKLIFSIRQKEREELVEKKRSLMATLQVGDVVKCCITKIAYFGIFVEIEGVP
ALIHQTEVVEAKVHQLDFSLERIFLSLKQITPDPLAEALESVVGDHDPMDGRLESAEVDTEWPDVESLINELQNTEGIEAVSKGRFFLSPGLAPTFQVYMASMYENQYKL
LARSGNKIQELMVQTSLDKETVKSVILTCTNRVQ