; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0013153 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0013153
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr1:47890476..47899671
RNA-Seq ExpressionLag0013153
SyntenyLag0013153
Gene Ontology termsGO:0000070 - mitotic sister chromatid segregation (biological process)
GO:0015074 - DNA integration (biological process)
GO:0000796 - condensin complex (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR011989 - Armadillo-like helical
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR016024 - Armadillo-type fold
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594980.1 Condensin-2 complex subunit G2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.05Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTKVSIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL +SISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPDG TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+LS+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKCLSFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY+FMDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED+ KPD+ K+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQVYKDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGA C+++PVEGEEAHM F++Q K  FP WLS+VAKIEL +++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSADDREWGD+MLASLQRCYPQIE+E+ QCNG+  HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

XP_022963354.1 uncharacterized protein LOC111463584 [Cucurbita moschata]0.0e+0084.83Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTKVSIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL +SISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPD  TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+LS+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKC+SFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY+FMDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+ K+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQVYKDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGA C+++PVEGEEAHM F+EQ K  FP W+S+VAKIEL +++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSADDREWGD+MLASLQRCYPQIE+E+ QCNG+  HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

XP_023003702.1 uncharacterized protein LOC111497213 isoform X1 [Cucurbita maxima]0.0e+0084.83Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTK+SIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL SSISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPDG TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+ S+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKCLSFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY++MDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+VK+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQV+KDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGAGC+++PVEGEEAH+ F+EQ K  FP WLS+VAKIEL D++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSAD REWGD+MLASLQRCYPQIE+E+ +CNG   HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

XP_023003703.1 uncharacterized protein LOC111497213 isoform X2 [Cucurbita maxima]0.0e+0084.83Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTK+SIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL SSISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPDG TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+ S+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKCLSFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY++MDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+VK+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQV+KDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGAGC+++PVEGEEAH+ F+EQ K  FP WLS+VAKIEL D++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSAD REWGD+MLASLQRCYPQIE+E+ +CNG   HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

XP_023518903.1 uncharacterized protein LOC111782285 [Cucurbita pepo subsp. pepo]0.0e+0085.16Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLT LAHDQPIISQKITRLL+PSYFPTKVSIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL SSISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPDG TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+LS+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKCLSFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETI+LTLKVICEVSIV+C+DY+FMDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+VK+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQL+TSILGQQVYKDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGAGC+++PVEGEEAH+ F+EQ K  FP WLS+VAKIEL D++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH+ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSADDREWGD+MLASLQRCYPQIE+E+ QCNG+  HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

TrEMBL top hitse value%identityAlignment
A0A0A0KJP5 Uncharacterized protein0.0e+0082.14Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDES DVRVVAVEGCCRILYLFWEIIPS TITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNG+IYLFGNPQSHEILKVILPRLGHLMLDNAL VRVALADLLLLIRDVRDFQFNKVVSLDVLL+VLA DQPIISQKITRLLMPSYFPTKVSIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI++LVR LI+LVSSS KLD NYIDGLLLSAKYL S IS EPCYK DLKDLFT EKL+CLLSVAQS CARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        NIVS FSPD FTDLL ECMQLITNCRGLS D+EKQ EVRSGH+FFQACDALD+MFEAM+LILQK AYRCHI+FGTE PKLSVSPAKRKKCK SGKILSRL
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKN
        K+FGGKKC++FE+DYFVAVGMSWQVKDLLSDEK K+ALLS QTIE IF +LKVI EVSIVQC++YD+MDV+PVLAYASLALHM+ QKGSQ+I S+SGTKN
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKN

Query:  KIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGKN
        KI    SS E                    +EQTLDHLL+C+KK+YVS+DS  PD+ KQGN K  QH ++K  ES  NQSH L GGCV  S+KTLKQ KN
Subjt:  KIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGKN

Query:  LTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIASI
        LTAVLKFI DAISMGFLS+KYELCLKFVSEYMQ   S L QQ YKDIQF VE KE FLCLKSSLTYAAKLLNQ+LR V+DS+LTQ SIL HNL+DLIA I
Subjt:  LTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIASI

Query:  EVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIVT
        EV+LGSGYAARLVA+AK+W PDLILALGA C+MRPVE + AH+  +EQTK YFPSWLS+VAKIELS+ SE+   EEE  D S D+HN STFKKFLKMIVT
Subjt:  EVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIVT

Query:  FLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFETG
        FLKRDH ILDA+G +FM+GSE+GLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQ CYPQIEREIEQCNG+G HQLD+AK LL+PIWLYHIFETG
Subjt:  FLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFETG

Query:  KLSMMNE
        KLS MNE
Subjt:  KLSMMNE

A0A1S4DTW2 uncharacterized protein LOC103484848 isoform X10.0e+0081.8Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDES DVRVVAVEGCCRILYLFWEIIPS TITKIITKIFDEMS D SNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNG+IYLFGNPQSHEILKVILPRLGHLMLDNAL VRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLI+RSPMAGARFCEFAAS+GASLKSI++LVR LI+LVSSS KLD +YIDGLLLSAKYL S IS EPCYK DLKDLFT EKL+CLLSVAQS CARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        NIVS FSPD FTDLLEECMQLITNCRGLS D+EKQAEVRSGH+FF ACDALD+MFEAM+LILQK AYRCHI+FGTE PKLSVS AKRKKCKFSGK+LS+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKN
        K+FGGKKC++FE+DYFVAVG+ WQVKDLLSDEK K+ALLS QTIETIF +LKVICEVSIVQC++YD+MDV+PVLAYASLALHM+ Q+ SQ+I S+SGTKN
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKN

Query:  KIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGKN
        KI    SS E                    LEQTLDHLL CVKKLYVS+DS  PD+ KQGN K  QH  +K  ES  NQSH L GGCVD S+KTLKQ KN
Subjt:  KIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGKN

Query:  LTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIASI
        LTAVLKFI DAISMGFLS+KYELCLKFVSEYMQ I SIL QQ+YKDIQF VE KE FLCLKSSLTYAAKLLNQ+LR V+ S+LTQ SIL HNL+D+IA I
Subjt:  LTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIASI

Query:  EVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFA-----EEEDDDRSFDQHNFSTFKKFL
        EV+LGSGYAARLVA+AK+W PDLILALGA C+MRPVE E AH+  +EQTK YFPSWLS+VAKIELS+ S EDFA     EEED D SFD+H+ STFKKFL
Subjt:  EVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFA-----EEEDDDRSFDQHNFSTFKKFL

Query:  KMIVTFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYH
        KMIVTFLKRDH ILDA+G +FM+GSE+GLERKDFGLVLGLLQFVCRSLYSA+DREWGDMMLASLQ CYPQIEREIEQCNG+  HQLD+AK LL+PIWLYH
Subjt:  KMIVTFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYH

Query:  IFETGKLSMMNE
        +FETGKLSMMNE
Subjt:  IFETGKLSMMNE

A0A6J1HJV0 uncharacterized protein LOC1114635840.0e+0084.83Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTKVSIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL +SISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPD  TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+LS+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKC+SFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY+FMDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+ K+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQVYKDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGA C+++PVEGEEAHM F+EQ K  FP W+S+VAKIEL +++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSADDREWGD+MLASLQRCYPQIE+E+ QCNG+  HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

A0A6J1KNC2 uncharacterized protein LOC111497213 isoform X10.0e+0084.83Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTK+SIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL SSISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPDG TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+ S+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKCLSFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY++MDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+VK+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQV+KDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGAGC+++PVEGEEAH+ F+EQ K  FP WLS+VAKIEL D++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSAD REWGD+MLASLQRCYPQIE+E+ +CNG   HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

A0A6J1KXD0 uncharacterized protein LOC111497213 isoform X20.0e+0084.83Show/hide
Query:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
        +VANSNVRQNSLHLLLDVFPLENPDATKELKDTLL++QFFLIEKLLMDE SDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST
Subjt:  EVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLST

Query:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC
        LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALAD+LLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLL+PSYFPTK+SIEEACSRC
Subjt:  LNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRC

Query:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF
        ITLIKRSPMAGARFCEFAASEGASLKSI ELV+ LINLVSSS KLD NYIDGLLLSAKYL SSISSE CYKNDLKDLFTGEKL+CLLSVA SG ARSSLF
Subjt:  ITLIKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLF

Query:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL
        +IVS+FSPDG TDLLEECMQLITNC GLSGDL KQAEVRSGHKFFQACDALDVMFEAM L+LQKSAYRCHI+FGTEIPKLSVSPAKRK+CK SGK+ S+L
Subjt:  NIVSLFSPDGFTDLLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRL

Query:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK
        K FGGKKCLSFE DY VAVG+SWQVKDL+SDEKMKSALLSSQTIETIFLTLKVICEVSIV+C+DY++MDV+PVLAY SLALHM L+K  S++ TS+SGTK
Subjt:  KHFGGKKCLSFEDDYFVAVGMSWQVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQK-GSQSITSSSGTK

Query:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK
        NKIVTDSSSSE                  AILEQTLDHLL CVKKLY SED  KPD+VK+G RKL+QHVE+K KESG NQSH L+GGCVD  KKTLKQGK
Subjt:  NKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGK

Query:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS
        NLTA+LKFIVD+ISMGFLSEKYELCLKF SEYMQLITSILGQQV+KDIQFE+ETKE FLCLK SLTYAAKLLNQ+LRHVKDSSLTQI ILGHNLLDLIA 
Subjt:  NLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQVYKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIAS

Query:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV
        +EV+LGSGYAARLVA+AK+WLPDLILALGAGC+++PVEGEEAH+ F+EQ K  FP WLS+VAKIEL D++E+  + EEDDDRS DQH FS FKKFLKM +
Subjt:  IEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIV

Query:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET
        TFL+RDH ILDA+G +FMIGSEIGLE KDFGLVLGLLQFVCRSLYSAD REWGD+MLASLQRCYPQIE+E+ +CNG   HQLDRAKALL+PIWLYHIFET
Subjt:  TFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCYPQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFET

Query:  GKL
        GK+
Subjt:  GKL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.3e-11626.39Show/hide
Query:  DEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGATSK
        D + Y +W+  + A+L  Q V   V G                    PN + + W   ++     +   +S +      S  T+R++ + L+ VY   S 
Subjt:  DEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGATSK

Query:  ARINQLRGILQNTK-KGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDIKTWQELCSILITFEGTLARHSSVTNTREL
        A    LR  L + K      ++ +     +    L  AG  I   D IS+ +  L S Y  I+ +I+       +E  ++       L +   + N    
Subjt:  ARINQLRGILQNTK-KGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDIKTWQELCSILITFEGTLARHSSVTNTREL

Query:  PDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRPTCQLCGKFGHSAPACYLRFEEDFNNPHAVNNTNN
                +N  S+   +     N N   +N    +   P   F+G              N+  +  C  CG+ GH    C       F+    +NN N 
Subjt:  PDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRPTCQLCGKFGHSAPACYLRFEEDFNNPHAVNNTNN

Query:  NNQNVNGAAVASSSAYLA----TPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIKR
         N+     A +   A++        ++++  ++ DSGA+ H+  D        E      +AV       I       V   ++  I L  +L       
Subjt:  NNQNVNGAAVASSSAYLA----TPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIKR

Query:  NLISISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLWH
        NL+S+ RL  +  + +EF                                  K    +S N   V  ++   ++V    + FQ     ++    +  LWH
Subjt:  NLISISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLWH

Query:  RRLSHAS-YKVINVVARSC----NLKIPMNENSVFCDSCQQGKSHRLPFS--RSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPM
         R  H S  K++ +  ++     +L   +  +   C+ C  GK  RLPF   +  T  ++PL ++H D+ GP   V+     Y++ FVD FT     Y +
Subjt:  RRLSHAS-YKVINVVARSC----NLKIPMNENSVFCDSCQQGKSHRLPFS--RSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPM

Query:  QNKSQALTLFQQYKTLVENRFGKTIKILQTDWGGEFRS--FPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIY
        + KS   ++FQ +    E  F   +  L  D G E+ S     F    GI +  + PHT Q NGV ER  R I E   T+++ + L   FW +A  TA Y
Subjt:  QNKSQALTLFQQYKTLVENRFGKTIKILQTDWGGEFRS--FPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIY

Query:  IINRLPTPVLGNIS--PWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKFQFHSVKCVFLGYSISHKGYKCLSPSGRVYI-SRHVVFNEVD---------
        +INR+P+  L + S  P++    + P  +  + FG + +  ++  Q  KF   S K +F+GY     G+K        +I +R VV +E +         
Subjt:  IINRLPTPVLGNIS--PWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKFQFHSVKCVFLGYSISHKGYKCLSPSGRVYI-SRHVVFNEVD---------

Query:  -----------------HQPPCLTSVEFQAPSPAA----ALPSSDASENPTASAGS----PTMAPPVFEPCSN--------------------------L
                         +    +   EF   S        L  S  SEN      S     T  P   + C N                          L
Subjt:  -----------------HQPPCLTSVEFQAPSPAA----ALPSSDASENPTASAGS----PTMAPPVFEPCSN--------------------------L

Query:  GLPSSTAHVDVEPENSTARVVPPTTVVPPV-NDHHMRTRAKSGIVKPKVWGSFMVENTSDAAI---------SEPSS---VKVALQSEIWKKAMDAEIDA
             + + +   E+ TA  +    +  P  ND       +S  +K K   S+  E+ S   +           P+S   ++       W++A++ E++A
Subjt:  GLPSSTAHVDVEPENSTARVVPPTTVVPPV-NDHHMRTRAKSGIVKPKVWGSFMVENTSDAAI---------SEPSS---VKVALQSEIWKKAMDAEIDA

Query:  LARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLDINNAFLNGKLNEAV
           N TW +   P N N++ S+WVF VK    G+  R KARLVA+G+ Q   ID+ ETF+PV +  S R IL+L +     V Q+D+  AFLNG L E +
Subjt:  LARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLDINNAFLNGKLNEAV

Query:  YMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFF--SNTNMVVLVLIYVDDILV-TGDCPPFIEQFVRRLNAVFA
        YM+ P G   S + + VCKL++A+YGL+QA R WF+ F   L    F +S  D  ++     N N  + VL+YVDD+++ TGD    +  F R L   F 
Subjt:  YMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFF--SNTNMVVLVLIYVDDILV-TGDCPPFIEQFVRRLNAVFA

Query:  LKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLS-HTRPDISYIVNKLSQFLQQ
        + D+  + +++G+ +      +YL Q+ Y+  +L +  + +    STP  +         D   N T  RS++G L Y+   TRPD++  VN LS++  +
Subjt:  LKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLS-HTRPDISYIVNKLSQFLQQ

Query:  PTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCV-FLGPSLVSWSSKKQHVVARSSTESEYRSLANTAAEVSWVQ
          S  WQ  KRVLRYLKGT +  L   K  +    I+ Y D+DWA    D+KS           +L+ W++K+Q+ VA SSTE+EY +L     E  W++
Subjt:  PTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCV-FLGPSLVSWSSKKQHVVARSSTESEYRSLANTAAEVSWVQ

Query:  AFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLGVL
          L  + +   +P  I+ DN   +S+A NP  H R+K +++  HF R++V    I + Y+P+  QLAD  TK LP ++F  LR KLG+L
Subjt:  AFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLGVL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-14629.67Show/hide
Query:  EEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGATSKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVA
        E+W  +D+  +  +   +S  +  +++   T+R +W  LE +Y   SK   N+L                    +K+    L ++     LS L  +   
Subjt:  EEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGATSKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVA

Query:  GLDSEYIPIVCSIDDKDIKTWQELCSILITFEGTLARHSSVTNTRELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGR
        GL ++   +   I+++D K    L S+  +++           T EL D+++ L  N+  + +   QG+     G   +G     + NN+ R G RG+ +
Subjt:  GLDSEYIPIVCSIDDKDIKTWQELCSILITFEGTLARHSSVTNTRELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGR

Query:  GRSNYQRNNNSRPTCQLCGKFGHSAPACYLRFEEDFNNPH-AVNNTNNNNQNVNGAAVASSS-------------AYLATPDILNDPKWLADSGATSHVT
         RS  +  N     C  C + GH        F+ D  NP      T+    + N AA+  ++              +L+ P    + +W+ D+ A SH  
Subjt:  GRSNYQRNNNSRPTCQLCGKFGHSAPACYLRFEEDFNNPH-AVNNTNNNNQNVNGAAVASSS-------------AYLATPDILNDPKWLADSGATSHVT

Query:  TDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIKRNLISISRLTADNNVFVEFHPHFCLVKDKVSK--KVMVQGTLKD
        T   +L  +       T+ +GN +   I+ IG+  +       +VL  + HVP ++ NLIS   L  D      +  +F   K +++K   V+ +G  + 
Subjt:  TDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIKRNLISISRLTADNNVFVEFHPHFCLVKDKVSK--KVMVQGTLKD

Query:  GLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLWHRRLSHASYKVINVVARSCNLKIPMNENSVFCDSCQQGKSHR
         LY+             +   C   ++A+++                     +S  LWH+R+ H S K + ++A+   +          CD C  GK HR
Subjt:  GLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLWHRRLSHASYKVINVVARSCNLKIPMNENSVFCDSCQQGKSHR

Query:  LPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNKSQALTLFQQYKTLVENRFGKTIKILQTDWGGEF--RSFPHFLQNV
        + F  S       L+L++ D+ GP  + S  G KY+++F+DD +R   +Y ++ K Q   +FQ++  LVE   G+ +K L++D GGE+  R F  +  + 
Subjt:  LPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNKSQALTLFQQYKTLVENRFGKTIKILQTDWGGEF--RSFPHFLQNV

Query:  GIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRLPTPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKF
        GI    + P T Q NGV ER +R IVE   ++L  + LP  FW +A  TA Y+INR P+  L    P +    +   +   K FGC  F  +   Q  K 
Subjt:  GIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRLPTPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKF

Query:  QFHSVKCVFLGYSISHKGYKCLSP-SGRVYISRHVVFNEVDHQPPCLTSVEFQAPSPAAALPSSDASENPTASAGSPTMAPPVFEPCSNLG-LPSSTAHV
           S+ C+F+GY     GY+   P   +V  SR VVF E + +    T+ +         +P+        +++ +PT A    +  S  G  P      
Subjt:  QFHSVKCVFLGYSISHKGYKCLSP-SGRVYISRHVVFNEVDHQPPCLTSVEFQAPSPAAALPSSDASENPTASAGSPTMAPPVFEPCSNLG-LPSSTAHV

Query:  DVEPENSTARVVPPTTVVPPVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKVAL---QSEIWKKAMDAEIDALARNKTWRLVPPPPNLNLI
          + +     V  PT        H    R++   V+ + + S      SD    EP S+K  L   +     KAM  E+++L +N T++LV  P     +
Subjt:  DVEPENSTARVVPPTTVVPPVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKVAL---QSEIWKKAMDAEIDALARNKTWRLVPPPPNLNLI

Query:  GSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCK
          KWVFK+K   D    R KARLV KG+ Q  GIDF E FSPVVK  SIR IL+LA S    V QLD+  AFL+G L E +YM+QP+G+  +     VCK
Subjt:  GSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCK

Query:  LDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFF--FSNTNMVVLVLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEV--HST
        L+++LYGL+QAPR W+ +F S + +  +  + +D  V+F  FS  N ++L L+YVDD+L+ G     I +    L+  F +KD+G     LG+++    T
Subjt:  LDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFF--FSNTNMVVLVLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEV--HST

Query:  PAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLS------AFEGDPMPNPTLYRSVLGALQY-LSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRV
           ++L Q KYI  +L R  + + KP STP A   KLS        E         Y S +G+L Y +  TRPDI++ V  +S+FL+ P   HW+A K +
Subjt:  PAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLS------AFEGDPMPNPTLYRSVLGALQY-LSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRV

Query:  LRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTESEYRSLANTAAEVSWVQAFLKELRLCSSSP
        LRYL+GT+  G  L  G S  + +  YTDAD A  +D++KS            +SW SK Q  VA S+TE+EY +   T  E+ W++ FL+EL L     
Subjt:  LRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTESEYRSLANTAAEVSWVQAFLKELRLCSSSP

Query:  PIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLGV
         +++CD+ SA+ L++N ++H+R+K +++  H++R+ V  + + V  + + E  AD LTK +P ++F   +  +G+
Subjt:  PIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLGV

P92519 Uncharacterized mitochondrial protein AtMg008109.9e-5144.1Show/hide
Query:  VLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYR
        +L+YVDDIL+TG     +   + +L++ F++KD+G + Y+LG+++ + P+G++L QTKY   +L+  G+   KP STP    K  S+      P+P+ +R
Subjt:  VLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYR

Query:  SVLGALQYLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSK
        S++GALQYL+ TRPDISY VN + Q + +PT   +   KRVLRY+KGT  +GL++ K  +S L++ A+ D+DWA     ++S    C FLG +++SWS+K
Subjt:  SVLGALQYLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSK

Query:  KQHVVARSSTESEYRSLANTAAEVSWVQA
        +Q  V+RSSTE+EYR+LA TAAE++W  A
Subjt:  KQHVVARSSTESEYRSLANTAAEVSWVQA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.3e-26237.21Show/hide
Query:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGAT
        KL   NYL+W   V A+  G ++ G++ G+   P     I T++  +   NP Y  W   D+ +   + G++S ++   V    T+ ++W+ L ++Y   
Subjt:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGAT

Query:  SKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDI-KTWQELCSILITFEGTLARHSSVTNTR
        S   + QLR  L+   KGT  + +Y+  +    + L L G P+   + +   +  L  EY P++  I  KD   T  E+   L+  E  +   SS T   
Subjt:  SKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDI-KTWQELCSILITFEGTLARHSSVTNTR

Query:  ELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRP---TCQLCGKFGHSAPACYLRFEEDFNNPHAV
           +  +H      + N    +     NR ++N       +  NF              +  NN S+P    CQ+CG  GHSA  C           H +
Subjt:  ELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRP---TCQLCGKFGHSAPACYLRFEEDFNNPHAV

Query:  NNTNNNNQNVNGAAVASSSAYLATPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIK
        ++  N+ Q  +        A LA     +   WL DSGAT H+T+D  NL +   Y G + + V +G+ + ISH G+++++      + L+ +L+VP+I 
Subjt:  NNTNNNNQNVNGAAVASSSAYLATPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIK

Query:  RNLISISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLW
        +NLIS+ RL   N V VEF P    VKD  +   ++QG  KD LY+  + S Q            V++ AS  S    SS                   W
Subjt:  RNLISISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLW

Query:  HRRLSHASYKVINVVARSCNLKIPMNENSVF--CDSCQQGKSHRLPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNKS
        H RL H +  ++N V  + +L + +N +  F  C  C   KS+++PFS+S   + +PLE I+ D+W  SP++S   Y+YY+ FVD FTR T +YP++ KS
Subjt:  HRRLSHASYKVINVVARSCNLKIPMNENSVF--CDSCQQGKSHRLPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNKS

Query:  QALTLFQQYKTLVENRFGKTIKILQTDWGGEFRSFPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRLP
        Q    F  +K L+ENRF   I    +D GGEF +   +    GI    S PHT + NG+ ERKHRHIVE GLTLL+ +S+P  +W  AF+ A+Y+INRLP
Subjt:  QALTLFQQYKTLVENRFGKTIKILQTDWGGEFRSFPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRLP

Query:  TPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKFQFHSVKCVFLGYSISHKGYKCLS-PSGRVYISRHVVFNE-------------------
        TP+L   SP+QK FG  P++   + FGC+C+P LRPY  +K    S +CVFLGYS++   Y CL   + R+YISRHV F+E                   
Subjt:  TPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKFQFHSVKCVFLGYSISHKGYKCLS-PSGRVYISRHVVFNE-------------------

Query:  ------------VDHQPPCLTSVEFQAPSPAAALPSS-----------------------DASENPTA-------------------------SAGSPTM
                    +  + P L +     P  AA  PSS                        +S  PTA                         S  +PT 
Subjt:  ------------VDHQPPCLTSVEFQAPSPAAALPSS-----------------------DASENPTA-------------------------SAGSPTM

Query:  APPVFEPCSNLGLPSSTAHVDVEP----ENSTARVVPPTTVV---------------PPVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKV
          P  +   +L  P+ ++     P     +S+    PP+ ++                P+N H M TRAK+GI+KP    S  V   S AA SEP +   
Subjt:  APPVFEPCSNLGLPSSTAHVDVEP----ENSTARVVPPTTVV---------------PPVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKV

Query:  ALQSEIWKKAMDAEIDALARNKTWRLVPPPP-NLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSV
        AL+ E W+ AM +EI+A   N TW LVPPPP ++ ++G +W+F  K   DGS +R KARLVAKGY+Q PG+D+ ETFSPV+K+ SIRI+L +AV R W +
Subjt:  ALQSEIWKKAMDAEIDALARNKTWRLVPPPP-NLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSV

Query:  RQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFFSNTNMVVLVLIYVDDILVTGDCP
        RQLD+NNAFL G L + VYM QP G++D   P YVCKL +ALYGL+QAPRAW+   R+ LL  GF +S +D+S+F       +V +L+YVDDIL+TG+ P
Subjt:  RQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFFSNTNMVVLVLIYVDDILVTGDCP

Query:  PFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLSHTRPD
          +   +  L+  F++KD   L Y+LG+E    P G++L Q +YI DLL R  +   KP +TP A   KLS + G  + +PT YR ++G+LQYL+ TRPD
Subjt:  PFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLSHTRPD

Query:  ISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTESEYR
        ISY VN+LSQF+  PT  H QA KR+LRYL GT  +G+FL KG  ++LS+ AY+DADWA   DD  S     V+LG   +SWSSKKQ  V RSSTE+EYR
Subjt:  ISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTESEYR

Query:  SLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLG
        S+ANT++E+ W+ + L EL +  + PP+I+CDNV A  L  NPVFHSR K + +D HF+R++V +  + V +V + +QLAD LTK L  + F +  SK+G
Subjt:  SLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLG

Query:  VLPSP
        V   P
Subjt:  VLPSP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.3e-26236.91Show/hide
Query:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGAT
        KL   NYL+W   V A+  G ++ G++ G+   P     I T++  +   NP Y  W   D+ +   + G++S ++   V    T+ ++W+ L ++Y   
Subjt:  KLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGSMSPAIAADVVSFKTSREVWKALEEVYGAT

Query:  SKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDI-KTWQELCSILITFEGTLARHSSVTNTR
        S   + QLR I +                    + L L G P+   + +   +  L  +Y P++  I  KD   +  E+   LI  E  L       N+ 
Subjt:  SKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDI-KTWQELCSILITFEGTLARHSSVTNTR

Query:  ELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRP-----TCQLCGKFGHSAPACYLRFEEDFNNPH
        E+  ++A++       +R +   R   NRG +   N   N  N++         +  S+  R++N +P      CQ+C   GHSA  C           H
Subjt:  ELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRP-----TCQLCGKFGHSAPACYLRFEEDFNNPH

Query:  AVNNTNNNNQNVNGAAVASSSAYLATPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPS
           +T N  Q+ +        A LA     N   WL DSGAT H+T+D  NL     Y G + + + +G+ + I+H G++++     + + LN++L+VP+
Subjt:  AVNNTNNNNQNVNGAAVASSSAYLATPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPS

Query:  IKRNLISISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKS
        I +NLIS+ RL   N V VEF P    VKD  +   ++QG  KD LY+  + S Q                               V    S  S  + S
Subjt:  IKRNLISISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKS

Query:  LWHRRLSHASYKVINVVARSCNLKIPMNENSVF-CDSCQQGKSHRLPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNK
         WH RL H S  ++N V  + +L +    + +  C  C   KSH++PFS S   + +PLE I+ D+W  SP++S   Y+YY+ FVD FTR T +YP++ K
Subjt:  LWHRRLSHASYKVINVVARSCNLKIPMNENSVF-CDSCQQGKSHRLPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNK

Query:  SQALTLFQQYKTLVENRFGKTIKILQTDWGGEFRSFPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRL
        SQ    F  +K+LVENRF   I  L +D GGEF     +L   GI    S PHT + NG+ ERKHRHIVE+GLTLL+ +S+P  +W  AFS A+Y+INRL
Subjt:  SQALTLFQQYKTLVENRFGKTIKILQTDWGGEFRSFPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRL

Query:  PTPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKFQFHSVKCVFLGYSISHKGYKCLS-PSGRVYISRHVVFNE------------------
        PTP+L   SP+QK FGQ P++   K FGC+C+P LRPY  +K +  S +C F+GYS++   Y CL  P+GR+Y SRHV F+E                  
Subjt:  PTPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPYQSNKFQFHSVKCVFLGYSISHKGYKCLS-PSGRVYISRHVVFNE------------------

Query:  -------------------VDHQPPCL-----------------------------------TSVEFQAPSPAAALPSSDASENPTASAGSPTM------
                           V   PPCL                                   +S E  APS     P++   +   +++ SP +      
Subjt:  -------------------VDHQPPCL-----------------------------------TSVEFQAPSPAAALPSSDASENPTASAGSPTM------

Query:  APPVFEPCSNLGLPS---STAHVDV---------EPENSTARVVPPTTVVP-----------PVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPS
        +P    P  N  LP    S+ H+            P +S+    P   V+P           PVN H M TRAK GI KP    S+    TS AA SEP 
Subjt:  APPVFEPCSNLGLPS---STAHVDV---------EPENSTARVVPPTTVVP-----------PVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPS

Query:  SVKVALQSEIWKKAMDAEIDALARNKTWRLV-PPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSR
        +   A++ + W++AM +EI+A   N TW LV PPPP++ ++G +W+F  K   DGS +R KARLVAKGY+Q PG+D+ ETFSPV+K+ SIRI+L +AV R
Subjt:  SVKVALQSEIWKKAMDAEIDALARNKTWRLV-PPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSR

Query:  GWSVRQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFFSNTNMVVLVLIYVDDILVT
         W +RQLD+NNAFL G L + VYM QP G+VD   P+YVC+L +A+YGL+QAPRAW+   R+ LL  GF +S +D+S+F       ++ +L+YVDDIL+T
Subjt:  GWSVRQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFFSNTNMVVLVLIYVDDILVT

Query:  GDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLSH
        G+    ++  +  L+  F++K+   L Y+LG+E    P G++L Q +Y  DLL R  +   KP +TP AT  KL+   G  +P+PT YR ++G+LQYL+ 
Subjt:  GDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLSH

Query:  TRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTE
        TRPD+SY VN+LSQ++  PT  HW A KRVLRYL GT ++G+FL KG  ++LS+ AY+DADWA   DD  S     V+LG   +SWSSKKQ  V RSSTE
Subjt:  TRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTE

Query:  SEYRSLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLR
        +EYRS+ANT++E+ W+ + L EL +  S PP+I+CDNV A  L  NPVFHSR K + LD HF+R++V +  + V +V + +QLAD LTK L    F +  
Subjt:  SEYRSLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLR

Query:  SKLGVLPSPSRLRGDVR
         K+GV+  P    G +R
Subjt:  SKLGVLPSPSRLRGDVR

Arabidopsis top hitse value%identityAlignment
AT1G64960.1 ARM repeat superfamily protein5.2e-20443.19Show/hide
Query:  KCVELDVH-FVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLGV------LPSPSRLRGDVRQEVANSNVRQNSLHLLLDVFPLENPDATK
        K VELD+   + D  L   ID     S+   A +L + L    F S R+  GV      L  P   R     +VANSNVR N+LHLLLD+FP+E+PDATK
Subjt:  KCVELDVH-FVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRSKLGV------LPSPSRLRGDVRQEVANSNVRQNSLHLLLDVFPLENPDATK

Query:  ELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLSTLNGIIYLFGNPQSHEILKVILPRLGHLM
        E KDTLLD+QF+L+EKLL DE  DVR VAVEG  R+ YLFWE+IPS+TITK++TKIFD+MS +  +EVRLST+NGI YL  NPQSH ILKVILPRLGHLM
Subjt:  ELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDISNEVRLSTLNGIIYLFGNPQSHEILKVILPRLGHLM

Query:  LDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRCITLIKRSPMAGARFCEFAASEGASLKSI
        LD+  SVRVA+ DLLLLIRDVR FQFN VVSLDVLL+VLA DQ  +++ I RLL+PSYFP++   EEAC RC TLI R+P AGARFCEF  S GA++KS+
Subjt:  LDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRCITLIKRSPMAGARFCEFAASEGASLKSI

Query:  MELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLFNIVSLFSPDGFTDLLEECMQLITNCRGL
        + LV   +N V S DKL  N  +GLL +A YL   + ++      LK+L  GEKL+ LL+ A +  A+SS+ +I+++ SPD  +++LE+CM L+ NC GL
Subjt:  MELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLFNIVSLFSPDGFTDLLEECMQLITNCRGL

Query:  SGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRLKHFGGKKCLSFEDDYFVAVGMSWQVKDL
          D  +Q E+RS HK   + +A   +      I+QK+AYRC I FG E+ + ++   KRKK K SGK   R KH  GK  +SFE+DY VAVG++WQ+KDL
Subjt:  SGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRLKHFGGKKCLSFEDDYFVAVGMSWQVKDL

Query:  LSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKNKIVTDSSSSEASAYLMLDDKTMLYCDVL
        L+ E  + ++L S  IE + L+LKV+   SI+Q    ++MDV P                                                        
Subjt:  LSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKNKIVTDSSSSEASAYLMLDDKTMLYCDVL

Query:  AILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQG------KNLTAVLKFIVDAISMGFLSEKYE
         I++QT+DH+L C  +L+ + D   P     G      ++ +K   S  NQ  R +    D + +  K+G      K LTA+ KF V++  MG  S    
Subjt:  AILEQTLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQG------KNLTAVLKFIVDAISMGFLSEKYE

Query:  LCLKFVSEYMQLITSILGQQVYKDIQFE-VETKESFLCLKSSLTYAAKLLNQILRHVKDSS--LTQISILGHNLLDLIASIEVNLGSGYAARLVALAKAW
           KF S Y++ + SI        ++FE  + KE  LC KSS +YA K +N ++RH  ++S  L +   L ++LLDL   +E +LGS YA+R+V+    W
Subjt:  LCLKFVSEYMQLITSILGQQVYKDIQFE-VETKESFLCLKSSLTYAAKLLNQILRHVKDSS--LTQISILGHNLLDLIASIEVNLGSGYAARLVALAKAW

Query:  LPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIVTFLKRDHRILDAIGVLFMIG
        +PDL+LALG  C +     E ++   +   K  FPSWL   AKIEL ++++ED  E            F   KK    I T +K + ++LDAIG + ++ 
Subjt:  LPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYFPSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIVTFLKRDHRILDAIGVLFMIG

Query:  SEIGLERKDFGLVLGLLQFVCRSLYSADDREWG--DMMLASLQRCYPQIEREIEQCNGEG-LHQLDRAKALLQPIWLYHIFETGKLSMMNE
          + +E++D+   LGLL  VC  L  ++DREW   D ML  L R YP IEREI +   E  +  L+ A+ LLQP+W+YH++ETG+  MM+E
Subjt:  SEIGLERKDFGLVLGLLQFVCRSLYSADDREWG--DMMLASLQRCYPQIEREIEQCNGEG-LHQLDRAKALLQPIWLYHIFETGKLSMMNE

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 85.1e-11139.68Show/hide
Query:  SNLGLPSSTAHVDVEPENSTARVVPPTTVVPPVNDHHMRTRAKSGI------------------------VKPKVWGSFMVENTSDAAISEPSSVKVALQ
        S+    +S++ +D+ P  +    VP     P V+  H RTR  + +                        V P ++ SF+V     A   EPS+   A +
Subjt:  SNLGLPSSTAHVDVEPENSTARVVPPTTVVPPVNDHHMRTRAKSGI------------------------VKPKVWGSFMVENTSDAAISEPSSVKVALQ

Query:  SEIWKKAMDAEIDALARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLD
          +W  AMD EI A+    TW +   PPN   IG KWV+K+K   DG+ +R KARLVAKGY Q  GIDF ETFSPV K  S+++ILA++    +++ QLD
Subjt:  SEIWKKAMDAEIDALARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLD

Query:  INNAFLNGKLNEAVYMKQPDGYV----DSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFFSNTNMVVLVLIYVDDILVTGDCP
        I+NAFLNG L+E +YMK P GY     DS  P  VC L +++YGL+QA R WF +F  TL+ +GF  S +D + F      + + VL+YVDDI++  +  
Subjt:  INNAFLNGKLNEAVYMKQPDGYV----DSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSSSKADSSVFFFSNTNMVVLVLIYVDDILVTGDCP

Query:  PFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLSHTRPD
          +++   +L + F L+D+G L Y+LG+E+  + AG+ + Q KY  DLL   GL   KPSS P       SA  G    +   YR ++G L YL  TR D
Subjt:  PFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYRSVLGALQYLSHTRPD

Query:  ISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTESEYR
        IS+ VNKLSQF + P   H QA  ++L Y+KGT   GLF      + + +  ++DA +    D ++S    C+FLG SL+SW SKKQ VV++SS E+EYR
Subjt:  ISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQHVVARSSTESEYR

Query:  SLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDK
        +L+    E+ W+  F +EL+L  S P +++CDN +A+ +A N VFH R+K +E D H VR++
Subjt:  SLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDK

ATMG00240.1 Gag-Pol-related retrotransposon family protein3.0e-1041.25Show/hide
Query:  YLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATC
        YL+ TRPD+++ VN+LSQF     +   QA  +VL Y+KGT   GLF     +S L + A+ D+DWA   D ++S+   C
Subjt:  YLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATC

ATMG00810.1 DNA/RNA polymerases superfamily protein7.0e-5244.1Show/hide
Query:  VLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYR
        +L+YVDDIL+TG     +   + +L++ F++KD+G + Y+LG+++ + P+G++L QTKY   +L+  G+   KP STP    K  S+      P+P+ +R
Subjt:  VLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDPMPNPTLYR

Query:  SVLGALQYLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSK
        S++GALQYL+ TRPDISY VN + Q + +PT   +   KRVLRY+KGT  +GL++ K  +S L++ A+ D+DWA     ++S    C FLG +++SWS+K
Subjt:  SVLGALQYLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSK

Query:  KQHVVARSSTESEYRSLANTAAEVSWVQA
        +Q  V+RSSTE+EYR+LA TAAE++W  A
Subjt:  KQHVVARSSTESEYRSLANTAAEVSWVQA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.4e-2953.91Show/hide
Query:  MRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKVALQSEIWKKAMDAEIDALARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGY
        M TR+K+GI K     S  +  T      EP SV  AL+   W +AM  E+DAL+RNKTW LVPPP N N++G KWVFK K   DG+ DR KARLVAKG+
Subjt:  MRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKVALQSEIWKKAMDAEIDALARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCKARLVAKGY

Query:  HQLPGIDFHETFSPVVKAPSIRIILALA
        HQ  GI F ET+SPVV+  +IR IL +A
Subjt:  HQLPGIDFHETFSPVVKAPSIRIILALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAATGATAGCTCGTCTTCCTCTGTAGCTGCTGCAGTCTCTAGCCCTAATCCTTCGACTGCTAGCCTCATTAGCTCATCCTTTGGTCATCCCCTGAGCACTGTGCT
CACGGTGAAACTTGATGAGAAGAATTACCTCTTATGGAGAGGGATGGTTCTCGCGATTCTCCGAGGCCAAAAAGTAGATGGCTACGTTCTTGGAACGAAAGCCCAGCCAG
CTGAGTTCGTTGAAATTTCTACTGAGTCCGGTAAGAAGTCTGAACCGAATCCTCTGTATGAAGAATGGGTTACGGTTGACCAAGCGTTGTCCGGCTGGTTATTTGGTTCT
ATGTCTCCGGCGATTGCGGCCGATGTTGTCAGTTTTAAGACTTCTAGAGAAGTCTGGAAAGCGTTAGAGGAAGTTTACGGTGCGACAAGCAAGGCGAGGATAAATCAACT
TCGTGGTATTCTGCAAAACACGAAGAAAGGGACTACAAAAATGGTGGAATACCTAGCCACCATGAAGCAAGCCTCTGAGAACTTACAACTAGCTGGTAATCCTATCTCTC
TAAGTGACTTGATTTCTTATGCTGTAGCTGGCCTAGACTCAGAGTATATACCCATCGTGTGCTCCATTGACGATAAAGATATTAAGACCTGGCAAGAGTTGTGTTCCATT
CTTATTACTTTTGAAGGGACTCTTGCTCGCCACAGTTCCGTTACAAATACGAGGGAGTTACCTGATCTGTCTGCACATCTAGCTTACAATCAGTTTAGTCAAAATCGGTT
TTCTGGACAAGGTCGTTTTAATCCCAACAGAGGTAGTCATAATCAAGGAAATGTACAAGGCAATGCCCCAAATAATTTCTTTCGAGGTGGTAATCGAGGAAGGGGTCGTG
GAAGAAGCAACTACCAAAGGAACAACAACTCGCGACCTACTTGTCAATTGTGCGGTAAGTTCGGGCACTCTGCTCCAGCTTGCTATCTCAGGTTCGAGGAGGATTTTAAT
AATCCACATGCTGTGAATAACACCAACAACAACAATCAGAATGTGAATGGTGCAGCAGTTGCTTCTTCTTCAGCCTATTTGGCAACACCAGATATCCTGAACGATCCTAA
GTGGCTAGCTGATTCTGGTGCTACAAGCCATGTGACTACTGACGCTGGTAACTTGGGTATGAAGAATGAGTACAATGGTAATGAAACTCTAGCTGTTGGTAATGGAACTA
GGCTTAATATTTCTCATATTGGTAATAGTACAGTAACGTGCCCTGATAATACTCATATTGTTCTGAATAGAATGCTTCATGTACCTAGTATAAAGAGAAATCTGATTAGC
ATATCTCGTCTGACTGCGGATAATAACGTGTTTGTTGAGTTTCACCCTCATTTTTGTCTTGTGAAGGACAAGGTTTCAAAGAAGGTGATGGTGCAAGGAACTCTTAAAGA
TGGGTTGTATCAATTGGAGCTCCCCTCAATACAAAAACGTCAATCCAATGTCAGTACCAACTCGTGTTTTGTTGCTATTTCTGCTTCCAAAGAGTCATCTGTTGCTTGCT
CGTCTGTTGTGTTTCAAGTAGAAAAGTCTAGGTCTGAGTCTAGTGTCTTGTCAAAGTCTTTGTGGCATCGACGTCTAAGTCATGCATCTTATAAAGTTATCAATGTTGTA
GCTCGGTCTTGTAATTTGAAAATTCCTATGAATGAAAACTCTGTGTTCTGTGATTCATGTCAACAAGGAAAGTCTCATCGTCTTCCTTTTTCTCGCTCTATTACTTGTAC
ACGGCAACCACTTGAACTCATACACTGTGATCTTTGGGGACCTTCCCCAGTTGTCTCGACCTTAGGTTATAAATACTATATCAGCTTCGTAGATGATTTTACTCGCTTAA
CTCACATCTACCCAATGCAAAATAAGAGTCAGGCCCTAACCTTGTTCCAACAATATAAAACTCTAGTTGAAAACAGATTTGGCAAAACCATCAAAATCCTTCAGACTGAT
TGGGGGGGTGAGTTCCGTTCTTTTCCTCATTTTCTTCAAAATGTAGGAATTGAGTTTAGACATTCTTGCCCCCATACTAGTCAACAGAATGGTGTGGTTGAGCGCAAACA
CCGACATATAGTTGAAGTTGGTCTCACACTCCTTGCCCAGTCCTCTCTGCCGTTATATTTCTGGTGGGATGCGTTCTCTACTGCCATTTATATTATTAATCGTCTTCCTA
CACCTGTTCTCGGTAATATCTCTCCTTGGCAAAAAGCCTTTGGTCAAATCCCTGATTTTCGCTTCTTTAAAACCTTTGGCTGCTCTTGCTTCCCCTGTCTTCGTCCTTAT
CAATCCAACAAGTTCCAATTTCATAGTGTTAAGTGTGTCTTTCTTGGCTATAGTATTTCTCACAAAGGTTACAAGTGTTTGAGTCCAAGTGGTCGTGTTTACATATCTCG
TCATGTCGTCTTTAATGAAGTTGATCATCAACCTCCTTGCCTTACGTCTGTTGAGTTCCAAGCACCATCTCCTGCTGCGGCCCTGCCTTCATCTGACGCTTCTGAGAATC
CTACTGCCAGTGCTGGCTCTCCTACTATGGCACCACCTGTTTTTGAGCCTTGTTCTAATTTGGGGTTGCCTAGTTCCACTGCTCATGTAGATGTTGAACCTGAGAACTCT
ACTGCCCGTGTTGTGCCTCCTACCACAGTTGTTCCACCTGTGAATGATCACCATATGCGTACGAGAGCCAAGAGTGGCATTGTCAAGCCTAAAGTCTGGGGGTCCTTTAT
GGTTGAAAATACCTCAGATGCTGCCATCTCTGAGCCCTCTTCAGTCAAGGTTGCTTTACAATCTGAGATATGGAAGAAAGCTATGGATGCTGAGATAGACGCTTTAGCAC
GAAACAAGACTTGGAGGTTAGTTCCTCCACCGCCTAATCTTAACTTAATTGGCAGTAAGTGGGTATTTAAAGTGAAAACCAAGCCCGATGGTTCTTTTGATCGATGCAAG
GCTCGGCTTGTCGCGAAAGGGTATCATCAGTTACCTGGCATAGATTTTCATGAGACCTTTAGTCCGGTAGTTAAAGCTCCCAGCATTCGTATTATCCTTGCTTTAGCTGT
CTCGCGAGGTTGGTCAGTTAGACAGCTTGACATAAATAATGCTTTCCTCAATGGGAAGCTTAATGAGGCGGTCTACATGAAACAACCCGATGGTTATGTTGACTCTTCCC
ATCCAGAATATGTTTGTAAACTAGATCGTGCTCTCTATGGTTTACGACAGGCTCCAAGGGCGTGGTTTGATCGCTTTCGTTCTACTCTGTTAGCTTGGGGTTTCTCTAGT
TCCAAGGCCGACAGTTCTGTCTTCTTCTTCTCGAATACTAATATGGTCGTGCTGGTTCTTATCTATGTTGATGACATATTGGTTACAGGTGATTGTCCACCGTTTATTGA
GCAGTTTGTTCGTCGTCTGAATGCTGTTTTTGCTCTCAAGGACATTGGAGCTTTAAGCTATTATCTCGGTGTTGAGGTTCATTCTACTCCTGCAGGTATGTATCTTCTTC
AGACCAAGTACATCAATGACTTGCTACATCGGGTTGGTTTATCTCATCTCAAACCTTCCTCAACTCCGGCTGCCACTGGTAAGAAACTGTCTGCATTTGAAGGTGATCCT
ATGCCTAATCCCACACTGTATAGGAGTGTTCTAGGGGCCTTACAGTACTTATCTCACACTCGACCGGATATTTCATATATTGTGAACAAATTAAGTCAATTTCTCCAACA
ACCCACGTCGGTGCACTGGCAAGCAGCTAAACGTGTGCTCCGCTATCTCAAGGGCACCTCTGAATATGGCCTTTTTCTTCCTAAGGGCCATTCTTCATCCTTGTCCATTG
TTGCTTATACGGATGCGGATTGGGCCTGTGGAGTTGATGACAAGAAATCCATTGCTGCTACCTGTGTGTTTCTCGGCCCTTCCTTGGTCTCTTGGTCCTCGAAGAAACAA
CATGTCGTTGCTCGCTCCAGTACAGAGTCCGAGTATCGGTCTCTTGCCAACACTGCTGCTGAGGTCTCGTGGGTGCAAGCTTTCCTTAAAGAGCTTCGCCTTTGCTCATC
CTCGCCGCCGATTATATGGTGTGATAATGTTAGTGCACTCTCGTTAGCTCAGAATCCAGTATTTCACTCACGCAGCAAATGTGTCGAACTTGATGTTCACTTCGTCCGCG
ATAAAGTCCTTGCCAAAGAAATTGATGTCAGGTATGTTCCTTCTACAGAGCAGCTGGCTGATGCCTTGACTAAAGCTCTTCCCGAGTCTCAGTTCTCATCCCTTCGCAGC
AAACTCGGTGTGCTTCCCTCACCCTCTCGTTTGAGGGGGGATGTTAGGCAGGAGGTTGCAAATTCAAATGTTCGTCAAAATTCATTGCATCTACTTTTGGATGTGTTTCC
TCTCGAAAATCCAGATGCTACGAAAGAGCTCAAAGACACATTGCTGGATAGACAGTTCTTTTTAATAGAGAAACTTCTCATGGATGAAAGTTCAGATGTGAGAGTTGTAG
CAGTTGAAGGTTGTTGTCGCATTCTTTATTTATTTTGGGAAATTATCCCTTCATCAACCATCACAAAGATTATTACCAAAATTTTTGACGAAATGTCGCGTGATATATCC
AATGAAGTCAGGCTTTCAACATTGAATGGCATAATCTATTTGTTTGGAAACCCCCAATCCCATGAGATTCTTAAAGTGATCTTGCCGAGGTTGGGGCATTTGATGCTGGA
TAATGCTCTTTCGGTAAGAGTAGCTCTAGCAGATCTCCTCCTTCTTATCAGGGATGTTCGTGATTTCCAGTTTAATAAGGTGGTCAGTTTAGATGTATTGTTAACGGTAC
TTGCACATGATCAACCCATTATTAGTCAGAAAATTACTAGATTGTTGATGCCATCGTATTTTCCCACCAAAGTATCAATTGAAGAGGCATGCAGTCGCTGCATAACACTG
ATAAAGAGATCTCCCATGGCTGGAGCGAGATTTTGCGAATTTGCTGCATCAGAAGGGGCATCTCTTAAGTCAATTATGGAACTTGTTCGAGTATTAATCAATTTGGTTTC
GTCCTCTGACAAGCTGGATGCAAATTACATTGATGGTTTACTTCTTTCAGCTAAATACCTCAGCAGCAGCATTTCGAGTGAACCATGTTACAAGAACGATCTCAAAGATT
TGTTCACTGGTGAAAAATTGGAGTGCTTGCTCTCTGTAGCACAATCAGGGTGTGCACGATCTTCTTTATTCAACATTGTGTCCTTGTTCTCTCCTGATGGTTTTACGGAT
CTTCTTGAGGAGTGTATGCAATTAATTACTAACTGTCGTGGTTTATCGGGAGACTTAGAAAAGCAAGCTGAAGTGAGGTCTGGACACAAGTTTTTCCAGGCTTGTGATGC
ACTGGATGTTATGTTCGAAGCCATGGCACTGATACTACAGAAATCTGCTTATCGTTGCCACATTAAATTTGGTACTGAGATACCGAAGCTCAGTGTTTCCCCTGCAAAGA
GAAAGAAATGCAAATTCTCAGGGAAAATACTATCTAGATTAAAGCATTTTGGCGGAAAAAAGTGTCTGAGCTTTGAAGATGACTATTTTGTTGCAGTAGGAATGTCGTGG
CAAGTTAAAGATTTGCTTTCAGATGAAAAAATGAAGAGTGCTTTATTGAGTTCCCAGACTATAGAGACAATATTTCTTACTTTAAAGGTTATCTGCGAAGTAAGTATTGT
GCAATGCTTAGATTATGACTTTATGGATGTAGCTCCCGTTCTAGCATATGCATCTCTTGCTTTGCACATGACTCTTCAGAAGGGCAGCCAAAGTATTACGAGTAGCAGTG
GAACCAAGAACAAAATTGTGACGGATTCTTCTAGTTCAGAGGCAAGTGCTTATTTAATGTTAGACGATAAAACCATGTTATATTGTGATGTTCTGGCAATATTGGAGCAG
ACGCTTGATCACTTGCTTAAATGCGTTAAGAAACTGTACGTGTCAGAGGATTCTATAAAACCTGATAAAGTTAAGCAAGGTAACAGAAAATTGGCTCAGCATGTAGAACA
AAAGCAGAAGGAATCTGGGATAAATCAATCTCATCGTCTTGATGGAGGATGTGTTGATACTTCAAAAAAGACGTTGAAGCAGGGGAAGAACCTGACAGCAGTCCTCAAGT
TCATTGTTGATGCTATTTCCATGGGTTTTCTTTCAGAAAAGTATGAATTGTGCTTGAAATTCGTGTCGGAGTATATGCAGCTCATCACGTCGATTTTAGGCCAACAAGTT
TATAAAGATATTCAGTTCGAGGTGGAAACAAAAGAAAGTTTTCTTTGTTTAAAGAGCTCGTTGACCTATGCTGCCAAGTTATTGAATCAAATATTAAGACATGTGAAAGA
TTCCTCGTTGACACAGATTTCTATTCTGGGTCATAATTTGCTTGATTTGATTGCCTCAATTGAAGTAAACTTGGGTTCTGGTTATGCAGCCCGTCTTGTTGCACTGGCAA
AGGCTTGGCTTCCTGATCTGATTTTGGCATTAGGAGCTGGTTGCTTAATGAGACCTGTTGAAGGTGAGGAGGCGCACATGTTGTTCTATGAGCAGACCAAATCCTATTTT
CCCTCATGGTTATCGGTGGTTGCAAAGATTGAACTTTCAGATATGAGTGAAGAAGATTTTGCCGAGGAGGAGGACGATGACAGATCTTTTGACCAACATAATTTTTCCAC
GTTTAAGAAATTTCTAAAGATGATTGTCACATTTTTAAAAAGAGATCACCGCATCTTGGATGCAATTGGAGTGCTTTTTATGATTGGGTCAGAAATTGGGTTGGAAAGAA
AGGATTTTGGGCTTGTACTGGGACTTCTACAGTTCGTGTGCCGAAGCCTGTATAGTGCTGACGACAGAGAGTGGGGTGATATGATGTTGGCTTCTTTGCAACGTTGTTAC
CCTCAGATAGAAAGAGAAATAGAACAATGTAATGGAGAAGGACTTCATCAGTTAGATAGGGCAAAGGCTTTGCTACAACCTATATGGTTGTACCACATTTTCGAAACTGG
TAAGCTTTCAATGATGAATGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAATGATAGCTCGTCTTCCTCTGTAGCTGCTGCAGTCTCTAGCCCTAATCCTTCGACTGCTAGCCTCATTAGCTCATCCTTTGGTCATCCCCTGAGCACTGTGCT
CACGGTGAAACTTGATGAGAAGAATTACCTCTTATGGAGAGGGATGGTTCTCGCGATTCTCCGAGGCCAAAAAGTAGATGGCTACGTTCTTGGAACGAAAGCCCAGCCAG
CTGAGTTCGTTGAAATTTCTACTGAGTCCGGTAAGAAGTCTGAACCGAATCCTCTGTATGAAGAATGGGTTACGGTTGACCAAGCGTTGTCCGGCTGGTTATTTGGTTCT
ATGTCTCCGGCGATTGCGGCCGATGTTGTCAGTTTTAAGACTTCTAGAGAAGTCTGGAAAGCGTTAGAGGAAGTTTACGGTGCGACAAGCAAGGCGAGGATAAATCAACT
TCGTGGTATTCTGCAAAACACGAAGAAAGGGACTACAAAAATGGTGGAATACCTAGCCACCATGAAGCAAGCCTCTGAGAACTTACAACTAGCTGGTAATCCTATCTCTC
TAAGTGACTTGATTTCTTATGCTGTAGCTGGCCTAGACTCAGAGTATATACCCATCGTGTGCTCCATTGACGATAAAGATATTAAGACCTGGCAAGAGTTGTGTTCCATT
CTTATTACTTTTGAAGGGACTCTTGCTCGCCACAGTTCCGTTACAAATACGAGGGAGTTACCTGATCTGTCTGCACATCTAGCTTACAATCAGTTTAGTCAAAATCGGTT
TTCTGGACAAGGTCGTTTTAATCCCAACAGAGGTAGTCATAATCAAGGAAATGTACAAGGCAATGCCCCAAATAATTTCTTTCGAGGTGGTAATCGAGGAAGGGGTCGTG
GAAGAAGCAACTACCAAAGGAACAACAACTCGCGACCTACTTGTCAATTGTGCGGTAAGTTCGGGCACTCTGCTCCAGCTTGCTATCTCAGGTTCGAGGAGGATTTTAAT
AATCCACATGCTGTGAATAACACCAACAACAACAATCAGAATGTGAATGGTGCAGCAGTTGCTTCTTCTTCAGCCTATTTGGCAACACCAGATATCCTGAACGATCCTAA
GTGGCTAGCTGATTCTGGTGCTACAAGCCATGTGACTACTGACGCTGGTAACTTGGGTATGAAGAATGAGTACAATGGTAATGAAACTCTAGCTGTTGGTAATGGAACTA
GGCTTAATATTTCTCATATTGGTAATAGTACAGTAACGTGCCCTGATAATACTCATATTGTTCTGAATAGAATGCTTCATGTACCTAGTATAAAGAGAAATCTGATTAGC
ATATCTCGTCTGACTGCGGATAATAACGTGTTTGTTGAGTTTCACCCTCATTTTTGTCTTGTGAAGGACAAGGTTTCAAAGAAGGTGATGGTGCAAGGAACTCTTAAAGA
TGGGTTGTATCAATTGGAGCTCCCCTCAATACAAAAACGTCAATCCAATGTCAGTACCAACTCGTGTTTTGTTGCTATTTCTGCTTCCAAAGAGTCATCTGTTGCTTGCT
CGTCTGTTGTGTTTCAAGTAGAAAAGTCTAGGTCTGAGTCTAGTGTCTTGTCAAAGTCTTTGTGGCATCGACGTCTAAGTCATGCATCTTATAAAGTTATCAATGTTGTA
GCTCGGTCTTGTAATTTGAAAATTCCTATGAATGAAAACTCTGTGTTCTGTGATTCATGTCAACAAGGAAAGTCTCATCGTCTTCCTTTTTCTCGCTCTATTACTTGTAC
ACGGCAACCACTTGAACTCATACACTGTGATCTTTGGGGACCTTCCCCAGTTGTCTCGACCTTAGGTTATAAATACTATATCAGCTTCGTAGATGATTTTACTCGCTTAA
CTCACATCTACCCAATGCAAAATAAGAGTCAGGCCCTAACCTTGTTCCAACAATATAAAACTCTAGTTGAAAACAGATTTGGCAAAACCATCAAAATCCTTCAGACTGAT
TGGGGGGGTGAGTTCCGTTCTTTTCCTCATTTTCTTCAAAATGTAGGAATTGAGTTTAGACATTCTTGCCCCCATACTAGTCAACAGAATGGTGTGGTTGAGCGCAAACA
CCGACATATAGTTGAAGTTGGTCTCACACTCCTTGCCCAGTCCTCTCTGCCGTTATATTTCTGGTGGGATGCGTTCTCTACTGCCATTTATATTATTAATCGTCTTCCTA
CACCTGTTCTCGGTAATATCTCTCCTTGGCAAAAAGCCTTTGGTCAAATCCCTGATTTTCGCTTCTTTAAAACCTTTGGCTGCTCTTGCTTCCCCTGTCTTCGTCCTTAT
CAATCCAACAAGTTCCAATTTCATAGTGTTAAGTGTGTCTTTCTTGGCTATAGTATTTCTCACAAAGGTTACAAGTGTTTGAGTCCAAGTGGTCGTGTTTACATATCTCG
TCATGTCGTCTTTAATGAAGTTGATCATCAACCTCCTTGCCTTACGTCTGTTGAGTTCCAAGCACCATCTCCTGCTGCGGCCCTGCCTTCATCTGACGCTTCTGAGAATC
CTACTGCCAGTGCTGGCTCTCCTACTATGGCACCACCTGTTTTTGAGCCTTGTTCTAATTTGGGGTTGCCTAGTTCCACTGCTCATGTAGATGTTGAACCTGAGAACTCT
ACTGCCCGTGTTGTGCCTCCTACCACAGTTGTTCCACCTGTGAATGATCACCATATGCGTACGAGAGCCAAGAGTGGCATTGTCAAGCCTAAAGTCTGGGGGTCCTTTAT
GGTTGAAAATACCTCAGATGCTGCCATCTCTGAGCCCTCTTCAGTCAAGGTTGCTTTACAATCTGAGATATGGAAGAAAGCTATGGATGCTGAGATAGACGCTTTAGCAC
GAAACAAGACTTGGAGGTTAGTTCCTCCACCGCCTAATCTTAACTTAATTGGCAGTAAGTGGGTATTTAAAGTGAAAACCAAGCCCGATGGTTCTTTTGATCGATGCAAG
GCTCGGCTTGTCGCGAAAGGGTATCATCAGTTACCTGGCATAGATTTTCATGAGACCTTTAGTCCGGTAGTTAAAGCTCCCAGCATTCGTATTATCCTTGCTTTAGCTGT
CTCGCGAGGTTGGTCAGTTAGACAGCTTGACATAAATAATGCTTTCCTCAATGGGAAGCTTAATGAGGCGGTCTACATGAAACAACCCGATGGTTATGTTGACTCTTCCC
ATCCAGAATATGTTTGTAAACTAGATCGTGCTCTCTATGGTTTACGACAGGCTCCAAGGGCGTGGTTTGATCGCTTTCGTTCTACTCTGTTAGCTTGGGGTTTCTCTAGT
TCCAAGGCCGACAGTTCTGTCTTCTTCTTCTCGAATACTAATATGGTCGTGCTGGTTCTTATCTATGTTGATGACATATTGGTTACAGGTGATTGTCCACCGTTTATTGA
GCAGTTTGTTCGTCGTCTGAATGCTGTTTTTGCTCTCAAGGACATTGGAGCTTTAAGCTATTATCTCGGTGTTGAGGTTCATTCTACTCCTGCAGGTATGTATCTTCTTC
AGACCAAGTACATCAATGACTTGCTACATCGGGTTGGTTTATCTCATCTCAAACCTTCCTCAACTCCGGCTGCCACTGGTAAGAAACTGTCTGCATTTGAAGGTGATCCT
ATGCCTAATCCCACACTGTATAGGAGTGTTCTAGGGGCCTTACAGTACTTATCTCACACTCGACCGGATATTTCATATATTGTGAACAAATTAAGTCAATTTCTCCAACA
ACCCACGTCGGTGCACTGGCAAGCAGCTAAACGTGTGCTCCGCTATCTCAAGGGCACCTCTGAATATGGCCTTTTTCTTCCTAAGGGCCATTCTTCATCCTTGTCCATTG
TTGCTTATACGGATGCGGATTGGGCCTGTGGAGTTGATGACAAGAAATCCATTGCTGCTACCTGTGTGTTTCTCGGCCCTTCCTTGGTCTCTTGGTCCTCGAAGAAACAA
CATGTCGTTGCTCGCTCCAGTACAGAGTCCGAGTATCGGTCTCTTGCCAACACTGCTGCTGAGGTCTCGTGGGTGCAAGCTTTCCTTAAAGAGCTTCGCCTTTGCTCATC
CTCGCCGCCGATTATATGGTGTGATAATGTTAGTGCACTCTCGTTAGCTCAGAATCCAGTATTTCACTCACGCAGCAAATGTGTCGAACTTGATGTTCACTTCGTCCGCG
ATAAAGTCCTTGCCAAAGAAATTGATGTCAGGTATGTTCCTTCTACAGAGCAGCTGGCTGATGCCTTGACTAAAGCTCTTCCCGAGTCTCAGTTCTCATCCCTTCGCAGC
AAACTCGGTGTGCTTCCCTCACCCTCTCGTTTGAGGGGGGATGTTAGGCAGGAGGTTGCAAATTCAAATGTTCGTCAAAATTCATTGCATCTACTTTTGGATGTGTTTCC
TCTCGAAAATCCAGATGCTACGAAAGAGCTCAAAGACACATTGCTGGATAGACAGTTCTTTTTAATAGAGAAACTTCTCATGGATGAAAGTTCAGATGTGAGAGTTGTAG
CAGTTGAAGGTTGTTGTCGCATTCTTTATTTATTTTGGGAAATTATCCCTTCATCAACCATCACAAAGATTATTACCAAAATTTTTGACGAAATGTCGCGTGATATATCC
AATGAAGTCAGGCTTTCAACATTGAATGGCATAATCTATTTGTTTGGAAACCCCCAATCCCATGAGATTCTTAAAGTGATCTTGCCGAGGTTGGGGCATTTGATGCTGGA
TAATGCTCTTTCGGTAAGAGTAGCTCTAGCAGATCTCCTCCTTCTTATCAGGGATGTTCGTGATTTCCAGTTTAATAAGGTGGTCAGTTTAGATGTATTGTTAACGGTAC
TTGCACATGATCAACCCATTATTAGTCAGAAAATTACTAGATTGTTGATGCCATCGTATTTTCCCACCAAAGTATCAATTGAAGAGGCATGCAGTCGCTGCATAACACTG
ATAAAGAGATCTCCCATGGCTGGAGCGAGATTTTGCGAATTTGCTGCATCAGAAGGGGCATCTCTTAAGTCAATTATGGAACTTGTTCGAGTATTAATCAATTTGGTTTC
GTCCTCTGACAAGCTGGATGCAAATTACATTGATGGTTTACTTCTTTCAGCTAAATACCTCAGCAGCAGCATTTCGAGTGAACCATGTTACAAGAACGATCTCAAAGATT
TGTTCACTGGTGAAAAATTGGAGTGCTTGCTCTCTGTAGCACAATCAGGGTGTGCACGATCTTCTTTATTCAACATTGTGTCCTTGTTCTCTCCTGATGGTTTTACGGAT
CTTCTTGAGGAGTGTATGCAATTAATTACTAACTGTCGTGGTTTATCGGGAGACTTAGAAAAGCAAGCTGAAGTGAGGTCTGGACACAAGTTTTTCCAGGCTTGTGATGC
ACTGGATGTTATGTTCGAAGCCATGGCACTGATACTACAGAAATCTGCTTATCGTTGCCACATTAAATTTGGTACTGAGATACCGAAGCTCAGTGTTTCCCCTGCAAAGA
GAAAGAAATGCAAATTCTCAGGGAAAATACTATCTAGATTAAAGCATTTTGGCGGAAAAAAGTGTCTGAGCTTTGAAGATGACTATTTTGTTGCAGTAGGAATGTCGTGG
CAAGTTAAAGATTTGCTTTCAGATGAAAAAATGAAGAGTGCTTTATTGAGTTCCCAGACTATAGAGACAATATTTCTTACTTTAAAGGTTATCTGCGAAGTAAGTATTGT
GCAATGCTTAGATTATGACTTTATGGATGTAGCTCCCGTTCTAGCATATGCATCTCTTGCTTTGCACATGACTCTTCAGAAGGGCAGCCAAAGTATTACGAGTAGCAGTG
GAACCAAGAACAAAATTGTGACGGATTCTTCTAGTTCAGAGGCAAGTGCTTATTTAATGTTAGACGATAAAACCATGTTATATTGTGATGTTCTGGCAATATTGGAGCAG
ACGCTTGATCACTTGCTTAAATGCGTTAAGAAACTGTACGTGTCAGAGGATTCTATAAAACCTGATAAAGTTAAGCAAGGTAACAGAAAATTGGCTCAGCATGTAGAACA
AAAGCAGAAGGAATCTGGGATAAATCAATCTCATCGTCTTGATGGAGGATGTGTTGATACTTCAAAAAAGACGTTGAAGCAGGGGAAGAACCTGACAGCAGTCCTCAAGT
TCATTGTTGATGCTATTTCCATGGGTTTTCTTTCAGAAAAGTATGAATTGTGCTTGAAATTCGTGTCGGAGTATATGCAGCTCATCACGTCGATTTTAGGCCAACAAGTT
TATAAAGATATTCAGTTCGAGGTGGAAACAAAAGAAAGTTTTCTTTGTTTAAAGAGCTCGTTGACCTATGCTGCCAAGTTATTGAATCAAATATTAAGACATGTGAAAGA
TTCCTCGTTGACACAGATTTCTATTCTGGGTCATAATTTGCTTGATTTGATTGCCTCAATTGAAGTAAACTTGGGTTCTGGTTATGCAGCCCGTCTTGTTGCACTGGCAA
AGGCTTGGCTTCCTGATCTGATTTTGGCATTAGGAGCTGGTTGCTTAATGAGACCTGTTGAAGGTGAGGAGGCGCACATGTTGTTCTATGAGCAGACCAAATCCTATTTT
CCCTCATGGTTATCGGTGGTTGCAAAGATTGAACTTTCAGATATGAGTGAAGAAGATTTTGCCGAGGAGGAGGACGATGACAGATCTTTTGACCAACATAATTTTTCCAC
GTTTAAGAAATTTCTAAAGATGATTGTCACATTTTTAAAAAGAGATCACCGCATCTTGGATGCAATTGGAGTGCTTTTTATGATTGGGTCAGAAATTGGGTTGGAAAGAA
AGGATTTTGGGCTTGTACTGGGACTTCTACAGTTCGTGTGCCGAAGCCTGTATAGTGCTGACGACAGAGAGTGGGGTGATATGATGTTGGCTTCTTTGCAACGTTGTTAC
CCTCAGATAGAAAGAGAAATAGAACAATGTAATGGAGAAGGACTTCATCAGTTAGATAGGGCAAAGGCTTTGCTACAACCTATATGGTTGTACCACATTTTCGAAACTGG
TAAGCTTTCAATGATGAATGAGTAG
Protein sequenceShow/hide protein sequence
MANDSSSSSVAAAVSSPNPSTASLISSSFGHPLSTVLTVKLDEKNYLLWRGMVLAILRGQKVDGYVLGTKAQPAEFVEISTESGKKSEPNPLYEEWVTVDQALSGWLFGS
MSPAIAADVVSFKTSREVWKALEEVYGATSKARINQLRGILQNTKKGTTKMVEYLATMKQASENLQLAGNPISLSDLISYAVAGLDSEYIPIVCSIDDKDIKTWQELCSI
LITFEGTLARHSSVTNTRELPDLSAHLAYNQFSQNRFSGQGRFNPNRGSHNQGNVQGNAPNNFFRGGNRGRGRGRSNYQRNNNSRPTCQLCGKFGHSAPACYLRFEEDFN
NPHAVNNTNNNNQNVNGAAVASSSAYLATPDILNDPKWLADSGATSHVTTDAGNLGMKNEYNGNETLAVGNGTRLNISHIGNSTVTCPDNTHIVLNRMLHVPSIKRNLIS
ISRLTADNNVFVEFHPHFCLVKDKVSKKVMVQGTLKDGLYQLELPSIQKRQSNVSTNSCFVAISASKESSVACSSVVFQVEKSRSESSVLSKSLWHRRLSHASYKVINVV
ARSCNLKIPMNENSVFCDSCQQGKSHRLPFSRSITCTRQPLELIHCDLWGPSPVVSTLGYKYYISFVDDFTRLTHIYPMQNKSQALTLFQQYKTLVENRFGKTIKILQTD
WGGEFRSFPHFLQNVGIEFRHSCPHTSQQNGVVERKHRHIVEVGLTLLAQSSLPLYFWWDAFSTAIYIINRLPTPVLGNISPWQKAFGQIPDFRFFKTFGCSCFPCLRPY
QSNKFQFHSVKCVFLGYSISHKGYKCLSPSGRVYISRHVVFNEVDHQPPCLTSVEFQAPSPAAALPSSDASENPTASAGSPTMAPPVFEPCSNLGLPSSTAHVDVEPENS
TARVVPPTTVVPPVNDHHMRTRAKSGIVKPKVWGSFMVENTSDAAISEPSSVKVALQSEIWKKAMDAEIDALARNKTWRLVPPPPNLNLIGSKWVFKVKTKPDGSFDRCK
ARLVAKGYHQLPGIDFHETFSPVVKAPSIRIILALAVSRGWSVRQLDINNAFLNGKLNEAVYMKQPDGYVDSSHPEYVCKLDRALYGLRQAPRAWFDRFRSTLLAWGFSS
SKADSSVFFFSNTNMVVLVLIYVDDILVTGDCPPFIEQFVRRLNAVFALKDIGALSYYLGVEVHSTPAGMYLLQTKYINDLLHRVGLSHLKPSSTPAATGKKLSAFEGDP
MPNPTLYRSVLGALQYLSHTRPDISYIVNKLSQFLQQPTSVHWQAAKRVLRYLKGTSEYGLFLPKGHSSSLSIVAYTDADWACGVDDKKSIAATCVFLGPSLVSWSSKKQ
HVVARSSTESEYRSLANTAAEVSWVQAFLKELRLCSSSPPIIWCDNVSALSLAQNPVFHSRSKCVELDVHFVRDKVLAKEIDVRYVPSTEQLADALTKALPESQFSSLRS
KLGVLPSPSRLRGDVRQEVANSNVRQNSLHLLLDVFPLENPDATKELKDTLLDRQFFLIEKLLMDESSDVRVVAVEGCCRILYLFWEIIPSSTITKIITKIFDEMSRDIS
NEVRLSTLNGIIYLFGNPQSHEILKVILPRLGHLMLDNALSVRVALADLLLLIRDVRDFQFNKVVSLDVLLTVLAHDQPIISQKITRLLMPSYFPTKVSIEEACSRCITL
IKRSPMAGARFCEFAASEGASLKSIMELVRVLINLVSSSDKLDANYIDGLLLSAKYLSSSISSEPCYKNDLKDLFTGEKLECLLSVAQSGCARSSLFNIVSLFSPDGFTD
LLEECMQLITNCRGLSGDLEKQAEVRSGHKFFQACDALDVMFEAMALILQKSAYRCHIKFGTEIPKLSVSPAKRKKCKFSGKILSRLKHFGGKKCLSFEDDYFVAVGMSW
QVKDLLSDEKMKSALLSSQTIETIFLTLKVICEVSIVQCLDYDFMDVAPVLAYASLALHMTLQKGSQSITSSSGTKNKIVTDSSSSEASAYLMLDDKTMLYCDVLAILEQ
TLDHLLKCVKKLYVSEDSIKPDKVKQGNRKLAQHVEQKQKESGINQSHRLDGGCVDTSKKTLKQGKNLTAVLKFIVDAISMGFLSEKYELCLKFVSEYMQLITSILGQQV
YKDIQFEVETKESFLCLKSSLTYAAKLLNQILRHVKDSSLTQISILGHNLLDLIASIEVNLGSGYAARLVALAKAWLPDLILALGAGCLMRPVEGEEAHMLFYEQTKSYF
PSWLSVVAKIELSDMSEEDFAEEEDDDRSFDQHNFSTFKKFLKMIVTFLKRDHRILDAIGVLFMIGSEIGLERKDFGLVLGLLQFVCRSLYSADDREWGDMMLASLQRCY
PQIEREIEQCNGEGLHQLDRAKALLQPIWLYHIFETGKLSMMNE