; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006217 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006217
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionFPL domain-containing protein
Genome locationchr6:39448997..39480996
RNA-Seq ExpressionLag0006217
SyntenyLag0006217
Gene Ontology termsGO:0001708 - cell fate specification (biological process)
GO:0008333 - endosome to lysosome transport (biological process)
GO:0009718 - anthocyanin-containing compound biosynthetic process (biological process)
GO:0016197 - endosomal transport (biological process)
GO:0044090 - positive regulation of vacuole organization (biological process)
GO:1901096 - regulation of autophagosome maturation (biological process)
GO:1903415 - flavonoid transport from endoplasmic reticulum to plant-type vacuole (biological process)
GO:0005770 - late endosome (cellular component)
GO:0005794 - Golgi apparatus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0019898 - extrinsic component of membrane (cellular component)
GO:0036020 - endolysosome membrane (cellular component)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR019155 - CLEC16A/TT9, N-terminal
IPR039272 - CLEC16A/TT9
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586113.1 Protein TRANSPARENT TESTA 9, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0091Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIM EFVRIL ISRTA++SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFR+DELLSYYISFLRAISGKLNKNTISLLVKTQ DKVVSFPMYVEAI+FAFHEE MIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
         FFRKQCI LNELVFET+RSTELSTSTILAAVDEIEDNLYYISDV+SAGIPDVG LITDNILRHLIFPLLLPSLRIEAVNGI IGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT
        KIKDLANTISAAFFCPLDAFSPHCE  +NG I QLCCEN  S SSG D +VRQPLDAESLRKEV DSSA K E EDGTVK  C GS LELR+ALLSHIT 
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT

Query:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF
        GDDVQVLGALSVLATLLQTKELDESMLDA+GILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGI IEL+GYL KLKDYGIS F+KVGASPR+HRF
Subjt:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF

Query:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS
        EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNS HLKLLK+SYKYCA+ELL+EARGIWSDFLIILL DEWKKCKR IEAPSPRKEPKCMLLH 
Subjt:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS

Query:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF
        AKASVV D VPPESSFAAG++MSELVKVFVLLHQLQSFS GKALSEQPGI PPSEI E SRAKVA LD SGPK GA++RLDG+VPCRIAFERGKERHFYF
Subjt:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF

Query:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE
        LG SMGT GWIILAEE+PSK N GIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTP KLKTKPFVDGRWILAFQDED+CKSALSMV+EE
Subjt:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE

Query:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        INLQSKEVERRLKPLVDLERAVDSSDAS+CS+KTLTSN TPNLL
Subjt:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

XP_004149506.1 protein TRANSPARENT TESTA 9 isoform X1 [Cucumis sativus]0.0e+0091.45Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KI IVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTAT+SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
        TFFRKQCI+LNELV ETMRSTE STSTILAAVDEIEDNLYYISDV+SAGIPDVGRLITDNILRHLIFPLLLPSLRIE VNG QIGA TSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG
        KIKDLANTISAAFFCPLDAFSP  E  +NGN+T+L CE+RSQSSG D +VRQPLDAESLR+EV D S PKTE ED TVK    GS+LELR ALLSHITTG
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG

Query:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE
        DD+QVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSS++TSVKGGIDIELDGYL KLKDYGIS F+K GASPR HRFE
Subjt:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE

Query:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA
        VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKY ATELLQEARGIWSDFLIILL DEWKKCKRAIEAPSPRKEPK MLLHSA
Subjt:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA

Query:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG
        KASVVD VPPESSFAAG+KMSELVKVFVLLHQLQSFSLGKALSEQP IDPPSEIS+ SRAKVAGLD SGPK GA+LRLDGAVPCRIAFERGKERHFYFLG
Subjt:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG

Query:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN
         ++GT GWIILAEE PSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDP KHGTPLKLK KPFVDGRWILAFQD+DTCKSA SMV+EEIN
Subjt:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN

Query:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        LQSKEVERRLKPLV LERAVDSSDA LCS K+LTSNT PNL+
Subjt:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

XP_008451065.1 PREDICTED: uncharacterized protein LOC103492441 isoform X4 [Cucumis melo]0.0e+0091.09Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISR AT+SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPH +YFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
        TFFRKQCI+LN+LV ETMRS E STSTILAAVDEIEDNLYYISDV+SAGIPDVGRLITDNILRHLIFPLLLPSLRIE VNG QIGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG
        KIKDLANTISAAFFCPLDAFSP  E  +NGN+T+LCCE+RSQSSG D +VRQPLDAESL KEV DSSAPKTE +D TVK  C GS+LELR ALLSHITTG
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG

Query:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE
        DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSS+NTSVKGGIDIELDGYL KLKDYGIS F+K GASPR HRFE
Subjt:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE

Query:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA
        VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKY ATELLQEARGIWSDFLIILL DEWKKCKRAIEAPSPRKEPK MLLHSA
Subjt:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA

Query:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG
        KAS VD VPPESSFAAG+KMSELVKVFVLLHQLQSFSLGKALS+QP IDPPSEI + SRAKVAGLD SGPK GA+LRLDGAVPCRIAFERGKERHF+FLG
Subjt:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG

Query:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN
         ++GT GWIILAEELPSKLNCGI+RVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDP KHGTPLK K KPFVDGRWILAFQDEDTCK A SMV+EEIN
Subjt:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN

Query:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        LQSKEVERRLKPLV LERAVDS+DA LCS K+LTSNT PNL+
Subjt:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

XP_022965446.1 uncharacterized protein LOC111465350 [Cucurbita maxima]0.0e+0090.76Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIM EFVRIL ISRTA++SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFR+DELLSYYISFLRAISGKLNKNTISLLVKTQ DKVVSFPMYVEAI+FAFHEE MIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
         FFRKQCI LNELVFET+RSTELSTSTILAAVDEIEDNLYYISDV+SAGIPDVG LITDNILRHLIFPLLLPSLRIEAVNGI IGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT
        KIKDLANTISAAFFCPLDAFSPH E  +NG + QLCCEN  S SSG D +VRQPLDAESLRKEV DSSA K E EDGTVK  C GS LELR+ALLSHIT 
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT

Query:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF
        GDDVQVLGALSVLATLLQTKELDESMLDA+GILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGI+IELDGYL KLKDYGIS F+KVGASPR+HRF
Subjt:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF

Query:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS
        EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKYCA+ELL+EARGIWSDFLIILL DEWKKCKR IEAPSPRKEPKCMLLH 
Subjt:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS

Query:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF
        AKASV+ D VPPESSFAAG++MSELVKVFVLLHQLQSFS GKALSEQPGIDPPSEI E SRAKVA LD SGPK GA++RLDG+VPCRIAFERGKERHFYF
Subjt:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF

Query:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE
        LG SMGT GWIILAEE+PSK N GIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFL PAKHGTP KLKTKPFVDGRWILAFQDED+CKSALSMV+EE
Subjt:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE

Query:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        INLQSKEVERRLKPLVDLER+VDSSDAS+CS+KTLTSN TPNLL
Subjt:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

XP_038889909.1 protein TRANSPARENT TESTA 9 isoform X1 [Benincasa hispida]0.0e+0092.87Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTAT+SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
        TFFRKQCINLNELVFETMRS E STSTILAAVDEIEDNLYYISDV+SAGIPDVGRLITDNILRHLIFPLLLPSLRIE VNG QIGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG
        KIKDLANTISAAFFCPLDAFSPH EG ++GN+TQLCCE RS+SSG D +VRQPLDAES+RKEV DSSAPKTE ED TVK DC GS++ELR ALLSHITTG
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG

Query:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE
        DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSS+NTSVKGGIDIELDGYL KLKDYGIS F+K GASPR HRFE
Subjt:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE

Query:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA
        VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNS+HLKLLK+SYKY ATELLQEARGIWSDFLIILL DEWKKCKRAIEAPSPRKEPK MLLHSA
Subjt:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA

Query:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG
        KAS VD VPPESSFAAG+KMSELVKVFVLLHQLQSFSLGKALSEQP ID PSEISE SRAKVAGLD SGPK GA+LRLDGAVPCRIAFERGKERHFYFLG
Subjt:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG

Query:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN
         ++GTSGWIILAEELPSKLNCGIIRVAAPLAGSNPR+DEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLK KPFVDGRWILAFQD+DTCKSALSMV+EEIN
Subjt:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN

Query:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        LQSKEVERRLKPLV LERAVDSSDASLCS K+LTSNT+PNL+
Subjt:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

TrEMBL top hitse value%identityAlignment
A0A0A0LKC6 FPL domain-containing protein0.0e+0091.45Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KI IVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTAT+SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
        TFFRKQCI+LNELV ETMRSTE STSTILAAVDEIEDNLYYISDV+SAGIPDVGRLITDNILRHLIFPLLLPSLRIE VNG QIGA TSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG
        KIKDLANTISAAFFCPLDAFSP  E  +NGN+T+L CE+RSQSSG D +VRQPLDAESLR+EV D S PKTE ED TVK    GS+LELR ALLSHITTG
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG

Query:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE
        DD+QVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSS++TSVKGGIDIELDGYL KLKDYGIS F+K GASPR HRFE
Subjt:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE

Query:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA
        VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKY ATELLQEARGIWSDFLIILL DEWKKCKRAIEAPSPRKEPK MLLHSA
Subjt:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA

Query:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG
        KASVVD VPPESSFAAG+KMSELVKVFVLLHQLQSFSLGKALSEQP IDPPSEIS+ SRAKVAGLD SGPK GA+LRLDGAVPCRIAFERGKERHFYFLG
Subjt:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG

Query:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN
         ++GT GWIILAEE PSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDP KHGTPLKLK KPFVDGRWILAFQD+DTCKSA SMV+EEIN
Subjt:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN

Query:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        LQSKEVERRLKPLV LERAVDSSDA LCS K+LTSNT PNL+
Subjt:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

A0A1S3BQN1 uncharacterized protein LOC103492441 isoform X40.0e+0091.09Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISR AT+SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPH +YFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
        TFFRKQCI+LN+LV ETMRS E STSTILAAVDEIEDNLYYISDV+SAGIPDVGRLITDNILRHLIFPLLLPSLRIE VNG QIGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG
        KIKDLANTISAAFFCPLDAFSP  E  +NGN+T+LCCE+RSQSSG D +VRQPLDAESL KEV DSSAPKTE +D TVK  C GS+LELR ALLSHITTG
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG

Query:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE
        DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSS+NTSVKGGIDIELDGYL KLKDYGIS F+K GASPR HRFE
Subjt:  DDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRFE

Query:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA
        VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKY ATELLQEARGIWSDFLIILL DEWKKCKRAIEAPSPRKEPK MLLHSA
Subjt:  VLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHSA

Query:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG
        KAS VD VPPESSFAAG+KMSELVKVFVLLHQLQSFSLGKALS+QP IDPPSEI + SRAKVAGLD SGPK GA+LRLDGAVPCRIAFERGKERHF+FLG
Subjt:  KASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLG

Query:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN
         ++GT GWIILAEELPSKLNCGI+RVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDP KHGTPLK K KPFVDGRWILAFQDEDTCK A SMV+EEIN
Subjt:  ISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEIN

Query:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        LQSKEVERRLKPLV LERAVDS+DA LCS K+LTSNT PNL+
Subjt:  LQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

A0A1S3BRC0 uncharacterized protein LOC103492441 isoform X10.0e+0088.47Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISR AT+SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPH +YFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
        TFFRKQCI+LN+LV ETMRS E STSTILAAVDEIEDNLYYISDV+SAGIPDVGRLITDNILRHLIFPLLLPSLRIE VNG QIGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG
        KIKDLANTISAAFFCPLDAFSP  E  +NGN+T+LCCE+RSQSSG D +VRQPLDAESL KEV DSSAPKTE +D TVK  C GS+LELR ALLSHITTG
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITTG

Query:  DDVQVLGALSVLATLLQTK-------------------------ELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGY
        DDVQVLGALSVLATLLQTK                         ELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSS+NTSVKGGIDIELDGY
Subjt:  DDVQVLGALSVLATLLQTK-------------------------ELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGY

Query:  LSKLKDYGISCFIKVGASPRVHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDE
        L KLKDYGIS F+K GASPR HRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKY ATELLQEARGIWSDFLIILL DE
Subjt:  LSKLKDYGISCFIKVGASPRVHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDE

Query:  WKKCKRAIEAPSPRKEPKCMLLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGAD
        WKKCKRAIEAPSPRKEPK MLLHSAKAS VD VPPESSFAAG+KMSELVKVFVLLHQLQSFSLGKALS+QP IDPPSEI + SRAKVAGLD SGPK GA+
Subjt:  WKKCKRAIEAPSPRKEPKCMLLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGAD

Query:  LRLDGAVPCRIAFERGKERHFYFLGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVD
        LRLDGAVPCRIAFERGKERHF+FLG ++GT GWIILAEELPSKLNCGI+RVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDP KHGTPLK K KPFVD
Subjt:  LRLDGAVPCRIAFERGKERHFYFLGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVD

Query:  GRWILAFQDEDTCKSALSMVMEEINLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        GRWILAFQDEDTCK A SMV+EEINLQSKEVERRLKPLV LERAVDS+DA LCS K+LTSNT PNL+
Subjt:  GRWILAFQDEDTCKSALSMVMEEINLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

A0A6J1FHA0 uncharacterized protein LOC1114441380.0e+0090.76Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQLLKIQIVNEVNKDFV+EALRSISELITYGDQHDASFFEFFMEKQIM EFVRIL ISRTA++SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFR+DELLSYYISFLRAISGKLNKNTISLLVKTQ DKVVSFPMYVEAI+FAFHEE MIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
         FFRKQCI LNELVFET+RSTELSTSTILAAVDEIEDNLYYISDV+SAGIPDVG LITDNILRHLIFPLLLPSLRIEAVNGI IGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT
        KIKDLANTISAAFFCPLDAFS HCE  +NG + QLCCEN  S SSG D  VRQPLDAESLRKEV DSSA K E EDGTVK  C GS LELR+ALLSHIT 
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT

Query:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF
        GDDVQVLGALSVLATLLQTKELDESMLDA+GILPQRKQHKKLLLEALVGEDSGEEQLFSSENT+VKGGI IELDGYL KLKDYGIS F+KVGASPR+HRF
Subjt:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF

Query:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS
        EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKYCA+ELL+EARGIWSDFLIILL DEWKKCKR IEAPSPRKEPKCMLLH 
Subjt:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS

Query:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF
        AKASVV D VPPESSFAAG++MSELVKVFVLLHQLQSFS GKALSEQPGI PPSEI E SRAKVA LD SGPK GA++RLDG+VPCRIAFERGKERHFYF
Subjt:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF

Query:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE
        LG SMGT GWIILAEE+PSK N GIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTP KLKTKPFVDGRWILAFQDED+CKSALSMV+EE
Subjt:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE

Query:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        INLQSKEVERRLKPLVDLERAVDSSDAS+CS+KTLTSN TPNLL
Subjt:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

A0A6J1HLP9 uncharacterized protein LOC1114653500.0e+0090.76Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIM EFVRIL ISRTA++SLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITYAFDFR+DELLSYYISFLRAISGKLNKNTISLLVKTQ DKVVSFPMYVEAI+FAFHEE MIRTAVRALTLNVYHVGDDYVNRFITSPPH EYFSNLV
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV
         FFRKQCI LNELVFET+RSTELSTSTILAAVDEIEDNLYYISDV+SAGIPDVG LITDNILRHLIFPLLLPSLRIEAVNGI IGAVTSLYLLCCILRIV
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIV

Query:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT
        KIKDLANTISAAFFCPLDAFSPH E  +NG + QLCCEN  S SSG D +VRQPLDAESLRKEV DSSA K E EDGTVK  C GS LELR+ALLSHIT 
Subjt:  KIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENR-SQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHITT

Query:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF
        GDDVQVLGALSVLATLLQTKELDESMLDA+GILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGI+IELDGYL KLKDYGIS F+KVGASPR+HRF
Subjt:  GDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYGISCFIKVGASPRVHRF

Query:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS
        EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLK+SYKYCA+ELL+EARGIWSDFLIILL DEWKKCKR IEAPSPRKEPKCMLLH 
Subjt:  EVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCMLLHS

Query:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF
        AKASV+ D VPPESSFAAG++MSELVKVFVLLHQLQSFS GKALSEQPGIDPPSEI E SRAKVA LD SGPK GA++RLDG+VPCRIAFERGKERHFYF
Subjt:  AKASVV-DTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYF

Query:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE
        LG SMGT GWIILAEE+PSK N GIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFL PAKHGTP KLKTKPFVDGRWILAFQDED+CKSALSMV+EE
Subjt:  LGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEE

Query:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL
        INLQSKEVERRLKPLVDLER+VDSSDAS+CS+KTLTSN TPNLL
Subjt:  INLQSKEVERRLKPLVDLERAVDSSDASLCSAKTLTSNTTPNLL

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.1e-5138.11Show/hide
Query:  NFKEAIAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRY-LTCTRPDIL
        NFK  +  +F M+D+  + +++GI++   E+ I++SQ  Y K++L KFNM NC  V+ P+        + +DE+   +T  +SL+G L Y + CTRPD+ 
Subjt:  NFKEAIAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRY-LTCTRPDIL

Query:  FSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDF--KLMGYCDSDFAGDIDDRKNTTSYVFFMGN-NAISWCSKKQPNVTLSTCESEYVA
         +V I+SR+            K++LRY++GT+D ++ +  +  F  K++GY DSD+AG   DRK+TT Y+F M + N I W +K+Q +V  S+ E+EY+A
Subjt:  FSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDF--KLMGYCDSDFAGDIDDRKNTTSYVFFMGN-NAISWCSKKQPNVTLSTCESEYVA

Query:  ATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTKPL
               A+WL+ LL  I++   +  KI  DN+    +A N   H R+KHID +YHF RE ++   + LEY+ + +Q+A+IFTKPL
Subjt:  ATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTKPL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-5241.58Show/hide
Query:  MFANFKEAIAREFEMSDMGLMSYYLGIQV--NQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKM---KTDEEK--VDSTMFKSLVGSLRY
        + A  K  +++ F+M D+G     LG+++   +    +++SQE Y + +L++FNM N KPV+ P+    KLSK     T EEK  +    + S VGSL Y
Subjt:  MFANFKEAIAREFEMSDMGLMSYYLGIQV--NQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKM---KTDEEK--VDSTMFKSLVGSLRY

Query:  -LTCTRPDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLST
         + CTRPDI  +VG+VSRFLE P + H  A K ILRY+RGT    + +  S+   L GY D+D AGDID+RK++T Y+F     AISW SK Q  V LST
Subjt:  -LTCTRPDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLST

Query:  CESEYVAATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTK
         E+EY+AAT      IWL+R L+E+ + Q +   +  D++SA  L+KN ++H R+KHID RYH+IRE ++ + +++  + + +  A++ TK
Subjt:  CESEYVAATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTK

Q54GS1 Protein CLEC16A homolog6.3e-5637.14Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYL + L K QI+   NK+ ++E+LR I+E++ +GDQHD  FF+FF+EK IMG F++ L    +  +++QLLQT+SI+++NL++E++IY+L S  ++N+
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        +I + FDF ++E+L Y++S L+A+S KL+K+T++      N     FP+Y EAI+F  H+E MIR A+R LTLN++ V +  +  +I +     YFSN+V
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVN---GIQIGAVTSLYLLCCIL
         F R+ C++L++++    +S   S+  +   +DE+ D  YY+ D+ + G   +  ++ +  +++LI+P+ + SL     N     +I    +LYLL  + 
Subjt:  TFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVN---GIQIGAVTSLYLLCCIL

Query:  RIVKIKDLANTISAA
         I   K L +TIS+A
Subjt:  RIVKIKDLANTISAA

Q8W4P9 Protein TRANSPARENT TESTA 91.0e-28764.03Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KIQIVNE NKD VIEALRSI+E++TYGDQHD  FFEFFMEKQ+MGEFVRIL++S+T T+S+QLLQTMSI+IQNLKSE AIYYLFS E++N 
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITY FDF+++ELLSYYISFLRA+SGKLNK+TISLL+KT+ND VVSFP+YVE IQFAFHEENMIRTAVRALTLNVYHVGD+ VN ++ SPPH EYFS L+
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRS-TELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRI
        +FF+KQC++L+ +V  T++S +  S   + +AVD IED LYY SDV+SAGIPD+GRLITD+IL+HL  PLLLPSL  EAVN I +  VTSLYLL CILRI
Subjt:  TFFRKQCINLNELVFETMRS-TELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRI

Query:  VKIKDLANTISAAFFCPLDAFSPHCEGTINGNITQ--LCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHI
        VKIKDLAN  +A  FCP+ AF        N ++    L   N     G  +   Q     S    + D        ED T K   + S +  R+ LL +I
Subjt:  VKIKDLANTISAAFFCPLDAFSPHCEGTINGNITQ--LCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHI

Query:  TTGDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKD-YGISCFIKVGA-SPR
        + GDDVQ  G+L VLATLLQTKEL+ESMLDA GILPQRKQHKKLLL++LVGED+GEEQLFS  N S++ G+  ELD YL +L++ +G+ C +   A  PR
Subjt:  TTGDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKD-YGISCFIKVGA-SPR

Query:  VHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCM
        VHR +V+D LV+L CR NISAE LWDGGWLLRQLLPYSEAEFN  HLK+L  SY+ C   L +E +GIW D LI +L DEW+KCKR IEAPSP+KEPK +
Subjt:  VHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCM

Query:  LLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERH
        LL   ++S  D    ESSF AG++M E+VKVFVLLHQLQ FSLG++L EQP I PP++ SE+SRA  AGLDVS PK G +L+L  AVPCRIAFERGKER 
Subjt:  LLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERH

Query:  FYFLGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMV
        F FL +S G SGWI+LA+      + GI+RV APLAG  PRIDEKH RWLHLRIRPSTLP LDP K G   KLK+K  VDGRWILAF+D+++C SA SMV
Subjt:  FYFLGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMV

Query:  MEEINLQSKEVERRLKPLVDLER
          EI+LQ  EVERRL+PL DLER
Subjt:  MEEINLQSKEVERRLKPLVDLER

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.8e-5137.85Show/hide
Query:  PKMFANFKEAIAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRYLTCTR
        P +  N  + +++ F + D   + Y+LGI+  ++   + +SQ  Y  +LL + NM+  KPV  PM  + KLS + +  +  D T ++ +VGSL+YL  TR
Subjt:  PKMFANFKEAIAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRYLTCTR

Query:  PDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLSTCESEYV
        PDI ++V  +S+F+ +PTE HL A K+ILRY+ GT ++ IF    N   L  Y D+D+AGD DD  +T  Y+ ++G++ ISW SKKQ  V  S+ E+EY 
Subjt:  PDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLSTCESEYV

Query:  AATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTKPLN
        +  + +    W+  LL E+ +       I  DN  A  L  N VFH R KHI   YHFIR  ++   + + +V + DQ+A+  TKPL+
Subjt:  AATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTKPLN

Arabidopsis top hitse value%identityAlignment
AT3G28430.1 unknown protein7.2e-28964.03Show/hide
Query:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK
        +RYLTDQL KIQIVNE NKD VIEALRSI+E++TYGDQHD  FFEFFMEKQ+MGEFVRIL++S+T T+S+QLLQTMSI+IQNLKSE AIYYLFS E++N 
Subjt:  IRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASFFEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNK

Query:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV
        LITY FDF+++ELLSYYISFLRA+SGKLNK+TISLL+KT+ND VVSFP+YVE IQFAFHEENMIRTAVRALTLNVYHVGD+ VN ++ SPPH EYFS L+
Subjt:  LITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEAIQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLV

Query:  TFFRKQCINLNELVFETMRS-TELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRI
        +FF+KQC++L+ +V  T++S +  S   + +AVD IED LYY SDV+SAGIPD+GRLITD+IL+HL  PLLLPSL  EAVN I +  VTSLYLL CILRI
Subjt:  TFFRKQCINLNELVFETMRS-TELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILRHLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRI

Query:  VKIKDLANTISAAFFCPLDAFSPHCEGTINGNITQ--LCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHI
        VKIKDLAN  +A  FCP+ AF        N ++    L   N     G  +   Q     S    + D        ED T K   + S +  R+ LL +I
Subjt:  VKIKDLANTISAAFFCPLDAFSPHCEGTINGNITQ--LCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEPEDGTVKIDCSGSQLELRKALLSHI

Query:  TTGDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKD-YGISCFIKVGA-SPR
        + GDDVQ  G+L VLATLLQTKEL+ESMLDA GILPQRKQHKKLLL++LVGED+GEEQLFS  N S++ G+  ELD YL +L++ +G+ C +   A  PR
Subjt:  TTGDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKD-YGISCFIKVGA-SPR

Query:  VHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCM
        VHR +V+D LV+L CR NISAE LWDGGWLLRQLLPYSEAEFN  HLK+L  SY+ C   L +E +GIW D LI +L DEW+KCKR IEAPSP+KEPK +
Subjt:  VHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPKCM

Query:  LLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERH
        LL   ++S  D    ESSF AG++M E+VKVFVLLHQLQ FSLG++L EQP I PP++ SE+SRA  AGLDVS PK G +L+L  AVPCRIAFERGKER 
Subjt:  LLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERH

Query:  FYFLGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMV
        F FL +S G SGWI+LA+      + GI+RV APLAG  PRIDEKH RWLHLRIRPSTLP LDP K G   KLK+K  VDGRWILAF+D+++C SA SMV
Subjt:  FYFLGISMGTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMV

Query:  MEEINLQSKEVERRLKPLVDLER
          EI+LQ  EVERRL+PL DLER
Subjt:  MEEINLQSKEVERRLKPLVDLER

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.6e-4136.44Show/hide
Query:  FEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRYLTCTRPDILFSVGIVSRFL
        F++ D+G + Y+LG+++ +    I + Q  YA +LL +  +L CKP + PM+ +   S   +  + VD+  ++ L+G L YL  TR DI F+V  +S+F 
Subjt:  FEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRYLTCTRPDILFSVGIVSRFL

Query:  EVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLSTCESEYVAATSYACHAIWLRR
        E P  +H  A  KIL YI+GT+   +FYSS  + +L  + D+ F    D R++T  Y  F+G + ISW SKKQ  V+ S+ E+EY A +      +WL +
Subjt:  EVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLSTCESEYVAATSYACHAIWLRR

Query:  LLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRE
          +E+ +     T +  DN +A  +A N VFH+R+KHI++  H +RE
Subjt:  LLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.9e-1037.66Show/hide
Query:  YLTCTRPDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSY
        YLT TRPD+ F+V  +S+F      + + A  K+L Y++GT+   +FYS+++D +L  + DSD+A   D R++ T +
Subjt:  YLTCTRPDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSY

ATMG00810.1 DNA/RNA polymerases superfamily protein5.7e-3636.82Show/hide
Query:  IAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRYLTCTRPDILFSVGIV
        ++  F M D+G + Y+LGIQ+      +F+SQ  YA+++L    ML+CKP++ P+    KL+   +  +  D + F+S+VG+L+YLT TRPDI ++V IV
Subjt:  IAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEEKVDSTMFKSLVGSLRYLTCTRPDILFSVGIV

Query:  SRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLSTCESEYVAATSYACHAI
         + +  PT +     K++LRY++GT+ + ++   ++   +  +CDSD+AG    R++TT +  F+G N ISW +K+QP V+ S+ E+EY A    A    
Subjt:  SRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPNVTLSTCESEYVAATSYACHAI

Query:  W
        W
Subjt:  W


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCTCAAACAAGCACCAAGAGCCTGAAATAGTCGACTTGACAAGTATTTGCAAGAAAATGGTTTTTCCAAATGCCAACATGAGTGACTGTCCAAAGATGTTTGCAAA
CTTTAAGGAGGCCATTGCTCGAGAGTTTGAGATGAGTGACATGGGCCTCATGTCCTATTACTTGGGAATCCAAGTAAATCAGATGGAGGAGGACATCTTTGTTTCACAAG
AAGTCTATGCAAAGGAGTTACTCAAGAAATTCAACATGCTTAATTGCAAGCCGGTCAACGCCCCAATGGAGTGTGCATCAAAGTTGTCCAAGATGAAAACAGATGAAGAA
AAGGTTGACTCTACCATGTTCAAGAGCCTTGTTGGAAGTTTGAGATACTTGACTTGCACAAGGCCTGATATTCTTTTTTCAGTTGGGATCGTGAGTCGTTTCTTGGAGGT
TCCTACTGAATCTCACTTGATGGCAGCCAAGAAGATACTTCGCTACATTAGAGGTACTCTTGACTATGAGATATTTTATTCATCCTCTAATGATTTCAAGTTGATGGGTT
ACTGTGACAGTGACTTTGCTGGAGATATTGATGATAGAAAAAACACTACTAGTTATGTGTTCTTTATGGGAAATAATGCAATATCTTGGTGTTCAAAGAAGCAACCAAAT
GTAACACTCTCCACTTGTGAAAGTGAGTATGTTGCAGCCACATCTTATGCTTGCCATGCTATATGGCTACGAAGATTGTTGAAGGAGATTCATATGGATCAAGGGGATGC
TACGAAGATCCTAATTGACAACAAGTCAGCTCAAGTCTTAGCTAAGAATCATGTGTTCCATGACCGCAGCAAGCACATTGACACCCGATATCACTTTATTCGGGAATGTA
TTGAGATGAAAAAAGTGGAGCTCGAGTATGTCAAGTCTATGGACCAAGTTGCTAACATCTTTACAAAGCCCTTAAACACGCCGTGGAAGATCAGCTGTCTCAAAACTCTA
GCGATCTATGAAGGGTTGCAGGTGATTCCGACGACTAATGTTCCTATCCTAGTGGAGAGCGATTTCCTTCAGATGGTGAGGTTGCTTTGTGGGGAGGAAGATGATATGAC
TGAACTAGCAAAATGGCTTGATTTTATATTTATGTTTCAAAATATCAACACGCTGTGGAAGATCAGCTGGCTCAGAACTCTAGCGATCTACGAAGGGTTGCGGGCGATTT
CGACAACTGATGTTCTTATCCTAGTGGAGAGCGATCCCTTCAGGTGGTCCAACGGAAGCCCAATTATCTCTATACAGTGGCCCAATCCACCGACTAAGTCCGACTTCATC
AATACAATTCTTTTGAGTCCGACGGAGAGGAATCTGGTTTGCAAGCTCCAAGCGTTTCTTCCTCTGTCTATCAAATCTTCCGTTTTGATTGAAAAGCTTTCTTCAATTGC
CCTGTGTCGGAGGCAGGCAGTTGCCGTGGTAGAGAAGAACTTCGCTGAAATCATATTGAAACCTTCAATTGTGGGATGGATTGGGTTTAGGGTACAGAATGATTACTGGG
TATCTGATTTCGGGCCTCTGATTGTTCAGCCATGTGGTTTTCCTTTTGGAGATCCAGGGATCGATTCTCGTTGGAAGAACTCAGAAAATCGATGTCTCGACGGCTGTGCA
CGACGACGCTTCACTCGGTTGGTTTCGGTCATCCCCTTTGCTGCCACTCCTCTCCGAAGCCGTCGACCATCGACTCTGATACCAATTGTTAGCCGATGTCTCGACGGCTA
TGCACGACGACGCTTCACTCGACTAGTTCTGGTCAGCCCTTCCACCGCCACTCCTCTCCGAAGCCGTCGACCACTGGCTCTGATACCAATTATTAGGTATTTAACTGATC
AGCTGCTAAAGATTCAAATAGTCAATGAGGTTAACAAGGATTTTGTTATTGAGGCACTGCGATCAATATCAGAGTTGATAACTTATGGTGACCAGCACGATGCTTCCTTC
TTTGAGTTTTTCATGGAGAAGCAAATCATGGGAGAATTTGTACGTATTTTGAAGATTAGCAGGACTGCAACTATTTCACTTCAATTGTTGCAGACAATGAGCATTATAAT
TCAGAACCTTAAAAGTGAACATGCTATATATTACCTATTCAGTACTGAACATATGAACAAGCTAATAACATATGCATTTGATTTCCGGAATGATGAGTTGTTGTCGTACT
ATATATCTTTCTTAAGAGCAATTAGTGGAAAGCTGAACAAGAACACAATCTCCCTACTTGTGAAAACTCAAAATGATAAAGTAGTATCTTTTCCAATGTATGTAGAGGCT
ATACAGTTTGCTTTTCATGAAGAAAATATGATCCGCACTGCAGTTCGGGCTTTGACGCTTAATGTATACCATGTTGGAGATGATTATGTCAATAGATTCATCACCAGCCC
TCCTCATGAAGAATATTTCTCAAATTTAGTGACCTTTTTTAGGAAGCAGTGCATTAACTTGAATGAGTTGGTCTTTGAGACAATGAGAAGCACTGAGCTCTCAACCTCTA
CAATTCTTGCTGCTGTAGATGAAATTGAAGACAACCTATACTATATTAGTGATGTTGTTTCTGCAGGGATTCCTGACGTGGGAAGATTAATAACCGATAACATTTTGCGG
CACTTGATCTTTCCTTTGCTTCTTCCGTCTCTGAGGATAGAGGCTGTTAATGGAATTCAAATTGGAGCTGTCACTTCTCTGTATTTACTTTGTTGCATTTTGCGCATAGT
TAAAATAAAAGATCTGGCAAACACCATCTCTGCTGCATTTTTCTGTCCATTGGATGCCTTCTCCCCACACTGTGAAGGTACAATTAATGGAAATATAACTCAATTATGTT
GCGAAAATAGAAGCCAGTCATCAGGAGGTGATGATGTTGTGAGGCAGCCGTTGGATGCTGAATCTTTAAGAAAAGAAGTATTGGATTCATCTGCTCCTAAAACTGAGCCA
GAAGATGGGACTGTGAAGATTGATTGTTCAGGCTCCCAATTGGAATTAAGGAAAGCTTTGCTTTCTCATATTACAACTGGGGATGATGTACAAGTCTTGGGTGCTTTAAG
CGTTCTAGCTACACTATTGCAGACTAAAGAACTGGATGAGTCGATGCTGGATGCTCTCGGAATCCTTCCTCAAAGAAAACAACATAAGAAATTGTTATTGGAAGCCTTAG
TTGGTGAGGATTCAGGCGAAGAACAACTATTTTCTTCAGAAAACACATCAGTGAAAGGTGGCATCGATATTGAACTTGATGGTTACCTATCGAAGCTTAAGGATTATGGC
ATTTCATGTTTTATTAAAGTAGGTGCAAGCCCTCGTGTCCATAGGTTTGAGGTGCTAGATGCATTGGTCAGTCTCTTTTGTCGTTCAAATATATCTGCAGAAATATTGTG
GGATGGCGGGTGGCTTCTGCGGCAGTTGTTACCTTATAGTGAGGCAGAGTTTAACAGCCATCATCTAAAATTGCTGAAAAATTCATATAAGTACTGCGCTACTGAGCTCC
TACAGGAAGCTAGAGGGATTTGGTCTGATTTCCTCATAATACTTCTTTATGACGAGTGGAAAAAGTGCAAAAGAGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAG
TGCATGCTCTTGCACTCTGCAAAGGCTTCTGTCGTAGACACGGTACCACCTGAATCATCATTTGCTGCTGGTAAAAAAATGTCTGAGTTGGTAAAGGTATTTGTTCTTCT
ACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAGCCCGGTATTGACCCTCCCTCAGAAATTTCTGAATCTTCCCGTGCAAAGGTTGCTGGACTAGATG
TTTCAGGACCTAAAGCGGGTGCTGATTTGAGACTTGATGGTGCTGTGCCTTGTAGAATTGCATTCGAGAGGGGCAAAGAGCGCCATTTTTACTTTCTTGGAATTTCTATG
GGAACTTCAGGATGGATAATTCTTGCTGAAGAACTGCCATCAAAACTGAATTGTGGAATTATTCGAGTTGCTGCACCTCTTGCTGGATCAAATCCTAGAATCGATGAAAA
GCATTCAAGGTGGCTGCACTTGAGGATTCGGCCATCAACATTACCCTTTCTGGATCCTGCTAAACATGGTACCCCGTTAAAGTTAAAGACAAAACCTTTTGTGGATGGGA
GATGGATCCTAGCATTCCAGGACGAGGATACTTGCAAATCTGCTTTGTCTATGGTTATGGAGGAGATTAATCTGCAAAGCAAGGAGGTAGAGAGAAGACTGAAACCATTA
GTCGACCTCGAAAGAGCTGTAGATTCTTCAGACGCATCTTTATGTTCTGCTAAGACACTGACCTCTAATACAACTCCTAATCTATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCTCAAACAAGCACCAAGAGCCTGAAATAGTCGACTTGACAAGTATTTGCAAGAAAATGGTTTTTCCAAATGCCAACATGAGTGACTGTCCAAAGATGTTTGCAAA
CTTTAAGGAGGCCATTGCTCGAGAGTTTGAGATGAGTGACATGGGCCTCATGTCCTATTACTTGGGAATCCAAGTAAATCAGATGGAGGAGGACATCTTTGTTTCACAAG
AAGTCTATGCAAAGGAGTTACTCAAGAAATTCAACATGCTTAATTGCAAGCCGGTCAACGCCCCAATGGAGTGTGCATCAAAGTTGTCCAAGATGAAAACAGATGAAGAA
AAGGTTGACTCTACCATGTTCAAGAGCCTTGTTGGAAGTTTGAGATACTTGACTTGCACAAGGCCTGATATTCTTTTTTCAGTTGGGATCGTGAGTCGTTTCTTGGAGGT
TCCTACTGAATCTCACTTGATGGCAGCCAAGAAGATACTTCGCTACATTAGAGGTACTCTTGACTATGAGATATTTTATTCATCCTCTAATGATTTCAAGTTGATGGGTT
ACTGTGACAGTGACTTTGCTGGAGATATTGATGATAGAAAAAACACTACTAGTTATGTGTTCTTTATGGGAAATAATGCAATATCTTGGTGTTCAAAGAAGCAACCAAAT
GTAACACTCTCCACTTGTGAAAGTGAGTATGTTGCAGCCACATCTTATGCTTGCCATGCTATATGGCTACGAAGATTGTTGAAGGAGATTCATATGGATCAAGGGGATGC
TACGAAGATCCTAATTGACAACAAGTCAGCTCAAGTCTTAGCTAAGAATCATGTGTTCCATGACCGCAGCAAGCACATTGACACCCGATATCACTTTATTCGGGAATGTA
TTGAGATGAAAAAAGTGGAGCTCGAGTATGTCAAGTCTATGGACCAAGTTGCTAACATCTTTACAAAGCCCTTAAACACGCCGTGGAAGATCAGCTGTCTCAAAACTCTA
GCGATCTATGAAGGGTTGCAGGTGATTCCGACGACTAATGTTCCTATCCTAGTGGAGAGCGATTTCCTTCAGATGGTGAGGTTGCTTTGTGGGGAGGAAGATGATATGAC
TGAACTAGCAAAATGGCTTGATTTTATATTTATGTTTCAAAATATCAACACGCTGTGGAAGATCAGCTGGCTCAGAACTCTAGCGATCTACGAAGGGTTGCGGGCGATTT
CGACAACTGATGTTCTTATCCTAGTGGAGAGCGATCCCTTCAGGTGGTCCAACGGAAGCCCAATTATCTCTATACAGTGGCCCAATCCACCGACTAAGTCCGACTTCATC
AATACAATTCTTTTGAGTCCGACGGAGAGGAATCTGGTTTGCAAGCTCCAAGCGTTTCTTCCTCTGTCTATCAAATCTTCCGTTTTGATTGAAAAGCTTTCTTCAATTGC
CCTGTGTCGGAGGCAGGCAGTTGCCGTGGTAGAGAAGAACTTCGCTGAAATCATATTGAAACCTTCAATTGTGGGATGGATTGGGTTTAGGGTACAGAATGATTACTGGG
TATCTGATTTCGGGCCTCTGATTGTTCAGCCATGTGGTTTTCCTTTTGGAGATCCAGGGATCGATTCTCGTTGGAAGAACTCAGAAAATCGATGTCTCGACGGCTGTGCA
CGACGACGCTTCACTCGGTTGGTTTCGGTCATCCCCTTTGCTGCCACTCCTCTCCGAAGCCGTCGACCATCGACTCTGATACCAATTGTTAGCCGATGTCTCGACGGCTA
TGCACGACGACGCTTCACTCGACTAGTTCTGGTCAGCCCTTCCACCGCCACTCCTCTCCGAAGCCGTCGACCACTGGCTCTGATACCAATTATTAGGTATTTAACTGATC
AGCTGCTAAAGATTCAAATAGTCAATGAGGTTAACAAGGATTTTGTTATTGAGGCACTGCGATCAATATCAGAGTTGATAACTTATGGTGACCAGCACGATGCTTCCTTC
TTTGAGTTTTTCATGGAGAAGCAAATCATGGGAGAATTTGTACGTATTTTGAAGATTAGCAGGACTGCAACTATTTCACTTCAATTGTTGCAGACAATGAGCATTATAAT
TCAGAACCTTAAAAGTGAACATGCTATATATTACCTATTCAGTACTGAACATATGAACAAGCTAATAACATATGCATTTGATTTCCGGAATGATGAGTTGTTGTCGTACT
ATATATCTTTCTTAAGAGCAATTAGTGGAAAGCTGAACAAGAACACAATCTCCCTACTTGTGAAAACTCAAAATGATAAAGTAGTATCTTTTCCAATGTATGTAGAGGCT
ATACAGTTTGCTTTTCATGAAGAAAATATGATCCGCACTGCAGTTCGGGCTTTGACGCTTAATGTATACCATGTTGGAGATGATTATGTCAATAGATTCATCACCAGCCC
TCCTCATGAAGAATATTTCTCAAATTTAGTGACCTTTTTTAGGAAGCAGTGCATTAACTTGAATGAGTTGGTCTTTGAGACAATGAGAAGCACTGAGCTCTCAACCTCTA
CAATTCTTGCTGCTGTAGATGAAATTGAAGACAACCTATACTATATTAGTGATGTTGTTTCTGCAGGGATTCCTGACGTGGGAAGATTAATAACCGATAACATTTTGCGG
CACTTGATCTTTCCTTTGCTTCTTCCGTCTCTGAGGATAGAGGCTGTTAATGGAATTCAAATTGGAGCTGTCACTTCTCTGTATTTACTTTGTTGCATTTTGCGCATAGT
TAAAATAAAAGATCTGGCAAACACCATCTCTGCTGCATTTTTCTGTCCATTGGATGCCTTCTCCCCACACTGTGAAGGTACAATTAATGGAAATATAACTCAATTATGTT
GCGAAAATAGAAGCCAGTCATCAGGAGGTGATGATGTTGTGAGGCAGCCGTTGGATGCTGAATCTTTAAGAAAAGAAGTATTGGATTCATCTGCTCCTAAAACTGAGCCA
GAAGATGGGACTGTGAAGATTGATTGTTCAGGCTCCCAATTGGAATTAAGGAAAGCTTTGCTTTCTCATATTACAACTGGGGATGATGTACAAGTCTTGGGTGCTTTAAG
CGTTCTAGCTACACTATTGCAGACTAAAGAACTGGATGAGTCGATGCTGGATGCTCTCGGAATCCTTCCTCAAAGAAAACAACATAAGAAATTGTTATTGGAAGCCTTAG
TTGGTGAGGATTCAGGCGAAGAACAACTATTTTCTTCAGAAAACACATCAGTGAAAGGTGGCATCGATATTGAACTTGATGGTTACCTATCGAAGCTTAAGGATTATGGC
ATTTCATGTTTTATTAAAGTAGGTGCAAGCCCTCGTGTCCATAGGTTTGAGGTGCTAGATGCATTGGTCAGTCTCTTTTGTCGTTCAAATATATCTGCAGAAATATTGTG
GGATGGCGGGTGGCTTCTGCGGCAGTTGTTACCTTATAGTGAGGCAGAGTTTAACAGCCATCATCTAAAATTGCTGAAAAATTCATATAAGTACTGCGCTACTGAGCTCC
TACAGGAAGCTAGAGGGATTTGGTCTGATTTCCTCATAATACTTCTTTATGACGAGTGGAAAAAGTGCAAAAGAGCAATTGAAGCCCCATCACCAAGGAAAGAACCAAAG
TGCATGCTCTTGCACTCTGCAAAGGCTTCTGTCGTAGACACGGTACCACCTGAATCATCATTTGCTGCTGGTAAAAAAATGTCTGAGTTGGTAAAGGTATTTGTTCTTCT
ACACCAACTTCAGTCATTTTCCCTTGGCAAGGCTTTGTCAGAACAGCCCGGTATTGACCCTCCCTCAGAAATTTCTGAATCTTCCCGTGCAAAGGTTGCTGGACTAGATG
TTTCAGGACCTAAAGCGGGTGCTGATTTGAGACTTGATGGTGCTGTGCCTTGTAGAATTGCATTCGAGAGGGGCAAAGAGCGCCATTTTTACTTTCTTGGAATTTCTATG
GGAACTTCAGGATGGATAATTCTTGCTGAAGAACTGCCATCAAAACTGAATTGTGGAATTATTCGAGTTGCTGCACCTCTTGCTGGATCAAATCCTAGAATCGATGAAAA
GCATTCAAGGTGGCTGCACTTGAGGATTCGGCCATCAACATTACCCTTTCTGGATCCTGCTAAACATGGTACCCCGTTAAAGTTAAAGACAAAACCTTTTGTGGATGGGA
GATGGATCCTAGCATTCCAGGACGAGGATACTTGCAAATCTGCTTTGTCTATGGTTATGGAGGAGATTAATCTGCAAAGCAAGGAGGTAGAGAGAAGACTGAAACCATTA
GTCGACCTCGAAAGAGCTGTAGATTCTTCAGACGCATCTTTATGTTCTGCTAAGACACTGACCTCTAATACAACTCCTAATCTATTGTGA
Protein sequenceShow/hide protein sequence
MGSNKHQEPEIVDLTSICKKMVFPNANMSDCPKMFANFKEAIAREFEMSDMGLMSYYLGIQVNQMEEDIFVSQEVYAKELLKKFNMLNCKPVNAPMECASKLSKMKTDEE
KVDSTMFKSLVGSLRYLTCTRPDILFSVGIVSRFLEVPTESHLMAAKKILRYIRGTLDYEIFYSSSNDFKLMGYCDSDFAGDIDDRKNTTSYVFFMGNNAISWCSKKQPN
VTLSTCESEYVAATSYACHAIWLRRLLKEIHMDQGDATKILIDNKSAQVLAKNHVFHDRSKHIDTRYHFIRECIEMKKVELEYVKSMDQVANIFTKPLNTPWKISCLKTL
AIYEGLQVIPTTNVPILVESDFLQMVRLLCGEEDDMTELAKWLDFIFMFQNINTLWKISWLRTLAIYEGLRAISTTDVLILVESDPFRWSNGSPIISIQWPNPPTKSDFI
NTILLSPTERNLVCKLQAFLPLSIKSSVLIEKLSSIALCRRQAVAVVEKNFAEIILKPSIVGWIGFRVQNDYWVSDFGPLIVQPCGFPFGDPGIDSRWKNSENRCLDGCA
RRRFTRLVSVIPFAATPLRSRRPSTLIPIVSRCLDGYARRRFTRLVLVSPSTATPLRSRRPLALIPIIRYLTDQLLKIQIVNEVNKDFVIEALRSISELITYGDQHDASF
FEFFMEKQIMGEFVRILKISRTATISLQLLQTMSIIIQNLKSEHAIYYLFSTEHMNKLITYAFDFRNDELLSYYISFLRAISGKLNKNTISLLVKTQNDKVVSFPMYVEA
IQFAFHEENMIRTAVRALTLNVYHVGDDYVNRFITSPPHEEYFSNLVTFFRKQCINLNELVFETMRSTELSTSTILAAVDEIEDNLYYISDVVSAGIPDVGRLITDNILR
HLIFPLLLPSLRIEAVNGIQIGAVTSLYLLCCILRIVKIKDLANTISAAFFCPLDAFSPHCEGTINGNITQLCCENRSQSSGGDDVVRQPLDAESLRKEVLDSSAPKTEP
EDGTVKIDCSGSQLELRKALLSHITTGDDVQVLGALSVLATLLQTKELDESMLDALGILPQRKQHKKLLLEALVGEDSGEEQLFSSENTSVKGGIDIELDGYLSKLKDYG
ISCFIKVGASPRVHRFEVLDALVSLFCRSNISAEILWDGGWLLRQLLPYSEAEFNSHHLKLLKNSYKYCATELLQEARGIWSDFLIILLYDEWKKCKRAIEAPSPRKEPK
CMLLHSAKASVVDTVPPESSFAAGKKMSELVKVFVLLHQLQSFSLGKALSEQPGIDPPSEISESSRAKVAGLDVSGPKAGADLRLDGAVPCRIAFERGKERHFYFLGISM
GTSGWIILAEELPSKLNCGIIRVAAPLAGSNPRIDEKHSRWLHLRIRPSTLPFLDPAKHGTPLKLKTKPFVDGRWILAFQDEDTCKSALSMVMEEINLQSKEVERRLKPL
VDLERAVDSSDASLCSAKTLTSNTTPNLL