; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G000670 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G000670
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationchr07:671977..680130
RNA-Seq ExpressionLsi07G000670
SyntenyLsi07G000670
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044617.1 mis18-binding protein 1-like isoform X1 [Cucumis melo var. makuwa]0.0e+0079.86Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
        LT+K GSTSDPRQPLVKREEASSII    +      +     D+    + T+++      S+S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
Subjt:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL

Query:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY
        RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA 
Subjt:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY

Query:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD
        + + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D
Subjt:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD

Query:  DEVIMLNILSTISGR
        DEVIMLNILSTISGR
Subjt:  DEVIMLNILSTISGR

KGN53109.2 hypothetical protein Csa_015143 [Cucumis sativus]0.0e+0079.67Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+ DGFNPK   SE PQSP R VDSA +ISA    FPLIV+NQ  D EV+NS TSASA  +PETSV KMV+CDSAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLD+EL KEPLKVDAVHDF TL   EDG QDVA+DE + KDFA S+LS DGNQDC KEELV+E QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML  EEKIADQQ NDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNGEGIGIVC
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC

Query:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS
        PTRSMQMKVNKSHEPD+G KKAK+SRR+ARE  + EMH+N+GN+NE+DKVNGRQ+NAEGNKIVYSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYS
Subjt:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS

Query:  SLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEY
        SLT+K GSTSDPRQPLVKREEASSII    +      +     D     + T+++      S+S DSD+DKYYHSIQRPAF VEGEPNFDSGPPEDGLEY
Subjt:  SLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEY

Query:  LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPA
        LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA
Subjt:  LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPA

Query:  YNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTEL
         + +SHQ +E + STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+
Subjt:  YNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTEL

Query:  DDEVIMLNILSTISGRYFGQLEN
        D+EVIMLNILSTISGRYF Q EN
Subjt:  DDEVIMLNILSTISGRYFGQLEN

XP_004152287.1 uncharacterized protein LOC101215637 [Cucumis sativus]0.0e+0079.67Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+ DGFNPK   SE PQSP R VDSA +ISA    FPLIV+NQ  D EV+NS TSASA  +PETSV KMV+CDSAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLD+EL KEPLKVDAVHDF TL   EDG QDVA+DE + KDFA S+LS DGNQDC KEELV+E QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML  EEKIADQQ NDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNGEGIGIVC
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC

Query:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS
        PTRSMQMKVNKSHEPD+G KKAK+SRR+ARE  + EMH+N+GN+NE+DKVNGRQ+NAEGNKIVYSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYS
Subjt:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS

Query:  SLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEY
        SLT+K GSTSDPRQPLVKREEASSII    +      +     D     + T+++      S+S DSD+DKYYHSIQRPAF VEGEPNFDSGPPEDGLEY
Subjt:  SLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEY

Query:  LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPA
        LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA
Subjt:  LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPA

Query:  YNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTEL
         + +SHQ +E + STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+
Subjt:  YNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTEL

Query:  DDEVIMLNILSTISGRYFGQLEN
        D+EVIMLNILSTISGRYF Q EN
Subjt:  DDEVIMLNILSTISGRYFGQLEN

XP_008454478.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103494875 [Cucumis melo]0.0e+0079.64Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KK  +SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
        LT+K GSTSDPRQPLVKREEASSII    +      +     D+    + T+++      S+S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
Subjt:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL

Query:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY
        RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA 
Subjt:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY

Query:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD
        + + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D
Subjt:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD

Query:  DEVIMLNILSTISGRYFGQLEN
        DEVIMLNILSTISGRYFGQ EN
Subjt:  DEVIMLNILSTISGRYFGQLEN

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]0.0e+0082.61Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+GDGFNPK S SEN QS C+P+DSAFKISA DK FPLIV+NQ QD EV+NSA SAS   NPETSVHK     SAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVELRKEPLKVDAVHDFETL AVEDG QDVAID E EKDFA S+LSFDGN DC+KEELVQEVQLAAD     KEAF RTEEL KKETD ESILE+KK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLE+LDAMLVPGD+IHLEKGNNPPSS G VD CSKT+L DEEKIAD+QNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
         RSMQMKVNKSHEPDRG KKAKRSRR+AREA V EM++NLGNVNELDKVNGRQK AEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLP VAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LT-----MKIGSTSDPRQPLVKREEASSII----------SVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGE
        LT     MKIGSTSDPRQPLVKREEASSII           +    G     N    + SC +            SVS D DEDKYY SIQRPAFLVEGE
Subjt:  LT-----MKIGSTSDPRQPLVKREEASSII----------SVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGE

Query:  PNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSV
        PNFDSGPPEDGLEYLRRVRWEASHIPNVT+AKVDRSNFKKEQSVYMPVIP IAKCPDHLLPSKEWENAFLADFS LR+ALSHSEEF QSDFILHEKIDS 
Subjt:  PNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSV

Query:  IPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLL
        IPD +AQP VLPAYNI+SHQ EE N STSAKENSCNDYPSLSAISKMNSVF VSSL+KRINSLETQTTLS+TDCLWLFALSAAVDTPLD DTCAAFRSLL
Subjt:  IPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLL

Query:  RKCASLRAEKTELDDEVIMLNILSTISGRYFGQLEN
        RKCASLRA+KTELDDEVIMLNILSTISGRYFGQ EN
Subjt:  RKCASLRAEKTELDDEVIMLNILSTISGRYFGQLEN

TrEMBL top hitse value%identityAlignment
A0A0A0KXG5 Uncharacterized protein0.0e+0079.67Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+ DGFNPK   SE PQSP R VDSA +ISA    FPLIV+NQ  D EV+NS TSASA  +PETSV KMV+CDSAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLD+EL KEPLKVDAVHDF TL   EDG QDVA+DE + KDFA S+LS DGNQDC KEELV+E QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML  EEKIADQQ NDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNGEGIGIVC
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC

Query:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS
        PTRSMQMKVNKSHEPD+G KKAK+SRR+ARE  + EMH+N+GN+NE+DKVNGRQ+NAEGNKIVYSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYS
Subjt:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS

Query:  SLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEY
        SLT+K GSTSDPRQPLVKREEASSII    +      +     D     + T+++      S+S DSD+DKYYHSIQRPAF VEGEPNFDSGPPEDGLEY
Subjt:  SLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEY

Query:  LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPA
        LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA
Subjt:  LRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPA

Query:  YNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTEL
         + +SHQ +E + STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+
Subjt:  YNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTEL

Query:  DDEVIMLNILSTISGRYFGQLEN
        D+EVIMLNILSTISGRYF Q EN
Subjt:  DDEVIMLNILSTISGRYFGQLEN

A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948750.0e+0079.64Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KK  +SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
        LT+K GSTSDPRQPLVKREEASSII    +      +     D+    + T+++      S+S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
Subjt:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL

Query:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY
        RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA 
Subjt:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY

Query:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD
        + + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D
Subjt:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD

Query:  DEVIMLNILSTISGRYFGQLEN
        DEVIMLNILSTISGRYFGQ EN
Subjt:  DEVIMLNILSTISGRYFGQLEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X10.0e+0079.86Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
        LT+K GSTSDPRQPLVKREEASSII    +      +     D+    + T+++      S+S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
Subjt:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL

Query:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY
        RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA 
Subjt:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY

Query:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD
        + + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D
Subjt:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD

Query:  DEVIMLNILSTISGR
        DEVIMLNILSTISGR
Subjt:  DEVIMLNILSTISGR

A0A5D3CZJ0 Mis18-binding protein 1-like isoform X10.0e+0079.86Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
        LT+K GSTSDPRQPLVKREEASSII    +      +     D+    + T+++      S+S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL
Subjt:  LTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYL

Query:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY
        RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA 
Subjt:  RRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAY

Query:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD
        + + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D
Subjt:  NINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELD

Query:  DEVIMLNILSTISGR
        DEVIMLNILSTISGR
Subjt:  DEVIMLNILSTISGR

A0A6J1F307 uncharacterized protein LOC1114392132.3e-26771.37Show/hide
Query:  MADEISSDHGDGFNPKLSLSE-NPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEV-LNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVG
        MA+ +SS  GDGF+ K S SE + +SP  P          + KFPLIV+N    CEV +NS++SAS   N ETSV KMVVCD   ASSENG N GSL V 
Subjt:  MADEISSDHGDGFNPKLSLSE-NPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEV-LNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVG

Query:  KIQNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFAT-SLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILE
        + + LDVEL +E  KVDAVHDFE +GAVEDGNQ+VA+DE E KDF T S+ SFDGNQDC K+E+VQEVQ +   EA+ KEAF RTEEL +KE D+ESILE
Subjt:  KIQNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFAT-SLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILE

Query:  MKKKLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGI
        MKKKLLLE+L+AMLVPG+EIHLEK           D C K ML DEEKIA QQNDSEN +VLR+SHLSL NSLKIEVIDETALVEPVHVSKIGNGE I I
Subjt:  MKKKLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGI

Query:  VCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVARE
        +CPTRSMQ+ V+KSHEP+R  KKA+RSRRRAREA + E+H+NLGNVNELDK     KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  VCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVARE

Query:  YSSLT-----MKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGP
        YSSLT     MK GSTS PRQ   K EEASS I    +      +     D+       ++ K    LSVS DS++D+YY+SIQRPAFLVEGEPNF+SGP
Subjt:  YSSLT-----MKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGP

Query:  PEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQ
        PEDGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP +LLPSKEWE+AFLADFSKLRQ LS  E  MQSDFI HEKIDSV PD + Q
Subjt:  PEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQ

Query:  P-IVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASL
        P IVLPA NI+S QPEEPN+STS+KENS N+YPSLSAISKMNSVF VSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLD DTCA+FRSLLRKCASL
Subjt:  P-IVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASL

Query:  RAEKTELDDEVIMLNILSTISGRYFGQLEN
        RAEK+ELDDEVIMLNIL+TISGRYFGQ EN
Subjt:  RAEKTELDDEVIMLNILSTISGRYFGQLEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 25.6e-1328.8Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      P+   P+ +W+   +A FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF

Query:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV
           +++DS V             + +      +     +  E+   DY      P LS +S+MN   +V+S+ + +++   +   +     WL+AL A +
Subjt:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV

Query:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q
Subjt:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ

O42260 Gem-associated protein 26.6e-1429.32Show/hide
Query:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQAL-------------------SHSEE
        S PP    EYLRRV+ EA+  P+V +A++D    +K+Q+V +  +      PD   PS  W+   +A FS +RQ+L                   S  +E
Subjt:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQAL-------------------SHSEE

Query:  FMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD
             F L E++ S   D  A      A N  S  P                 P LS +S+M+     S L   +N  E +         WL+AL A ++
Subjt:  FMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD

Query:  TPLDGDTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILSTISGRYFGQ
         PL  +  +  R L R+C+ +RA  + + DD V  LN+   + GRYF Q
Subjt:  TPLDGDTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILSTISGRYFGQ

Q54KN2 Gem-associated protein 26.4e-1725.23Show/hide
Query:  QRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK--EQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQ
        Q  AF V  E   D   P  G EYL+RV+W ++  P+V VA +D S  K     + Y  + P+I KC   LLP+  WE  FL DFS+ RQ L + +    
Subjt:  QRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK--EQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQ

Query:  S---------------------------------------------------DFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSST---------
        S                                                   D  + +  D+   D   +      YN N  + EE              
Subjt:  S---------------------------------------------------DFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSST---------

Query:  -------------SAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD
                     S K+ +  + P++  + +++ V +V+ +   I  LE +   ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T L+D
Subjt:  -------------SAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD

Query:  -EVIMLNILSTISGRYFGQLE
          +  +NIL TI  +YF QLE
Subjt:  -EVIMLNILSTISGRYFGQLE

Q9CQQ4 Gem-associated protein 21.1e-1329.13Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      P+   P+ +W+   +A FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF

Query:  ILHEKIDSVIPDFVAQPIV-----LPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFAL
           +++DS     VA P          + +      E  +  S +E+   DY      P LS +S+MN   +++S+ + +++   +   +     W +AL
Subjt:  ILHEKIDSVIPDFVAQPIV-----LPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFAL

Query:  SAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ
         A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q
Subjt:  SAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ

Q9QZP1 Gem-associated protein 27.3e-1328.4Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      P+   P+ +W+   +  FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF

Query:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV
           +++DS V             + +      E  +  S  E+   DY      P LS +S+MN   +++S+ + +++   +   +     W +AL A +
Subjt:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV

Query:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q
Subjt:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related1.5e-6631.51Show/hide
Query:  ATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKKKLLLEKLDAMLVPG---DEIHLEKGNNPPSSGGIVDGCSKTML
        A + +S DG +   + +  +++    D +A+  +    T E      ++  + E+K+    +  +   V G   + ++ E+      +  +++   + +L
Subjt:  ATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKKKLLLEKLDAMLVPG---DEIHLEKGNNPPSSGGIVDGCSKTML

Query:  SDEEKIADQQNDSENMNVLRRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEGIGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYN
        ++ E +      S +++ L +   ++ N + KIE++D TALV+ VH                                    KR      E + P     
Subjt:  SDEEKIADQQNDSENMNVLRRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEGIGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYN

Query:  LGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNI
        +G+   + + +G + N +  + +Y+RK +E++RF ++  Q+ LW  +   +LP V  EY SL              VK  ++S    V       N  N+
Subjt:  LGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNI

Query:  FANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKEQSVYMPVIPAIA
           + +      DY            +D++  Y+SI RPAF V+GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKEQSVYMP+IP I 
Subjt:  FANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKEQSVYMPVIPAIA

Query:  KCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSV
        KCP++LLP KEWE++ L DF  LRQ L+ S    +         D +I     + +++  +N + H  E+ +      +           I  M+SV  V
Subjt:  KCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSV

Query:  SSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILSTISGRYFGQL
        S L+KRI  +E ++ L  +DC W+ AL A+++TPLD DTCA  R LLRKCAS+RAE + E+ DE  + M N+L TI+GRYFGQ+
Subjt:  SSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILSTISGRYFGQL

AT2G42510.1 FUNCTIONS IN: molecular_function unknown2.3e-2269.01Show/hide
Query:  SDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQSVYMPVIP
        ++++  Y+SI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KEQSVYMP IP
Subjt:  SDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown4.8e-3629.96Show/hide
Query:  DGCSKTMLSDEEKIADQQNDSENM-----NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEG-IGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRR
        DG +    SD  KI    N S++       V  + +  +  S+ I+++D+TAL + V   K G       +   T     + +K    ++ + K   S  
Subjt:  DGCSKTMLSDEEKIADQQNDSENM-----NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEG-IGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRR

Query:  RAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIS
          R A+       + +     ++NG+Q      +I+YSR  ME++R+ ++A Q++LW  +   LLP +  EY      I +     Q  V +EE +    
Subjt:  RAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIS

Query:  VVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-K
                                              D+D+   Y+SI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ K
Subjt:  VVAARGRWNHMNIFANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-K

Query:  EQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPS
        EQSVYMP IP          P+       L  F    + L+HS               S IP F  Q   L  + +         S T  K    + +  
Subjt:  EQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPS

Query:  LSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDEVIMLNILS
                               E ++ L  +DC W+ AL A+VDTP D DT A  R+L+RKCASLRA    L+  V+ +N LS
Subjt:  LSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDEVIMLNILS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATGAGATAAGTTCTGACCATGGCGATGGGTTTAATCCAAAATTATCACTATCTGAGAACCCCCAATCTCCTTGTCGACCGGTTGATTCTGCCTTTAAGATCTC
TGCCCACGACAAGAAGTTCCCTTTGATCGTCACGAATCAAAAGCAGGACTGTGAAGTCCTAAACAGTGCGACTTCCGCTTCTGCCCACGTGAACCCAGAAACTTCTGTCC
ACAAGATGGTCGTTTGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGCAAATACGGGAAGTCTGGTGGTGGGCAAGATTCAGAATCTTGATGTGGAGCTCAGAAAAGAA
CCTCTCAAGGTGGACGCTGTCCATGATTTTGAAACGCTCGGTGCTGTGGAAGATGGTAATCAAGATGTTGCGATCGATGAAGAAGAAGAGAAAGATTTTGCAACAAGTCT
CCTAAGTTTTGATGGGAATCAAGATTGTACGAAGGAAGAACTTGTTCAAGAAGTTCAGTTGGCTGCTGACACTGAAGCCAACGGAAAAGAAGCCTTTCCACGAACAGAGG
AGTTGTTTAAGAAAGAAACTGATTCTGAGAGCATTTTGGAAATGAAAAAGAAATTACTATTGGAAAAACTCGATGCCATGTTGGTTCCTGGAGATGAAATTCATCTAGAG
AAGGGAAACAATCCCCCTAGCTCAGGAGGGATTGTGGATGGTTGCAGCAAAACGATGCTTAGTGATGAGGAGAAGATTGCTGATCAGCAAAATGATTCTGAAAACATGAA
TGTTCTCAGACGAAGTCATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAAACTGCATTAGTTGAACCGGTTCATGTCTCCAAAATTGGAAATGGAGAAG
GGATTGGTATTGTTTGTCCAACAAGGTCAATGCAGATGAAGGTGAACAAATCCCATGAACCTGATAGAGGGGTGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAA
GCTAATGTCCCTGAGATGCATTATAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAGAAAAATGCAGAAGGAAACAAGATAGTGTATTCGAGGAAAGA
TATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGATTGTGGAAAGCTATATGTAAGGAACTTTTGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAATGA
AGATAGGCTCCACCTCTGATCCTAGGCAGCCTTTAGTGAAGAGAGAAGAAGCCTCTTCAATTATAAGTGTTGTTGCTGCAAGGGGAAGATGGAATCATATGAATATTTTT
GCAAATGATAGTTCTTGTCTGTACAAATATACCGATTACATAAAATTTCTGTTCCTTCTTAGTGTCAGTGGAGATAGTGATGAGGATAAATATTACCACAGTATTCAGAG
ACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCATATTCCAAATGTGA
CGGTGGCAAAAGTTGATAGAAGTAATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTTATTCCTGCAATTGCCAAGTGCCCCGACCATTTACTGCCTTCAAAAGAGTGG
GAGAATGCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGCTCTATCACACTCTGAAGAATTTATGCAGTCTGATTTCATTCTCCATGAAAAGATCGATTCTGTAATTCC
GGACTTCGTTGCTCAGCCAATTGTCTTGCCTGCCTACAACATCAACTCGCATCAACCTGAGGAACCGAATAGCAGTACTTCAGCAAAGGAAAATAGTTGCAACGATTATC
CATCTCTATCAGCAATCTCAAAGATGAATTCGGTGTTTAGTGTTTCATCGTTGAGGAAGCGTATAAACTCATTAGAAACACAGACAACACTGTCAAGGACTGATTGTCTT
TGGCTGTTTGCTTTAAGTGCAGCAGTTGATACTCCTCTGGATGGAGATACGTGTGCCGCTTTCAGAAGTCTGCTTCGGAAATGTGCCAGCTTGCGGGCTGAGAAGACCGA
GCTTGACGACGAGGTGATAATGCTCAATATTCTTTCCACCATTTCCGGAAGGTACTTTGGACAGTTGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
TGCTAAACCCTCTTGCTTTTTTCTTCGAGCAATGGCGGATGAGATAAGTTCTGACCATGGCGATGGGTTTAATCCAAAATTATCACTATCTGAGAACCCCCAATCTCCTT
GTCGACCGGTTGATTCTGCCTTTAAGATCTCTGCCCACGACAAGAAGTTCCCTTTGATCGTCACGAATCAAAAGCAGGACTGTGAAGTCCTAAACAGTGCGACTTCCGCT
TCTGCCCACGTGAACCCAGAAACTTCTGTCCACAAGATGGTCGTTTGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGCAAATACGGGAAGTCTGGTGGTGGGCAAGAT
TCAGAATCTTGATGTGGAGCTCAGAAAAGAACCTCTCAAGGTGGACGCTGTCCATGATTTTGAAACGCTCGGTGCTGTGGAAGATGGTAATCAAGATGTTGCGATCGATG
AAGAAGAAGAGAAAGATTTTGCAACAAGTCTCCTAAGTTTTGATGGGAATCAAGATTGTACGAAGGAAGAACTTGTTCAAGAAGTTCAGTTGGCTGCTGACACTGAAGCC
AACGGAAAAGAAGCCTTTCCACGAACAGAGGAGTTGTTTAAGAAAGAAACTGATTCTGAGAGCATTTTGGAAATGAAAAAGAAATTACTATTGGAAAAACTCGATGCCAT
GTTGGTTCCTGGAGATGAAATTCATCTAGAGAAGGGAAACAATCCCCCTAGCTCAGGAGGGATTGTGGATGGTTGCAGCAAAACGATGCTTAGTGATGAGGAGAAGATTG
CTGATCAGCAAAATGATTCTGAAAACATGAATGTTCTCAGACGAAGTCATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAAACTGCATTAGTTGAACCG
GTTCATGTCTCCAAAATTGGAAATGGAGAAGGGATTGGTATTGTTTGTCCAACAAGGTCAATGCAGATGAAGGTGAACAAATCCCATGAACCTGATAGAGGGGTGAAAAA
GGCTAAAAGATCGAGGAGGAGGGCAAGGGAAGCTAATGTCCCTGAGATGCATTATAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAGAAAAATGCAG
AAGGAAACAAGATAGTGTATTCGAGGAAAGATATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGATTGTGGAAAGCTATATGTAAGGAACTTTTGCCCGTT
GTGGCAAGGGAATACAGTAGCTTAACAATGAAGATAGGCTCCACCTCTGATCCTAGGCAGCCTTTAGTGAAGAGAGAAGAAGCCTCTTCAATTATAAGTGTTGTTGCTGC
AAGGGGAAGATGGAATCATATGAATATTTTTGCAAATGATAGTTCTTGTCTGTACAAATATACCGATTACATAAAATTTCTGTTCCTTCTTAGTGTCAGTGGAGATAGTG
ATGAGGATAAATATTACCACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTC
AGGTGGGAAGCTTCCCATATTCCAAATGTGACGGTGGCAAAAGTTGATAGAAGTAATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTTATTCCTGCAATTGCCAAGTG
CCCCGACCATTTACTGCCTTCAAAAGAGTGGGAGAATGCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGCTCTATCACACTCTGAAGAATTTATGCAGTCTGATTTCA
TTCTCCATGAAAAGATCGATTCTGTAATTCCGGACTTCGTTGCTCAGCCAATTGTCTTGCCTGCCTACAACATCAACTCGCATCAACCTGAGGAACCGAATAGCAGTACT
TCAGCAAAGGAAAATAGTTGCAACGATTATCCATCTCTATCAGCAATCTCAAAGATGAATTCGGTGTTTAGTGTTTCATCGTTGAGGAAGCGTATAAACTCATTAGAAAC
ACAGACAACACTGTCAAGGACTGATTGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATACTCCTCTGGATGGAGATACGTGTGCCGCTTTCAGAAGTCTGCTTCGGA
AATGTGCCAGCTTGCGGGCTGAGAAGACCGAGCTTGACGACGAGGTGATAATGCTCAATATTCTTTCCACCATTTCCGGAAGGTACTTTGGACAGTTGGAAAATTGAAGC
TATGATGGTGAGATGATGATTGCTTCATTTTCTTGTTTCTCCCAAGTTAGTATCTTCTAGGAGGAGGATGTTCACTCCTGACTTTGAAACCTTGAGGAGTCTGACTGACA
ATGAAATCTGTAAATTAGTTTCTGTTTTGTGAAAAATCAAAGTGTGTTAAAGATCTTATTTCTTTGGATCATGTTATCAATCCATATTTGATGATTTAGTAGAAAGATTA
CCTTATTAATTGAGGGTTTTTTTTTTGGTGAGTTGAATTTGTAGATTGTTGTCTCTATTTGGCTG
Protein sequenceShow/hide protein sequence
MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKIQNLDVELRKE
PLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKKKLLLEKLDAMLVPGDEIHLE
KGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRRRARE
ANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIISVVAARGRWNHMNIF
ANDSSCLYKYTDYIKFLFLLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEW
ENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCL
WLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDEVIMLNILSTISGRYFGQLEN