; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001812 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001812
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationChr11:636231..644065
RNA-Seq ExpressionHG10001812
SyntenyHG10001812
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044617.1 mis18-binding protein 1-like isoform X1 [Cucumis melo var. makuwa]0.0e+0084.01Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
        LT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNEITN VISEPSCSL  S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
Subjt:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI
        VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA + 
Subjt:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI

Query:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE
        + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+DDE
Subjt:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE

Query:  VIMLNILSTISGR
        VIMLNILSTISGR
Subjt:  VIMLNILSTISGR

KGN53109.2 hypothetical protein Csa_015143 [Cucumis sativus]0.0e+0083.5Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+ DGFNPK   SE PQSP R VDSA +ISA    FPLIV+NQ  D EV+NS TSASA  +PETSV KMV+CDSAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLD+EL KEPLKVDAVHDF TL   EDG QDVA+DE + KDFA S+LS DGNQDC KEELV+E QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML  EEKIADQQ NDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNGEGIGIVC
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC

Query:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS
        PTRSMQMKVNKSHEPD+G KKAK+SRR+ARE  + EMH+N+GN+NE+DKVNGRQ+NAEGNKIVYSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYS
Subjt:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS

Query:  SLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLR
        SLT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+ GD+EITN VISEPSCSL  S DSD+DKYYHSIQRPAF VEGEPNFDSGPPEDGLEYLR
Subjt:  SLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLR

Query:  RVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYN
        RVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA +
Subjt:  RVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYN

Query:  INSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD
         +SHQ +E + STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D+
Subjt:  INSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD

Query:  EVIMLNILSTISGRYFGQLEN
        EVIMLNILSTISGRYF Q EN
Subjt:  EVIMLNILSTISGRYFGQLEN

TYK16972.1 mis18-binding protein 1-like isoform X1 [Cucumis melo var. makuwa]0.0e+0084.01Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
        LT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNEITN VISEPSCSL  S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
Subjt:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI
        VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA + 
Subjt:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI

Query:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE
        + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+DDE
Subjt:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE

Query:  VIMLNILSTISGR
        VIMLNILSTISGR
Subjt:  VIMLNILSTISGR

XP_008454478.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103494875 [Cucumis melo]0.0e+0083.75Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KK  +SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
        LT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNEITN VISEPSCSL  S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
Subjt:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI
        VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA + 
Subjt:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI

Query:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE
        + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+DDE
Subjt:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE

Query:  VIMLNILSTISGRYFGQLEN
        VIMLNILSTISGRYFGQ EN
Subjt:  VIMLNILSTISGRYFGQLEN

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]0.0e+0087.29Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+GDGFNPK S SEN QS C+P+DSAFKISA DK FPLIV+NQ QD EV+NSA SAS   NPETSVHK     SAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVELRKEPLKVDAVHDFETL AVEDG QDVAID E EKDFA S+LSFDGN DC+KEELVQEVQLAAD     KEAF RTEEL KKETD ESILE+KK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLE+LDAMLVPGD+IHLEKGNNPPSS G VD CSKT+L DEEKIAD+QNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
         RSMQMKVNKSHEPDRG KKAKRSRR+AREA V EM++NLGNVNELDKVNGRQK AEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLP VAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LT-----MKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGL
        LT     MKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNE TN VI EPSCS SVS D DEDKYY SIQRPAFLVEGEPNFDSGPPEDGL
Subjt:  LT-----MKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGL

Query:  EYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLP
        EYLRRVRWEASHIPNVT+AKVDRSNFKKEQSVYMPVIP IAKCPDHLLPSKEWENAFLADFS LR+ALSHSEEF QSDFILHEKIDS IPD +AQP VLP
Subjt:  EYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLP

Query:  AYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE
        AYNI+SHQ EE N STSAKENSCNDYPSLSAISKMNSVF VSSL+KRINSLETQTTLS+TDCLWLFALSAAVDTPLD DTCAAFRSLLRKCASLRA+KTE
Subjt:  AYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE

Query:  LDDEVIMLNILSTISGRYFGQLEN
        LDDEVIMLNILSTISGRYFGQ EN
Subjt:  LDDEVIMLNILSTISGRYFGQLEN

TrEMBL top hitse value%identityAlignment
A0A0A0KXG5 Uncharacterized protein0.0e+0083.5Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEISSD+ DGFNPK   SE PQSP R VDSA +ISA    FPLIV+NQ  D EV+NS TSASA  +PETSV KMV+CDSAC SSENG N GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLD+EL KEPLKVDAVHDF TL   EDG QDVA+DE + KDFA S+LS DGNQDC KEELV+E QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML  EEKIADQQ NDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNGEGIGIVC
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQ-NDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVC

Query:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS
        PTRSMQMKVNKSHEPD+G KKAK+SRR+ARE  + EMH+N+GN+NE+DKVNGRQ+NAEGNKIVYSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYS
Subjt:  PTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYS

Query:  SLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLR
        SLT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+ GD+EITN VISEPSCSL  S DSD+DKYYHSIQRPAF VEGEPNFDSGPPEDGLEYLR
Subjt:  SLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLR

Query:  RVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYN
        RVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA +
Subjt:  RVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYN

Query:  INSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD
         +SHQ +E + STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+D+
Subjt:  INSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD

Query:  EVIMLNILSTISGRYFGQLEN
        EVIMLNILSTISGRYF Q EN
Subjt:  EVIMLNILSTISGRYFGQLEN

A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948750.0e+0083.75Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KK  +SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
        LT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNEITN VISEPSCSL  S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
Subjt:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI
        VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA + 
Subjt:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI

Query:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE
        + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+DDE
Subjt:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE

Query:  VIMLNILSTISGRYFGQLEN
        VIMLNILSTISGRYFGQ EN
Subjt:  VIMLNILSTISGRYFGQLEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X10.0e+0084.01Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
        LT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNEITN VISEPSCSL  S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
Subjt:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI
        VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA + 
Subjt:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI

Query:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE
        + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+DDE
Subjt:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE

Query:  VIMLNILSTISGR
        VIMLNILSTISGR
Subjt:  VIMLNILSTISGR

A0A5D3CZJ0 Mis18-binding protein 1-like isoform X10.0e+0084.01Show/hide
Query:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI
        MADEI+SD+ DGFNPK   SENPQSPCRPVDSA  ISA    FPLIV+N+  DCEV+N+ TSAS   NPE+SV KMV+CDSAC SSENG + GSLVVGKI
Subjt:  MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKI

Query:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK
        QNLDVEL KE LKVDAVHDFETL   ED  Q+VA+DE + KDFA S+LSFDGNQDC KEELVQE QLAAD     KEAF RTE+L KKETDSESILEMKK
Subjt:  QNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKK

Query:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP
        KLLLEK+DAMLVPGDEIHL++G+NPPSSGGIVDGC KTML DEEKIADQQNDSE MNVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+GIGIVCP
Subjt:  KLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCP

Query:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS
        TRSMQM+V KSHEPD+G KKAK+SRR+ARE  + EMH+N+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKELLPVVAREYSS
Subjt:  TRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSS

Query:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
        LT+K GSTSDPRQPLVKREEASSIIREGCSESLDGEIED+EGDNEITN VISEPSCSL  S DSD+DKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
Subjt:  LTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI
        VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CP+HLLPSKEWENAFLADFSKLRQALSHS EE M+SDFILHEKID ++P+ +AQP VLPA + 
Subjt:  VRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHS-EEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNI

Query:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE
        + HQPEE N STSAKE SCNDYPSLSAISKMN +F VSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTE+DDE
Subjt:  NSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE

Query:  VIMLNILSTISGR
        VIMLNILSTISGR
Subjt:  VIMLNILSTISGR

A0A6J1F307 uncharacterized protein LOC1114392135.4e-28574.45Show/hide
Query:  MADEISSDHGDGFNPKLSLSE-NPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEV-LNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVG
        MA+ +SS  GDGF+ K S SE + +SP  P          + KFPLIV+N    CEV +NS++SAS   N ETSV KMVVCD   ASSENG N GSL V 
Subjt:  MADEISSDHGDGFNPKLSLSE-NPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEV-LNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVG

Query:  KIQNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFAT-SLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILE
        + + LDVEL +E  KVDAVHDFE +GAVEDGNQ+VA+DE E KDF T S+ SFDGNQDC K+E+VQEVQ +   EA+ KEAF RTEEL +KE D+ESILE
Subjt:  KIQNLDVELRKEPLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFAT-SLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILE

Query:  MKKKLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGI
        MKKKLLLE+L+AMLVPG+EIHLEK           D C K ML DEEKIA QQNDSEN +VLR+SHLSL NSLKIEVIDETALVEPVHVSKIGNGE I I
Subjt:  MKKKLLLEKLDAMLVPGDEIHLEKGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGI

Query:  VCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVARE
        +CPTRSMQ+ V+KSHEP+R  KKA+RSRRRAREA + E+H+NLGNVNELDK     KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  VCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVARE

Query:  YSSLT-----MKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPE
        YSSLT     MK GSTS PRQ   K EEASS IR+GCSESLD EIED+EGDNEITN    +PSC LSVS DS++D+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLT-----MKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPE

Query:  DGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQP-
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP +LLPSKEWE+AFLADFSKLRQ LS  E  MQSDFI HEKIDSV PD + QP 
Subjt:  DGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQP-

Query:  IVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRA
        IVLPA NI+S QPEEPN+STS+KENS N+YPSLSAISKMNSVF VSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLRA
Subjt:  IVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRA

Query:  EKTELDDEVIMLNILSTISGRYFGQLEN
        EK+ELDDEVIMLNIL+TISGRYFGQ EN
Subjt:  EKTELDDEVIMLNILSTISGRYFGQLEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 27.3e-1328.8Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      P+   P+ +W+   +A FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF

Query:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV
           +++DS V             + +      +     +  E+   DY      P LS +S+MN   +V+S+ + +++   +   +     WL+AL A +
Subjt:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV

Query:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q
Subjt:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ

O42260 Gem-associated protein 26.6e-1429.32Show/hide
Query:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQAL-------------------SHSEE
        S PP    EYLRRV+ EA+  P+V +A++D    +K+Q+V +  +      PD   PS  W+   +A FS +RQ+L                   S  +E
Subjt:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQAL-------------------SHSEE

Query:  FMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD
             F L E++ S   D  A      A N  S  P                 P LS +S+M+     S L   +N  E +         WL+AL A ++
Subjt:  FMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD

Query:  TPLDGDTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILSTISGRYFGQ
         PL  +  +  R L R+C+ +RA  + + DD V  LN+   + GRYF Q
Subjt:  TPLDGDTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILSTISGRYFGQ

Q54KN2 Gem-associated protein 26.4e-1725.23Show/hide
Query:  QRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK--EQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQ
        Q  AF V  E   D   P  G EYL+RV+W ++  P+V VA +D S  K     + Y  + P+I KC   LLP+  WE  FL DFS+ RQ L + +    
Subjt:  QRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK--EQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQ

Query:  S---------------------------------------------------DFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSST---------
        S                                                   D  + +  D+   D   +      YN N  + EE              
Subjt:  S---------------------------------------------------DFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSST---------

Query:  -------------SAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD
                     S K+ +  + P++  + +++ V +V+ +   I  LE +   ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T L+D
Subjt:  -------------SAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDD

Query:  -EVIMLNILSTISGRYFGQLE
          +  +NIL TI  +YF QLE
Subjt:  -EVIMLNILSTISGRYFGQLE

Q9CQQ4 Gem-associated protein 21.1e-1329.13Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      P+   P+ +W+   +A FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF

Query:  ILHEKIDSVIPDFVAQPIV-----LPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFAL
           +++DS     VA P          + +      E  +  S +E+   DY      P LS +S+MN   +++S+ + +++   +   +     W +AL
Subjt:  ILHEKIDSVIPDFVAQPIV-----LPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFAL

Query:  SAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ
         A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q
Subjt:  SAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ

Q9QZP1 Gem-associated protein 29.5e-1328.4Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K++QSV +  +      P+   P+ +W+   +  FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDF

Query:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV
           +++DS V             + +      E  +  S  E+   DY      P LS +S+MN   +++S+ + +++   +   +     W +AL A +
Subjt:  ILHEKIDS-VIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDY------PSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAV

Query:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q
Subjt:  DTPLDGDTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILSTISGRYFGQ

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related1.5e-6631.96Show/hide
Query:  ATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKKKLLLEKLDAMLVPG---DEIHLEKGNNPPSSGGIVDGCSKTML
        A + +S DG +   + +  +++    D +A+  +    T E      ++  + E+K+    +  +   V G   + ++ E+      +  +++   + +L
Subjt:  ATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKKKLLLEKLDAMLVPG---DEIHLEKGNNPPSSGGIVDGCSKTML

Query:  SDEEKIADQQNDSENMNVLRRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEGIGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYN
        ++ E +      S +++ L +   ++ N + KIE++D TALV+ VH                                    KR      E + P     
Subjt:  SDEEKIADQQNDSENMNVLRRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEGIGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRRRAREANVPEMHYN

Query:  LGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIED
        +G+   + + +G + N +  + +Y+RK +E++RF ++  Q+ LW  +   +LP V  EY SL              VK  ++S       S  + G  E 
Subjt:  LGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIED

Query:  VEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKEQSVYMPVIPAIAKC
           +N  T     +P      + D+D+   Y+SI RPAF V+GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKEQSVYMP+IP I KC
Subjt:  VEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKEQSVYMPVIPAIAKC

Query:  PDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSS
        P++LLP KEWE++ L DF  LRQ L+ S    +         D +I     + +++  +N + H  E+ +      +           I  M+SV  VS 
Subjt:  PDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSS

Query:  LRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILSTISGRYFGQL
        L+KRI  +E ++ L  +DC W+ AL A+++TPLD DTCA  R LLRKCAS+RAE + E+ DE  + M N+L TI+GRYFGQ+
Subjt:  LRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILSTISGRYFGQL

AT2G42510.1 FUNCTIONS IN: molecular_function unknown8.8e-2230.84Show/hide
Query:  DGCSKTMLSDEEKIADQQNDSENM-----NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEG-IGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRR
        DG +    SD  KI    N S++       V  + +  +  S+ I+++D+TAL + V   K G       +   T     + +K    ++ + K   S  
Subjt:  DGCSKTMLSDEEKIADQQNDSENM-----NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEG-IGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRR

Query:  RAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIR
          R A+       + +     ++NG+Q      +I+YSR  ME++R                 LLP +  EY  L                 +   SI+ 
Subjt:  RAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIR

Query:  EGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQ
        +                      V+ E       + D+D+   Y+SI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KEQ
Subjt:  EGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQ

Query:  SVYMPVIP
        SVYMP IP
Subjt:  SVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown4.8e-3630.08Show/hide
Query:  DGCSKTMLSDEEKIADQQNDSENM-----NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEG-IGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRR
        DG +    SD  KI    N S++       V  + +  +  S+ I+++D+TAL + V   K G       +   T     + +K    ++ + K   S  
Subjt:  DGCSKTMLSDEEKIADQQNDSENM-----NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEG-IGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRR

Query:  RAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIR
          R A+       + +     ++NG+Q      +I+YSR  ME++R+ ++A Q++LW  +   LLP +  EY      I +     Q  V +EE +    
Subjt:  RAREANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIR

Query:  EGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQ
                                            D+D+   Y+SI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KEQ
Subjt:  EGCSESLDGEIEDVEGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KEQ

Query:  SVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLS
        SVYMP IP          P+       L  F    + L+HS               S IP F  Q   L  + +         S T  K    + +    
Subjt:  SVYMPVIPAIAKCPDHLLPSKEWENAFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLS

Query:  AISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDEVIMLNILS
                             E ++ L  +DC W+ AL A+VDTP D DT A  R+L+RKCASLRA    L+  V+ +N LS
Subjt:  AISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDEVIMLNILS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATGAGATAAGTTCTGACCATGGCGATGGGTTTAATCCAAAATTATCACTATCTGAGAACCCCCAATCTCCTTGTCGACCGGTTGATTCTGCCTTTAAGATCTC
TGCCCACGACAAGAAGTTCCCTTTGATCGTCACGAATCAAAAGCAGGACTGTGAAGTCCTAAACAGTGCGACTTCCGCTTCTGCCCACGTGAACCCAGAAACTTCTGTCC
ACAAGATGGTCGTTTGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGCAAATACGGGAAGTCTGGTGGTGGGCAAGATTCAGAATCTTGATGTGGAGCTCAGAAAAGAA
CCTCTCAAGGTGGACGCTGTCCATGATTTTGAAACGCTCGGTGCTGTGGAAGATGGTAATCAAGATGTTGCGATCGATGAAGAAGAAGAGAAAGATTTTGCAACAAGTCT
CCTAAGTTTTGATGGGAATCAAGATTGTACGAAGGAAGAACTTGTTCAAGAAGTTCAGTTGGCTGCTGACACTGAAGCCAACGGAAAAGAAGCCTTTCCACGAACAGAGG
AGTTGTTTAAGAAAGAAACTGATTCTGAGAGCATTTTGGAAATGAAAAAGAAATTACTATTGGAAAAACTCGATGCCATGTTGGTTCCTGGAGATGAAATTCATCTAGAG
AAGGGAAACAATCCCCCTAGCTCAGGAGGGATTGTGGATGGTTGCAGCAAAACGATGCTTAGTGATGAGGAGAAGATTGCTGATCAGCAAAATGATTCTGAAAACATGAA
TGTTCTCAGACGAAGTCATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAAACTGCATTAGTTGAACCGGTTCATGTCTCCAAAATTGGAAATGGAGAAG
GGATTGGTATTGTTTGTCCAACAAGGTCAATGCAGATGAAGGTGAACAAATCCCATGAACCTGATAGAGGGGTGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAA
GCTAATGTCCCTGAGATGCATTATAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAGAAAAATGCAGAAGGAAACAAGATAGTGTATTCGAGGAAAGA
TATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGATTGTGGAAAGCTATATGTAAGGAACTTTTGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAATGA
AGATAGGCTCCACCTCTGATCCTAGGCAGCCTTTAGTGAAGAGAGAAGAAGCCTCTTCAATTATAAGGGAGGGATGTTCAGAAAGCTTGGATGGTGAGATAGAGGACGTG
GAAGGTGATAATGAAATTACAAACGTTGTAATTTCAGAACCCTCTTGCAGTCTTAGTGTCAGTGGAGATAGTGATGAGGATAAATATTACCACAGTATTCAGAGACCTGC
CTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCATATTCCAAATGTGACGGTGG
CAAAAGTTGATAGAAGTAATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTTATTCCTGCAATTGCCAAGTGCCCCGACCATTTACTGCCTTCAAAAGAGTGGGAGAAT
GCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGCTCTATCACACTCTGAAGAATTTATGCAGTCTGATTTCATTCTCCATGAAAAGATCGATTCTGTAATTCCGGACTT
CGTTGCTCAGCCAATTGTCTTGCCTGCCTACAACATCAACTCGCATCAACCTGAGGAACCGAATAGCAGTACTTCAGCAAAGGAAAATAGTTGCAACGATTATCCATCTC
TATCAGCAATCTCAAAGATGAATTCGGTGTTTAGTGTTTCATCGTTGAGGAAGCGTATAAACTCATTAGAAACACAGACAACACTGTCAAGGACTGATTGTCTTTGGCTG
TTTGCTTTAAGTGCAGCAGTTGATACTCCTCTGGATGGAGATACGTGTGCCGCTTTCAGAAGTCTGCTTCGGAAATGTGCCAGCTTGCGGGCTGAGAAGACCGAGCTTGA
CGACGAGGTGATAATGCTCAATATTCTTTCCACCATTTCCGGAAGGTACTTTGGACAGTTGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGATGAGATAAGTTCTGACCATGGCGATGGGTTTAATCCAAAATTATCACTATCTGAGAACCCCCAATCTCCTTGTCGACCGGTTGATTCTGCCTTTAAGATCTC
TGCCCACGACAAGAAGTTCCCTTTGATCGTCACGAATCAAAAGCAGGACTGTGAAGTCCTAAACAGTGCGACTTCCGCTTCTGCCCACGTGAACCCAGAAACTTCTGTCC
ACAAGATGGTCGTTTGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGCAAATACGGGAAGTCTGGTGGTGGGCAAGATTCAGAATCTTGATGTGGAGCTCAGAAAAGAA
CCTCTCAAGGTGGACGCTGTCCATGATTTTGAAACGCTCGGTGCTGTGGAAGATGGTAATCAAGATGTTGCGATCGATGAAGAAGAAGAGAAAGATTTTGCAACAAGTCT
CCTAAGTTTTGATGGGAATCAAGATTGTACGAAGGAAGAACTTGTTCAAGAAGTTCAGTTGGCTGCTGACACTGAAGCCAACGGAAAAGAAGCCTTTCCACGAACAGAGG
AGTTGTTTAAGAAAGAAACTGATTCTGAGAGCATTTTGGAAATGAAAAAGAAATTACTATTGGAAAAACTCGATGCCATGTTGGTTCCTGGAGATGAAATTCATCTAGAG
AAGGGAAACAATCCCCCTAGCTCAGGAGGGATTGTGGATGGTTGCAGCAAAACGATGCTTAGTGATGAGGAGAAGATTGCTGATCAGCAAAATGATTCTGAAAACATGAA
TGTTCTCAGACGAAGTCATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAAACTGCATTAGTTGAACCGGTTCATGTCTCCAAAATTGGAAATGGAGAAG
GGATTGGTATTGTTTGTCCAACAAGGTCAATGCAGATGAAGGTGAACAAATCCCATGAACCTGATAGAGGGGTGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAA
GCTAATGTCCCTGAGATGCATTATAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAGAAAAATGCAGAAGGAAACAAGATAGTGTATTCGAGGAAAGA
TATGGAAGCACTGAGGTTTGTGAATGTTGCAGAACAGAGGAGATTGTGGAAAGCTATATGTAAGGAACTTTTGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAATGA
AGATAGGCTCCACCTCTGATCCTAGGCAGCCTTTAGTGAAGAGAGAAGAAGCCTCTTCAATTATAAGGGAGGGATGTTCAGAAAGCTTGGATGGTGAGATAGAGGACGTG
GAAGGTGATAATGAAATTACAAACGTTGTAATTTCAGAACCCTCTTGCAGTCTTAGTGTCAGTGGAGATAGTGATGAGGATAAATATTACCACAGTATTCAGAGACCTGC
CTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCATATTCCAAATGTGACGGTGG
CAAAAGTTGATAGAAGTAATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTTATTCCTGCAATTGCCAAGTGCCCCGACCATTTACTGCCTTCAAAAGAGTGGGAGAAT
GCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGCTCTATCACACTCTGAAGAATTTATGCAGTCTGATTTCATTCTCCATGAAAAGATCGATTCTGTAATTCCGGACTT
CGTTGCTCAGCCAATTGTCTTGCCTGCCTACAACATCAACTCGCATCAACCTGAGGAACCGAATAGCAGTACTTCAGCAAAGGAAAATAGTTGCAACGATTATCCATCTC
TATCAGCAATCTCAAAGATGAATTCGGTGTTTAGTGTTTCATCGTTGAGGAAGCGTATAAACTCATTAGAAACACAGACAACACTGTCAAGGACTGATTGTCTTTGGCTG
TTTGCTTTAAGTGCAGCAGTTGATACTCCTCTGGATGGAGATACGTGTGCCGCTTTCAGAAGTCTGCTTCGGAAATGTGCCAGCTTGCGGGCTGAGAAGACCGAGCTTGA
CGACGAGGTGATAATGCTCAATATTCTTTCCACCATTTCCGGAAGGTACTTTGGACAGTTGGAAAATTGA
Protein sequenceShow/hide protein sequence
MADEISSDHGDGFNPKLSLSENPQSPCRPVDSAFKISAHDKKFPLIVTNQKQDCEVLNSATSASAHVNPETSVHKMVVCDSACASSENGANTGSLVVGKIQNLDVELRKE
PLKVDAVHDFETLGAVEDGNQDVAIDEEEEKDFATSLLSFDGNQDCTKEELVQEVQLAADTEANGKEAFPRTEELFKKETDSESILEMKKKLLLEKLDAMLVPGDEIHLE
KGNNPPSSGGIVDGCSKTMLSDEEKIADQQNDSENMNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEGIGIVCPTRSMQMKVNKSHEPDRGVKKAKRSRRRARE
ANVPEMHYNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELLPVVAREYSSLTMKIGSTSDPRQPLVKREEASSIIREGCSESLDGEIEDV
EGDNEITNVVISEPSCSLSVSGDSDEDKYYHSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPDHLLPSKEWEN
AFLADFSKLRQALSHSEEFMQSDFILHEKIDSVIPDFVAQPIVLPAYNINSHQPEEPNSSTSAKENSCNDYPSLSAISKMNSVFSVSSLRKRINSLETQTTLSRTDCLWL
FALSAAVDTPLDGDTCAAFRSLLRKCASLRAEKTELDDEVIMLNILSTISGRYFGQLEN