; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G016350 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G016350
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationCmo_Chr06:11399851..11404875
RNA-Seq ExpressionCmoCh06G016350
SyntenyCmoCh06G016350
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597639.1 Gem-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0095.78Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHPD---------EMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDE
        MANGLSSGDGDGFSRKF ASE HAR PFHPD         EMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDE
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHPD---------EMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDE

Query:  TRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
        TRILDV+LGEESFKVDAVHDFEMIG+VEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  TRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEAMLVPGEEIHLEKDNCGK ML+DEEKI+GQQNDSENT+VLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI CPTRSMQINVS
Subjt:  KKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTS
        KSHEPER+GKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAI KELMPVVAREYSSLTSSNYPMKTGSTS
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSC LSVSEDSEDD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPN
        NV+VAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFS LRQVLSCPEGLMQSDFIFHEKI+SVSPDSIDQPS+VLPAN+IDSQQPEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPN

Query:  ASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSEL+DEVIMLNILAT
Subjt:  ASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

KAG7029080.1 Gem-associated protein 2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0095.78Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHPD---------EMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDE
        MANGLSSGDGDGFSRKF ASE HAR PFHPD         EMKFPLIVSNPSLQCEVRMNSSSSASPEE AETSVEKMVVCDWISASSENGGNMGSLVDE
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHPD---------EMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDE

Query:  TRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
        TRILDV+LGEESFKVDAVHDFEMIG+VEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  TRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEAMLVPGEEIHLEKDNCGK ML+DEEKI+GQ+NDSENT+VLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI CPTRSMQINVS
Subjt:  KKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTS
        KSHEPER+GKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAI KELMPVVAREYSSLTSSNYPMKTGSTS
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSC LSVSEDSEDD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPN
        NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFS LRQVLSCPEGLMQSDFIFHEKI+SVSPDSIDQPS+VLPAN+IDSQQPEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPN

Query:  ASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
Subjt:  ASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

XP_022932760.1 uncharacterized protein LOC111439213 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG
        MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG

Query:  EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL
        EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL
Subjt:  EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL

Query:  EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG
        EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG
Subjt:  EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG

Query:  KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG
        KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG
Subjt:  KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG

Query:  EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR
        EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR
Subjt:  EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR

Query:  SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS
        SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS
Subjt:  SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS

Query:  SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS
        SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS
Subjt:  SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS

Query:  EN
        EN
Subjt:  EN

XP_022972094.1 uncharacterized protein LOC111470733 [Cucurbita maxima]0.0e+0097.91Show/hide
Query:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEA
        MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVV DWISASSENGGNMGSLVD+ RILDVELGEESFKVD+VHDFEMIGAVEDGNQEVAMDEVEA
Subjt:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEA

Query:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQ
        KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEA+LVPGEEI LEKDNCGKPMLIDEEKIAGQQ
Subjt:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQ

Query:  NDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKN
        NDSENTSVLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGN+NELDKKN
Subjt:  NDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKN

Query:  AEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
        AEG+KIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGS+SGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
Subjt:  AEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE

Query:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
        FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
Subjt:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED

Query:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQ
        AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPS+VLPAN+IDSQQPEEPNASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQ
Subjt:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQ

Query:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
        TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
Subjt:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN

XP_023540169.1 uncharacterized protein LOC111800622 [Cucurbita pepo subsp. pepo]0.0e+0097.44Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG
        MANGLSSGDGDGFSRKFSASE HARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWIS SSENGGNMGSLVDETRILDVELG
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG

Query:  EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL
        EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTI VPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL
Subjt:  EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL

Query:  EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG
        EAMLVPGEEIHLEKDNCGK ML+DEEKI+GQQNDSENT+VLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI CPTRSMQINVSKSHEPERVG
Subjt:  EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG

Query:  KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG
        KKARRSRRR REAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG
Subjt:  KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG

Query:  EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR
        EEASSFIRDGCSESLDAEIEDMEGDNEITN EFPKPSC LSVSEDSEDD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIPNVAVAKVDR
Subjt:  EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR

Query:  SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS
        SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPS++LPAN+IDSQ PEEPNASTSSKENS
Subjt:  SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS

Query:  SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS
         NNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS
Subjt:  SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS

Query:  EN
        EN
Subjt:  EN

TrEMBL top hitse value%identityAlignment
A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948751.5e-26870.92Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHP---------DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSL-VD
        MA+ ++S   DGF+ KF +SE + +SP  P         D   FPLIVSN +L CEV +N+ +SASP+EN E+SV+KMV+CD    SSENGG+MGSL V 
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHP---------DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSL-VD

Query:  ETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE
        + + LDVELG+ES KVDAVHDFE +   ED  QEVA+DEV+ KDF   SV SFDGNQDCAK+E+VQE Q      A  KEAF RTE+LL+KE D+ESILE
Subjt:  ETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE

Query:  MKKKLLLEELEAMLVPGEEIHLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI
        MKKKLLLE+++AMLVPG+EIHL++           D C K ML+DEEKIA QQNDSE  +VLR+SHLSL NSLKIEVIDETALVEPVHVS+IGNG+ I I
Subjt:  MKKKLLLEELEAMLVPGEEIHLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI

Query:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK-----KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE
        +CPTRSMQ+ V KSHEP++ GKK  +SRR+ARE K+SE+HWN+ NVNE+DK     +NAEG+KI+YSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK-----KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE

Query:  YSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE
        YSSLT     +KTGSTS PRQ   K EEASS IR+GCSESLD EIEDMEGDNEITNF   +PSC  S+S+DS+DD+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE

Query:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWE+AFLADFSKLRQ LS   E  M+SDFI HEKID + P+ I QP
Subjt:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP

Query:  SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR
        S VLPA++ D  QPEE N STS+KE S N+YPSLSAISKMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLR
Subjt:  SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR

Query:  AEKSELDDEVIMLNILATISGRYFGQSEN
        AEK+E+DDEVIMLNIL+TISGRYFGQSEN
Subjt:  AEKSELDDEVIMLNILATISGRYFGQSEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X11.3e-26470.78Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHP---------DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSL-VD
        MA+ ++S   DGF+ KF +SE + +SP  P         D   FPLIVSN +L CEV +N+ +SASP+EN E+SV+KMV+CD    SSENGG+MGSL V 
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHP---------DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSL-VD

Query:  ETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE
        + + LDVELG+ES KVDAVHDFE +   ED  QEVA+DEV+ KDF   SV SFDGNQDCAK+E+VQE Q      A  KEAF RTE+LL+KE D+ESILE
Subjt:  ETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE

Query:  MKKKLLLEELEAMLVPGEEIHLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI
        MKKKLLLE+++AMLVPG+EIHL++           D C K ML+DEEKIA QQNDSE  +VLR+SHLSL NSLKIEVIDETALVEPVHVS+IGNG+ I I
Subjt:  MKKKLLLEELEAMLVPGEEIHLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI

Query:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK-----KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE
        +CPTRSMQ+ V KSHEP++ GKKA++SRR+ARE K+SE+HWN+ NVNE+DK     +NAEG+KI+YSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK-----KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE

Query:  YSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE
        YSSLT     +KTGSTS PRQ   K EEASS IR+GCSESLD EIEDMEGDNEITNF   +PSC  S+S+DS+DD+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE

Query:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWE+AFLADFSKLRQ LS   E  M+SDFI HEKID + P+ I QP
Subjt:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP

Query:  SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR
        S VLPA++ D  QPEE N STS+KE S N+YPSLSAISKMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLR
Subjt:  SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR

Query:  AEKSELDDEVIMLNILATISGR
        AEK+E+DDEVIMLNIL+TISGR
Subjt:  AEKSELDDEVIMLNILATISGR

A0A5D3CZJ0 Mis18-binding protein 1-like isoform X11.3e-26470.78Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHP---------DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSL-VD
        MA+ ++S   DGF+ KF +SE + +SP  P         D   FPLIVSN +L CEV +N+ +SASP+EN E+SV+KMV+CD    SSENGG+MGSL V 
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHP---------DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSL-VD

Query:  ETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE
        + + LDVELG+ES KVDAVHDFE +   ED  QEVA+DEV+ KDF   SV SFDGNQDCAK+E+VQE Q      A  KEAF RTE+LL+KE D+ESILE
Subjt:  ETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE

Query:  MKKKLLLEELEAMLVPGEEIHLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI
        MKKKLLLE+++AMLVPG+EIHL++           D C K ML+DEEKIA QQNDSE  +VLR+SHLSL NSLKIEVIDETALVEPVHVS+IGNG+ I I
Subjt:  MKKKLLLEELEAMLVPGEEIHLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDI

Query:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK-----KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE
        +CPTRSMQ+ V KSHEP++ GKKA++SRR+ARE K+SE+HWN+ NVNE+DK     +NAEG+KI+YSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK-----KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE

Query:  YSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE
        YSSLT     +KTGSTS PRQ   K EEASS IR+GCSESLD EIEDMEGDNEITNF   +PSC  S+S+DS+DD+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE

Query:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWE+AFLADFSKLRQ LS   E  M+SDFI HEKID + P+ I QP
Subjt:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP

Query:  SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR
        S VLPA++ D  QPEE N STS+KE S N+YPSLSAISKMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLR
Subjt:  SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR

Query:  AEKSELDDEVIMLNILATISGR
        AEK+E+DDEVIMLNIL+TISGR
Subjt:  AEKSELDDEVIMLNILATISGR

A0A6J1F307 uncharacterized protein LOC1114392130.0e+00100Show/hide
Query:  MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG
        MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG
Subjt:  MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELG

Query:  EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL
        EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL
Subjt:  EESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEEL

Query:  EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG
        EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG
Subjt:  EAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVG

Query:  KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG
        KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG
Subjt:  KKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKG

Query:  EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR
        EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR
Subjt:  EEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDR

Query:  SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS
        SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS
Subjt:  SNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENS

Query:  SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS
        SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS
Subjt:  SNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQS

Query:  EN
        EN
Subjt:  EN

A0A6J1IAH4 uncharacterized protein LOC1114707330.0e+0097.91Show/hide
Query:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEA
        MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVV DWISASSENGGNMGSLVD+ RILDVELGEESFKVD+VHDFEMIGAVEDGNQEVAMDEVEA
Subjt:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEA

Query:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQ
        KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEA+LVPGEEI LEKDNCGKPMLIDEEKIAGQQ
Subjt:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQ

Query:  NDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKN
        NDSENTSVLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGN+NELDKKN
Subjt:  NDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKN

Query:  AEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
        AEG+KIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGS+SGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
Subjt:  AEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE

Query:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
        FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
Subjt:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED

Query:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQ
        AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPS+VLPAN+IDSQQPEEPNASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQ
Subjt:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQ

Query:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
        TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
Subjt:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 22.3e-1127.95Show/hide
Query:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF
        P  L EG +P   S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      P+   P+ +W+   +A FS +RQ ++      +S  
Subjt:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF

Query:  IFHEKIDS--VSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNY------PSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSA
           +++DS    P S D+         +  +   +     ++ E+   +Y      P LS +S+MN    V+S+ + +++   +   +     WL+AL A
Subjt:  IFHEKIDS--VSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNY------PSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSA

Query:  AVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE
         ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  AVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE

O42260 Gem-associated protein 23.2e-1328.74Show/hide
Query:  SGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDS--VSP
        S PP    EYLRRV+ EA+  P+V +A++D    +K+++V +  +      P    PS  W+   +A FS +RQ L    G  +S     + +DS    P
Subjt:  SGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDS--VSP

Query:  DSIDQP------------SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLD
         + D+             S +  A N +SQ P                 P LS +S+M+     S L   +N  E +         WL+AL A ++ PL 
Subjt:  DSIDQP------------SIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLD

Query:  ADTCASFRSLLRKCASLRA-EKSELDDEVIMLNILATISGRYFGQSE
         +  +  R L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  ADTCASFRSLLRKCASLRA-EKSELDDEVIMLNILATISGRYFGQSE

Q54KN2 Gem-associated protein 25.8e-1522.84Show/hide
Query:  NSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKK--ERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEG
        +  Q  AF V  E   +   P  G EYL+RV+W ++  P+V VA +D S  K     + Y  + P+I  C + LLP+  WE  FL DFS+ RQ L   + 
Subjt:  NSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKK--ERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEG

Query:  LMQS------------------------------------------------------------------------DFIFHEKIDSVSPDSIDQPSIVLP
           S                                                                        D+ ++E  +    +  ++      
Subjt:  LMQS------------------------------------------------------------------------DFIFHEKIDSVSPDSIDQPSIVLP

Query:  ANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSE
            + ++ EE     S+K+ +  N P++  + +++ V  V+ +   I  LE +   ++    WL+ L + ++ P+D DTC++ RS +R+ +  R++ + 
Subjt:  ANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSE

Query:  LDD-EVIMLNILATISGRYFGQSE
        L+D  +  +NIL TI  +YF Q E
Subjt:  LDD-EVIMLNILATISGRYFGQSE

Q9CQQ4 Gem-associated protein 22.3e-1127.95Show/hide
Query:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF
        P  L EG +P   S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      P+   P+ +W+   +A FS +RQ +       +S  
Subjt:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF

Query:  IFHEKIDS--VSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNY------PSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSA
           +++DS    P S D+         +  +   E     S++E+   +Y      P LS +S+MN    ++S+ + +++   +   +     W +AL A
Subjt:  IFHEKIDS--VSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNY------PSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSA

Query:  AVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE
         ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  AVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE

Q9QZP1 Gem-associated protein 28.7e-1127.63Show/hide
Query:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANC---PQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQ
        P  L EG +P   S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +    +++ C   P+   P+ +W+   +  FS +RQ +       +
Subjt:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANC---PQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQ

Query:  SDFIFHEKIDS--VSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNY------PSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFA
        S     +++DS    P S D+         +  +   E     S+ E+   +Y      P LS +S+MN    ++S+ + +++   +   +     W +A
Subjt:  SDFIFHEKIDS--VSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNY------PSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFA

Query:  LSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE
        L A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  LSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related7.0e-6434.75Show/hide
Query:  KEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSL-KIEVIDETALVEPVH
        + +F +  E  R +   E  L  ++K    E + +L              K  L+ E ++      S +   L +   ++ N + KIE++D TALV+ VH
Subjt:  KEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSL-KIEVIDETALVEPVH

Query:  VSKIGNGEEIDIICPTRSMQI----NVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAI
          K     + +   P +  +I    NV +    E  GK+ RR                                 +Y+RK +E++RF ++  Q  LW  +
Subjt:  VSKIGNGEEIDIICPTRSMQI----NVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAI

Query:  CKELMPVVAREYSSLT-SSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLV
           ++P V  EY SL    NY  K+  ++  R   E G E +    +G  +      ED   DN                     DD  YNSI RPAF V
Subjt:  CKELMPVVAREYSSLT-SSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLV

Query:  EGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNF-KKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEK
        +GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKE+SVYMP+IP I  CP+ LLP KEWED+ L DF  LRQ L+      + + I  + 
Subjt:  EGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNF-KKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEK

Query:  IDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCAS
        ++ +  +                      N    ++E+ S     ++ I  M+SV RVS L+KRI  +E ++ L  +DC W+ AL A+++TPLDADTCA 
Subjt:  IDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCAS

Query:  FRSLLRKCASLRAEKS-ELDDE--VIMLNILATISGRYFGQ
         R LLRKCAS+RAE S E+ DE  + M N+L TI+GRYFGQ
Subjt:  FRSLLRKCASLRAEKS-ELDDE--VIMLNILATISGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown7.8e-2369.86Show/hide
Query:  EDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFK-KERSVYMPVIP
        E +ED+  YNSI RPAF V+GEP+F+SGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KE+SVYMP IP
Subjt:  EDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFK-KERSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown2.4e-3226.68Show/hide
Query:  SPEENAETSVEKMVVCDWISASSE----NGGNMGSLVDETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKK
        S  EN  T +E+  V + I  SSE     G  +G   D  +I DV +  + F V      E + + + G        VEA+  +  +    DG+    + 
Subjt:  SPEENAETSVEKMVVCDWISASSE----NGGNMGSLVDETRILDVELGEESFKVDAVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKK

Query:  EIVQEV----QFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPM--LIDEEKIAGQQNDSEN-----TSVL
        + +Q++    + ST  +    + ++  ++ +   +D + IL  +KK    +++ +         EK N G  +    D  KI    N S++       V 
Subjt:  EIVQEV----QFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKPM--LIDEEKIAGQQNDSEN-----TSVL

Query:  RQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGE-------EIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAE
         + +  +G S+ I+++D+TAL + V   K G            D   P +  ++ V K   P   G  +    R A   K+S+      N  E+   N +
Subjt:  RQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGE-------EIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDKKNAE

Query:  GSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFP
          +I+YSR  ME++R+ +++ Q +LW  +   L+P +  EY    S+  P +T       Q +   EE                                
Subjt:  GSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFP

Query:  KPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFK-KERSVYMPVIPAIANCPQNLLPSKEWEDA
                +ED++D   YNSI RPAF V+GEP+F+SGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KE+SVYMP IP      + +         
Subjt:  KPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFK-KERSVYMPVIPAIANCPQNLLPSKEWEDA

Query:  FLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQT
                        GL+    + H                   ++ I   Q    +    + +++ N        ++   +F            E ++
Subjt:  FLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQT

Query:  TLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRA
         L  +DC W+ AL A+VDTP DADT A  R+L+RKCASLRA
Subjt:  TLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAATGGGTTAAGTTCCGGCGATGGTGATGGGTTCAGTCGAAAATTCTCTGCCTCTGAGGGCCACGCACGATCTCCTTTTCATCCGGACGAGATGAAGTTTCCTTT
GATCGTCTCGAATCCAAGCCTGCAGTGTGAAGTTAGGATGAACAGTTCGAGTTCTGCTTCTCCAGAAGAGAACGCAGAAACTTCTGTTGAAAAGATGGTCGTCTGCGATT
GGATTTCTGCGTCTTCTGAAAATGGAGGAAACATGGGAAGTCTGGTGGACGAGACTCGGATTCTTGATGTGGAGCTCGGAGAAGAATCTTTCAAGGTGGATGCTGTCCAT
GATTTTGAAATGATAGGTGCCGTGGAAGATGGGAATCAAGAAGTTGCGATGGATGAAGTAGAAGCGAAAGATTTTGTAACAATTAGTGTTCCGAGTTTTGATGGAAATCA
AGATTGTGCGAAGAAAGAAATTGTTCAAGAAGTTCAGTTTTCTACTGCTATGGAAGCCGACAGCAAAGAAGCCTTTGAGCGAACCGAGGAATTGTTGAGGAAAGAAGCTG
ATACTGAGAGCATTCTGGAGATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAGAAGAGATTCATCTAGAAAAGGATAATTGCGGCAAACCG
ATGCTTATTGATGAGGAGAAGATTGCTGGTCAGCAAAATGATTCTGAGAACACGAGCGTTCTTAGACAAAGCCATTTATCTCTCGGAAATTCATTGAAGATTGAAGTAAT
AGACGAAACTGCATTAGTTGAACCGGTTCATGTTTCCAAAATTGGAAATGGAGAAGAAATTGACATAATTTGTCCAACAAGGTCAATGCAGATCAATGTGAGCAAATCCC
ATGAACCCGAAAGAGTGGGGAAAAAGGCTAGAAGATCGAGGAGGAGGGCGAGGGAAGCGAAGATCTCAGAGGTGCATTGGAATCTGGGGAATGTGAATGAACTTGATAAG
AAAAATGCTGAAGGAAGTAAGATAGTGTACTCAAGGAAGGATATGGAAGCACTGAGGTTTGTAAATGTTTCAGAACAGAGTAGATTGTGGGAAGCTATATGCAAGGAACT
TATGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAAGCTCTAATTACCCAATGAAGACAGGCTCCACCTCTGGTCCTAGGCAGCATTTCGAGAAGGGAGAAGAAGCCT
CTTCATTTATAAGGGATGGCTGTTCAGAAAGCTTGGATGCTGAGATAGAGGACATGGAAGGTGATAATGAAATTACGAATTTTGAATTCCCGAAACCCTCTTGCGGTCTT
AGTGTAAGTGAAGATAGTGAAGATGATAGATATTACAACAGTATACAGAGACCTGCCTTTCTGGTGGAGGGAGAGCCCAATTTTGAGTCAGGACCTCCAGAAGATGGATT
GGAATATCTTAGACGTGTGAGGTGGGAAGCATCCCATATTCCAAATGTGGCGGTGGCTAAAGTTGATAGAAGTAACTTTAAGAAAGAGCGAAGTGTTTATATGCCAGTGA
TTCCTGCCATTGCCAATTGCCCCCAGAATTTACTGCCTTCAAAAGAGTGGGAGGATGCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGTTCTGTCGTGCCCCGAGGGA
CTTATGCAGTCTGATTTCATCTTCCATGAAAAGATCGATTCTGTGAGTCCAGACTCGATCGATCAGCCGAGCATTGTCTTGCCTGCCAATAATATCGACTCGCAGCAACC
CGAGGAACCAAATGCCAGCACTTCTTCAAAGGAAAACAGTTCCAACAACTATCCATCTTTATCAGCAATTTCAAAGATGAACTCGGTGTTTCGTGTTTCTTCGTTGAGGA
AACGTATAAACTCGTTAGAAACACAAACAACACTGTCGAGGACCGACTGTCTTTGGCTATTTGCGTTAAGTGCGGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCT
TCTTTCAGGAGTCTGCTTCGGAAATGTGCCAGTTTGCGGGCCGAGAAGTCCGAGCTCGACGACGAGGTGATAATGCTCAATATTCTAGCCACCATTTCAGGAAGATACTT
TGGACAGTCGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
CGGTGGCGCCCCTAAAGCCCTTGTGCTTTCTTCCAGCAATGGCGAATGGGTTAAGTTCCGGCGATGGTGATGGGTTCAGTCGAAAATTCTCTGCCTCTGAGGGCCACGCA
CGATCTCCTTTTCATCCGGACGAGATGAAGTTTCCTTTGATCGTCTCGAATCCAAGCCTGCAGTGTGAAGTTAGGATGAACAGTTCGAGTTCTGCTTCTCCAGAAGAGAA
CGCAGAAACTTCTGTTGAAAAGATGGTCGTCTGCGATTGGATTTCTGCGTCTTCTGAAAATGGAGGAAACATGGGAAGTCTGGTGGACGAGACTCGGATTCTTGATGTGG
AGCTCGGAGAAGAATCTTTCAAGGTGGATGCTGTCCATGATTTTGAAATGATAGGTGCCGTGGAAGATGGGAATCAAGAAGTTGCGATGGATGAAGTAGAAGCGAAAGAT
TTTGTAACAATTAGTGTTCCGAGTTTTGATGGAAATCAAGATTGTGCGAAGAAAGAAATTGTTCAAGAAGTTCAGTTTTCTACTGCTATGGAAGCCGACAGCAAAGAAGC
CTTTGAGCGAACCGAGGAATTGTTGAGGAAAGAAGCTGATACTGAGAGCATTCTGGAGATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAG
AAGAGATTCATCTAGAAAAGGATAATTGCGGCAAACCGATGCTTATTGATGAGGAGAAGATTGCTGGTCAGCAAAATGATTCTGAGAACACGAGCGTTCTTAGACAAAGC
CATTTATCTCTCGGAAATTCATTGAAGATTGAAGTAATAGACGAAACTGCATTAGTTGAACCGGTTCATGTTTCCAAAATTGGAAATGGAGAAGAAATTGACATAATTTG
TCCAACAAGGTCAATGCAGATCAATGTGAGCAAATCCCATGAACCCGAAAGAGTGGGGAAAAAGGCTAGAAGATCGAGGAGGAGGGCGAGGGAAGCGAAGATCTCAGAGG
TGCATTGGAATCTGGGGAATGTGAATGAACTTGATAAGAAAAATGCTGAAGGAAGTAAGATAGTGTACTCAAGGAAGGATATGGAAGCACTGAGGTTTGTAAATGTTTCA
GAACAGAGTAGATTGTGGGAAGCTATATGCAAGGAACTTATGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAAGCTCTAATTACCCAATGAAGACAGGCTCCACCTC
TGGTCCTAGGCAGCATTTCGAGAAGGGAGAAGAAGCCTCTTCATTTATAAGGGATGGCTGTTCAGAAAGCTTGGATGCTGAGATAGAGGACATGGAAGGTGATAATGAAA
TTACGAATTTTGAATTCCCGAAACCCTCTTGCGGTCTTAGTGTAAGTGAAGATAGTGAAGATGATAGATATTACAACAGTATACAGAGACCTGCCTTTCTGGTGGAGGGA
GAGCCCAATTTTGAGTCAGGACCTCCAGAAGATGGATTGGAATATCTTAGACGTGTGAGGTGGGAAGCATCCCATATTCCAAATGTGGCGGTGGCTAAAGTTGATAGAAG
TAACTTTAAGAAAGAGCGAAGTGTTTATATGCCAGTGATTCCTGCCATTGCCAATTGCCCCCAGAATTTACTGCCTTCAAAAGAGTGGGAGGATGCATTTCTTGCTGATT
TTTCTAAGCTGCGTCAGGTTCTGTCGTGCCCCGAGGGACTTATGCAGTCTGATTTCATCTTCCATGAAAAGATCGATTCTGTGAGTCCAGACTCGATCGATCAGCCGAGC
ATTGTCTTGCCTGCCAATAATATCGACTCGCAGCAACCCGAGGAACCAAATGCCAGCACTTCTTCAAAGGAAAACAGTTCCAACAACTATCCATCTTTATCAGCAATTTC
AAAGATGAACTCGGTGTTTCGTGTTTCTTCGTTGAGGAAACGTATAAACTCGTTAGAAACACAAACAACACTGTCGAGGACCGACTGTCTTTGGCTATTTGCGTTAAGTG
CGGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCTTCTTTCAGGAGTCTGCTTCGGAAATGTGCCAGTTTGCGGGCCGAGAAGTCCGAGCTCGACGACGAGGTGATA
ATGCTCAATATTCTAGCCACCATTTCAGGAAGATACTTTGGACAGTCGGAAAATTGAAGCTATGGTGAGGTTAATAGTATCGTAGATGACTCTGATTGTTCCTTTTGTTT
TGTAAGAGATTCATACATTCATATATACATACATATATTTGATAATTTATAATTTAGTGGAACAATAAACTTATAAATTGATGGTTTTTAAT
Protein sequenceShow/hide protein sequence
MANGLSSGDGDGFSRKFSASEGHARSPFHPDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVCDWISASSENGGNMGSLVDETRILDVELGEESFKVDAVH
DFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEAMLVPGEEIHLEKDNCGKP
MLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETALVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNVNELDK
KNAEGSKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSTSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGL
SVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEG
LMQSDFIFHEKIDSVSPDSIDQPSIVLPANNIDSQQPEEPNASTSSKENSSNNYPSLSAISKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCA
SFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN