; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh06G016360 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh06G016360
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationCma_Chr06:10185996..10191016
RNA-Seq ExpressionCmaCh06G016360
SyntenyCmaCh06G016360
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597639.1 Gem-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0095.36Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD
        MANGLSSGDGDGFSRKF ASEDHAR  FHPDDSVAKIS +EMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVV DWISASSENGGNMGSLVD+
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD

Query:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
         RILDV+LGEESFKVD+VHDFEMIG+VEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEA+LVPGEEI LEKDNCGK ML+DEEKI+GQQNDSENT+VLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDI CPTRSMQINVS
Subjt:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS
        KSHEPER+GKKARRSRRRAREAKISEVHWNLGN+NELDKKNAEG+KIVYSRKDMEALRFVNVSEQSRLWEAI KELMPVVAREYSSLTSSNYPMKTGS+S
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSC LSVSEDSEDD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN
        NV+VAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFS LRQVLSCPEGLMQSDFIFHEKI+SVSPDSIDQPSVVLPANDIDSQQPEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN

Query:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSEL+DEVIMLNILAT
Subjt:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

KAG7029080.1 Gem-associated protein 2 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0095.36Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD
        MANGLSSGDGDGFSRKF ASEDHAR  FHPDDSVAKIS +EMKFPLIVSNPSLQCEVRMNSSSSASPEE AETSVEKMVV DWISASSENGGNMGSLVD+
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD

Query:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
         RILDV+LGEESFKVD+VHDFEMIG+VEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEA+LVPGEEI LEKDNCGK ML+DEEKI+GQ+NDSENT+VLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDI CPTRSMQINVS
Subjt:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS
        KSHEPER+GKKARRSRRRAREAKISEVHWNLGN+NELDKKNAEG+KIVYSRKDMEALRFVNVSEQSRLWEAI KELMPVVAREYSSLTSSNYPMKTGS+S
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSC LSVSEDSEDD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN
        NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFS LRQVLSCPEGLMQSDFIFHEKI+SVSPDSIDQPSVVLPANDIDSQQPEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN

Query:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
Subjt:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

XP_022932760.1 uncharacterized protein LOC111439213 [Cucurbita moschata]0.0e+0096.48Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD
        MANGLSSGDGDGFSRKFSASE HARS FHP         DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVV DWISASSENGGNMGSLVD+
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD

Query:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
         RILDVELGEESFKVD+VHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEA+LVPGEEI LEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDIICPTRSMQINVS
Subjt:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS
        KSHEPERVGKKARRSRRRAREAKISEVHWNLGN+NELDKKNAEG+KIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGS+S
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN
        NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPS+VLPAN+IDSQQPEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN

Query:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
Subjt:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

XP_022972094.1 uncharacterized protein LOC111470733 [Cucurbita maxima]0.0e+00100Show/hide
Query:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEA
        MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEA
Subjt:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEA

Query:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQ
        KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQ
Subjt:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQ

Query:  NDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKN
        NDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKN
Subjt:  NDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKN

Query:  AEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
        AEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
Subjt:  AEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE

Query:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
        FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
Subjt:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED

Query:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQ
        AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQ
Subjt:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQ

Query:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
        TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
Subjt:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN

XP_023540169.1 uncharacterized protein LOC111800622 [Cucurbita pepo subsp. pepo]0.0e+0094.94Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD
        MANGLSSGDGDGFSRKFSASEDHARS FHP         DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVV DWIS SSENGGNMGSLVD+
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD

Query:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
         RILDVELGEESFKVD+VHDFEMIGAVEDGNQEVAMDEVEAKDFVTI VPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEA+LVPGEEI LEKDNCGK ML+DEEKI+GQQNDSENT+VLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDI CPTRSMQINVS
Subjt:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS
        KSHEPERVGKKARRSRRR REAKISEVHWNLGN+NELDKKNAEG+KIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGS+S
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITN EFPKPSC LSVSEDSEDD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN
        NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSV+LPANDIDSQ PEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN

Query:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
Subjt:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

TrEMBL top hitse value%identityAlignment
A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948754.5e-26870.78Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSL-VD
        MA+ ++S   DGF+ KF +SE + +S   P DS   IS D   FPLIVSN +L CEV +N+ +SASP+EN E+SV+KMV+ D    SSENGG+MGSL V 
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSL-VD

Query:  DARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE
          + LDVELG+ES KVD+VHDFE +   ED  QEVA+DEV+ KDF   SV SFDGNQDCAK+E+VQE Q      A  KEAF RTE+LL+KE D+ESILE
Subjt:  DARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE

Query:  MKKKLLLEELEALLVPGEEILLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDI
        MKKKLLLE+++A+LVPG+EI L++           D C K ML+DEEKIA QQNDSE  +VLR+SHLSL NSLKIEVIDETA+VEPVHVS+IGNG+ I I
Subjt:  MKKKLLLEELEALLVPGEEILLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDI

Query:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDK-----KNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE
        +CPTRSMQ+ V KSHEP++ GKK  +SRR+ARE K+SE+HWN+ N+NE+DK     +NAEGNKI+YSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDK-----KNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE

Query:  YSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE
        YSSLT     +KTGS+S PRQ   K EEASS IR+GCSESLD EIEDMEGDNEITNF   +PSC  S+S+DS+DD+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE

Query:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWE+AFLADFSKLRQ LS   E  M+SDFI HEKID + P+ I QP
Subjt:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP

Query:  SVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR
        S VLPA+D D  QPEE N STS+KE SCN+YPSLSAISKMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLR
Subjt:  SVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR

Query:  AEKSELDDEVIMLNILATISGRYFGQSEN
        AEK+E+DDEVIMLNIL+TISGRYFGQSEN
Subjt:  AEKSELDDEVIMLNILATISGRYFGQSEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X13.9e-26470.64Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSL-VD
        MA+ ++S   DGF+ KF +SE + +S   P DS   IS D   FPLIVSN +L CEV +N+ +SASP+EN E+SV+KMV+ D    SSENGG+MGSL V 
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSL-VD

Query:  DARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE
          + LDVELG+ES KVD+VHDFE +   ED  QEVA+DEV+ KDF   SV SFDGNQDCAK+E+VQE Q      A  KEAF RTE+LL+KE D+ESILE
Subjt:  DARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE

Query:  MKKKLLLEELEALLVPGEEILLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDI
        MKKKLLLE+++A+LVPG+EI L++           D C K ML+DEEKIA QQNDSE  +VLR+SHLSL NSLKIEVIDETA+VEPVHVS+IGNG+ I I
Subjt:  MKKKLLLEELEALLVPGEEILLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDI

Query:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDK-----KNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE
        +CPTRSMQ+ V KSHEP++ GKKA++SRR+ARE K+SE+HWN+ N+NE+DK     +NAEGNKI+YSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDK-----KNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE

Query:  YSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE
        YSSLT     +KTGS+S PRQ   K EEASS IR+GCSESLD EIEDMEGDNEITNF   +PSC  S+S+DS+DD+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE

Query:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWE+AFLADFSKLRQ LS   E  M+SDFI HEKID + P+ I QP
Subjt:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP

Query:  SVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR
        S VLPA+D D  QPEE N STS+KE SCN+YPSLSAISKMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLR
Subjt:  SVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR

Query:  AEKSELDDEVIMLNILATISGR
        AEK+E+DDEVIMLNIL+TISGR
Subjt:  AEKSELDDEVIMLNILATISGR

A0A5D3CZJ0 Mis18-binding protein 1-like isoform X13.9e-26470.64Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSL-VD
        MA+ ++S   DGF+ KF +SE + +S   P DS   IS D   FPLIVSN +L CEV +N+ +SASP+EN E+SV+KMV+ D    SSENGG+MGSL V 
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSL-VD

Query:  DARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE
          + LDVELG+ES KVD+VHDFE +   ED  QEVA+DEV+ KDF   SV SFDGNQDCAK+E+VQE Q      A  KEAF RTE+LL+KE D+ESILE
Subjt:  DARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILE

Query:  MKKKLLLEELEALLVPGEEILLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDI
        MKKKLLLE+++A+LVPG+EI L++           D C K ML+DEEKIA QQNDSE  +VLR+SHLSL NSLKIEVIDETA+VEPVHVS+IGNG+ I I
Subjt:  MKKKLLLEELEALLVPGEEILLEK-----------DNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDI

Query:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDK-----KNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE
        +CPTRSMQ+ V KSHEP++ GKKA++SRR+ARE K+SE+HWN+ N+NE+DK     +NAEGNKI+YSRKDMEALRFVNV+EQ RLW+AICKEL+PVVARE
Subjt:  ICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDK-----KNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVARE

Query:  YSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE
        YSSLT     +KTGS+S PRQ   K EEASS IR+GCSESLD EIEDMEGDNEITNF   +PSC  S+S+DS+DD+YY+SIQRPAFLVEGEPNF+SGPPE
Subjt:  YSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPE

Query:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP
        DGLEYLRRVRWEASHIPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWE+AFLADFSKLRQ LS   E  M+SDFI HEKID + P+ I QP
Subjt:  DGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLS-CPEGLMQSDFIFHEKIDSVSPDSIDQP

Query:  SVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR
        S VLPA+D D  QPEE N STS+KE SCN+YPSLSAISKMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVDTPLD DTCA+FRSLLRKCASLR
Subjt:  SVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLR

Query:  AEKSELDDEVIMLNILATISGR
        AEK+E+DDEVIMLNIL+TISGR
Subjt:  AEKSELDDEVIMLNILATISGR

A0A6J1F307 uncharacterized protein LOC1114392130.0e+0096.48Show/hide
Query:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD
        MANGLSSGDGDGFSRKFSASE HARS FHP         DEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVV DWISASSENGGNMGSLVD+
Subjt:  MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDD

Query:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
         RILDVELGEESFKVD+VHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM
Subjt:  ARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEM

Query:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS
        KKKLLLEELEA+LVPGEEI LEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETA+VEPVHVSKIGNGEEIDIICPTRSMQINVS
Subjt:  KKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVS

Query:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS
        KSHEPERVGKKARRSRRRAREAKISEVHWNLGN+NELDKKNAEG+KIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGS+S
Subjt:  KSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSS

Query:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
        GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP
Subjt:  GPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIP

Query:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN
        NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPS+VLPAN+IDSQQPEEPN
Subjt:  NVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPN

Query:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
        ASTSSKENS NNYPSLSAISKMN VFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT
Subjt:  ASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILAT

Query:  ISGRYFGQSEN
        ISGRYFGQSEN
Subjt:  ISGRYFGQSEN

A0A6J1IAH4 uncharacterized protein LOC1114707330.0e+00100Show/hide
Query:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEA
        MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEA
Subjt:  MKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEA

Query:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQ
        KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQ
Subjt:  KDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQ

Query:  NDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKN
        NDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKN
Subjt:  NDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKN

Query:  AEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
        AEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE
Subjt:  AEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFE

Query:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
        FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED
Subjt:  FPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWED

Query:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQ
        AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQ
Subjt:  AFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQ

Query:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
        TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN
Subjt:  TTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 21.4e-1128.57Show/hide
Query:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF
        P  L EG +P   S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      P+   P+ +W+   +A FS +RQ ++      +S  
Subjt:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF

Query:  IFHEKIDS--VSPDSIDQPS----VVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAV
           +++DS    P S D+       +      D       N S           P LS +S+MN    V+S+ + +++   +   +     WL+AL A +
Subjt:  IFHEKIDS--VSPDSIDQPS----VVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAV

Query:  DTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  DTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE

O42260 Gem-associated protein 27.2e-1327.31Show/hide
Query:  SGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDS
        S PP    EYLRRV+ EA+  P+V +A++D    +K+++V +  +      P    PS  W+   +A FS +RQ L    G  +S     + +DS     
Subjt:  SGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDS

Query:  IDQPSVVLPANDIDSQQP-----EEPNASTSSKENSCNNYPS-----------LSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTP
            +V +P+ + +         E   +  ++  NS + +P            LS +S+M+     S L   +N  E +         WL+AL A ++ P
Subjt:  IDQPSVVLPANDIDSQQP-----EEPNASTSSKENSCNNYPS-----------LSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTP

Query:  LDADTCASFRSLLRKCASLRA-EKSELDDEVIMLNILATISGRYFGQSE
        L  +  +  R L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  LDADTCASFRSLLRKCASLRA-EKSELDDEVIMLNILATISGRYFGQSE

Q54KN2 Gem-associated protein 29.1e-1622.84Show/hide
Query:  NSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKK--ERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEG
        +  Q  AF V  E   +   P  G EYL+RV+W ++  P+V VA +D S  K     + Y  + P+I  C + LLP+  WE  FL DFS+ RQ L   + 
Subjt:  NSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKK--ERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEG

Query:  LMQS------------------------------------------------------------------------DFIFHEKIDSVSPDSIDQPSVVLP
           S                                                                        D+ ++E  +    +  ++      
Subjt:  LMQS------------------------------------------------------------------------DFIFHEKIDSVSPDSIDQPSVVLP

Query:  ANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSE
          + + ++ EE     S+K+ +  N P++  + +++ V  V+ +   I  LE +   ++    WL+ L + ++ P+D DTC++ RS +R+ +  R++ + 
Subjt:  ANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSE

Query:  LDD-EVIMLNILATISGRYFGQSE
        L+D  +  +NIL TI  +YF Q E
Subjt:  LDD-EVIMLNILATISGRYFGQSE

Q9CQQ4 Gem-associated protein 22.3e-1127.95Show/hide
Query:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF
        P  L EG +P   S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      P+   P+ +W+   +A FS +RQ +       +S  
Subjt:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDF

Query:  IFHEKIDS--VSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNY------PSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSA
           +++DS    P S D+         +  +   E     S++E+   +Y      P LS +S+MN    ++S+ + +++   +   +     W +AL A
Subjt:  IFHEKIDS--VSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNY------PSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSA

Query:  AVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE
         ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  AVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE

Q9QZP1 Gem-associated protein 26.8e-1127.63Show/hide
Query:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANC---PQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQ
        P  L EG +P   S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +    +++ C   P+   P+ +W+   +  FS +RQ +       +
Subjt:  PAFLVEG-EPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPAIANC---PQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQ

Query:  SDFIFHEKIDS--VSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNY------PSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFA
        S     +++DS    P S D+         +  +   E     S+ E+   +Y      P LS +S+MN    ++S+ + +++   +   +     W +A
Subjt:  SDFIFHEKIDS--VSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNY------PSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFA

Query:  LSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE
        L A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  LSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDE-VIMLNILATISGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related3.5e-6334.38Show/hide
Query:  KEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSL-KIEVIDETAIVEPVH
        + +F +  E  R +   E  L  ++K    E + LL              K  L+ E ++      S +   L +   ++ N + KIE++D TA+V+ VH
Subjt:  KEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSL-KIEVIDETAIVEPVH

Query:  VSKIGNGEEIDIICPTRSMQI----NVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAI
          K     + +   P +  +I    NV +    E  GK+ RR                                 +Y+RK +E++RF ++  Q  LW  +
Subjt:  VSKIGNGEEIDIICPTRSMQI----NVSKSHEPERVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAI

Query:  CKELMPVVAREYSSLT-SSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLV
           ++P V  EY SL    NY  K+  S+  R   E G E +    +G  +      ED   DN                     DD  YNSI RPAF V
Subjt:  CKELMPVVAREYSSLT-SSNYPMKTGSSSGPRQHFEKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLV

Query:  EGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNF-KKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEK
        +GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKE+SVYMP+IP I  CP+ LLP KEWED+ L DF  LRQ L+      + + I  + 
Subjt:  EGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNF-KKERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEK

Query:  IDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCAS
        ++ +  +  ++         + +++ E      +              I  M+ V RVS L+KRI  +E ++ L  +DC W+ AL A+++TPLDADTCA 
Subjt:  IDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCAS

Query:  FRSLLRKCASLRAEKS-ELDDE--VIMLNILATISGRYFGQ
         R LLRKCAS+RAE S E+ DE  + M N+L TI+GRYFGQ
Subjt:  FRSLLRKCASLRAEKS-ELDDE--VIMLNILATISGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown7.9e-2369.86Show/hide
Query:  EDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFK-KERSVYMPVIP
        E +ED+  YNSI RPAF V+GEP+F+SGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KE+SVYMP IP
Subjt:  EDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFK-KERSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown4.9e-3325.96Show/hide
Query:  STFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDF-EMI
        S F  D  +      ++ F       ++   V        S  EN  T +E+  V + I  SSE     G  + D  +   ++ E    V S+ D  E +
Subjt:  STFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILDVELGEESFKVDSVHDF-EMI

Query:  GAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEV----QFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEIL
         + + G        VEA+  +  +    DG+    + + +Q++    + ST  +    + ++  ++ +   +D + IL  +KK    +++ L        
Subjt:  GAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEV----QFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELEALLVPGEEIL

Query:  LEKDNCGKPM--LIDEEKIAGQQNDSEN-----TSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGE-------EIDIICPTRSMQINVSKSHEPE
         EK N G  +    D  KI    N S++       V  + +  +G S+ I+++D+TA+ + V   K G            D   P +  ++ V K   P 
Subjt:  LEKDNCGKPM--LIDEEKIAGQQNDSEN-----TSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGE-------EIDIICPTRSMQINVSKSHEPE

Query:  RVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHF
          G  +    R A   K+S+   N G M      N +  +I+YSR  ME++R+ +++ Q +LW  +   L+P +  EY    S+  P +T       Q +
Subjt:  RVGKKARRSRRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHF

Query:  EKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAK
           EE                                        +ED++D   YNSI RPAF V+GEP+F+SGPPEDG+EYLRRVRWEA  IPNV VAK
Subjt:  EKGEEASSFIRDGCSESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAK

Query:  VDRSNFK-KERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSS
        V  S ++ KE+SVYMP IP      + +                         GL+    + H     +    +   SV L    +  +      + T  
Subjt:  VDRSNFK-KERSVYMPVIPAIANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSS

Query:  KENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRA
        K    + +                         E ++ L  +DC W+ AL A+VDTP DADT A  R+L+RKCASLRA
Subjt:  KENSCNNYPSLSAISKMNLVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAATGGGTTAAGTTCCGGCGATGGTGATGGGTTCAGTCGAAAATTCTCTGCCTCTGAGGACCACGCACGATCTACTTTTCATCCGGACGATTCTGTCGCT
AAGATCTCTGATGACGAGATGAAGTTTCCTTTGATCGTCTCGAATCCAAGCCTGCAGTGTGAAGTTAGGATGAACAGTTCGAGTTCTGCTTCTCCAGAAGAGAAC
GCAGAAACTTCTGTTGAAAAGATGGTCGTCCGCGATTGGATTTCTGCGTCTTCTGAAAATGGAGGAAACATGGGAAGTCTGGTGGACGATGCTCGGATTCTTGAT
GTGGAGCTCGGAGAAGAATCTTTCAAGGTGGATTCTGTCCATGATTTTGAAATGATAGGTGCCGTGGAAGATGGGAATCAAGAAGTTGCGATGGATGAAGTAGAA
GCGAAAGATTTTGTAACAATTAGTGTTCCGAGTTTTGATGGAAATCAAGATTGTGCGAAGAAAGAAATTGTTCAAGAAGTTCAGTTTTCTACTGCTATGGAAGCC
GACAGCAAAGAAGCCTTTGAGCGAACCGAGGAATTGTTGAGGAAAGAAGCTGATACTGAGAGCATTCTGGAGATGAAAAAGAAATTACTATTGGAAGAACTTGAA
GCCTTGTTGGTTCCTGGAGAAGAGATTCTTCTAGAAAAGGATAATTGCGGCAAACCGATGCTTATTGATGAGGAGAAGATTGCCGGTCAGCAAAATGATTCTGAG
AACACGAGCGTTCTTAGACAAAGCCATTTATCTCTCGGAAATTCATTGAAGATTGAAGTAATAGACGAAACTGCAATAGTTGAACCTGTTCATGTTTCCAAAATT
GGAAATGGAGAAGAAATTGACATAATTTGTCCAACAAGGTCAATGCAGATCAATGTGAGCAAATCCCATGAACCCGAAAGAGTGGGGAAAAAGGCTAGAAGATCG
AGGAGGAGGGCGAGGGAAGCGAAGATCTCAGAGGTGCATTGGAATCTGGGGAATATGAATGAACTTGATAAGAAAAATGCTGAAGGAAATAAGATAGTGTACTCA
AGGAAGGATATGGAAGCACTGAGGTTTGTAAATGTTTCAGAACAGAGTAGATTGTGGGAAGCTATATGCAAGGAACTTATGCCCGTTGTGGCGAGGGAATACAGT
AGCTTAACAAGCTCTAATTACCCAATGAAGACAGGCTCCTCCTCTGGTCCTAGGCAGCATTTCGAGAAGGGAGAAGAAGCCTCTTCATTTATAAGGGATGGCTGT
TCAGAAAGCTTGGATGCTGAGATAGAGGACATGGAAGGTGATAATGAAATTACGAATTTTGAATTCCCGAAACCCTCTTGCGGTCTTAGTGTAAGTGAAGATAGT
GAAGATGATAGATATTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAGCCCAATTTTGAGTCAGGACCTCCAGAAGATGGATTGGAATATCTTAGA
CGTGTGAGGTGGGAAGCATCCCATATTCCAAATGTGGCGGTGGCTAAAGTTGATAGAAGTAACTTCAAGAAAGAGCGAAGTGTTTATATGCCAGTGATTCCTGCC
ATTGCCAATTGCCCCCAGAATTTACTGCCTTCAAAAGAGTGGGAGGATGCATTTCTTGCCGATTTTTCTAAGCTGCGTCAGGTTCTGTCGTGCCCCGAGGGACTT
ATGCAGTCTGATTTCATCTTCCATGAAAAGATCGATTCTGTGAGTCCAGACTCGATCGATCAGCCGAGCGTTGTCTTGCCTGCCAATGATATCGACTCGCAGCAA
CCCGAGGAACCGAATGCCAGCACTTCTTCAAAGGAAAACAGTTGCAACAACTATCCATCTTTATCAGCAATTTCAAAGATGAACTTGGTGTTTCGTGTTTCTTCG
TTGAGGAAACGTATAAACTCGTTAGAAACACAAACAACACTCTCGAGGACGGACTGTCTTTGGCTATTTGCGTTAAGTGCAGCAGTTGATACTCCTCTGGATGCA
GATACTTGTGCTTCTTTCAGGAGTCTGCTTCGGAAATGTGCCAGTTTGCGGGCCGAGAAGTCCGAGCTCGACGACGAGGTGATAATGCTCAATATTCTAGCCACC
ATTTCAGGAAGATACTTTGGACAGTCGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
GCGCCCCTAAAGCCCTTGTGCTTTCTTCCAGCAATGGCGAATGGGTTAAGTTCCGGCGATGGTGATGGGTTCAGTCGAAAATTCTCTGCCTCTGAGGACCACGCA
CGATCTACTTTTCATCCGGACGATTCTGTCGCTAAGATCTCTGATGACGAGATGAAGTTTCCTTTGATCGTCTCGAATCCAAGCCTGCAGTGTGAAGTTAGGATG
AACAGTTCGAGTTCTGCTTCTCCAGAAGAGAACGCAGAAACTTCTGTTGAAAAGATGGTCGTCCGCGATTGGATTTCTGCGTCTTCTGAAAATGGAGGAAACATG
GGAAGTCTGGTGGACGATGCTCGGATTCTTGATGTGGAGCTCGGAGAAGAATCTTTCAAGGTGGATTCTGTCCATGATTTTGAAATGATAGGTGCCGTGGAAGAT
GGGAATCAAGAAGTTGCGATGGATGAAGTAGAAGCGAAAGATTTTGTAACAATTAGTGTTCCGAGTTTTGATGGAAATCAAGATTGTGCGAAGAAAGAAATTGTT
CAAGAAGTTCAGTTTTCTACTGCTATGGAAGCCGACAGCAAAGAAGCCTTTGAGCGAACCGAGGAATTGTTGAGGAAAGAAGCTGATACTGAGAGCATTCTGGAG
ATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCTTGTTGGTTCCTGGAGAAGAGATTCTTCTAGAAAAGGATAATTGCGGCAAACCGATGCTTATTGATGAG
GAGAAGATTGCCGGTCAGCAAAATGATTCTGAGAACACGAGCGTTCTTAGACAAAGCCATTTATCTCTCGGAAATTCATTGAAGATTGAAGTAATAGACGAAACT
GCAATAGTTGAACCTGTTCATGTTTCCAAAATTGGAAATGGAGAAGAAATTGACATAATTTGTCCAACAAGGTCAATGCAGATCAATGTGAGCAAATCCCATGAA
CCCGAAAGAGTGGGGAAAAAGGCTAGAAGATCGAGGAGGAGGGCGAGGGAAGCGAAGATCTCAGAGGTGCATTGGAATCTGGGGAATATGAATGAACTTGATAAG
AAAAATGCTGAAGGAAATAAGATAGTGTACTCAAGGAAGGATATGGAAGCACTGAGGTTTGTAAATGTTTCAGAACAGAGTAGATTGTGGGAAGCTATATGCAAG
GAACTTATGCCCGTTGTGGCGAGGGAATACAGTAGCTTAACAAGCTCTAATTACCCAATGAAGACAGGCTCCTCCTCTGGTCCTAGGCAGCATTTCGAGAAGGGA
GAAGAAGCCTCTTCATTTATAAGGGATGGCTGTTCAGAAAGCTTGGATGCTGAGATAGAGGACATGGAAGGTGATAATGAAATTACGAATTTTGAATTCCCGAAA
CCCTCTTGCGGTCTTAGTGTAAGTGAAGATAGTGAAGATGATAGATATTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAGCCCAATTTTGAGTCA
GGACCTCCAGAAGATGGATTGGAATATCTTAGACGTGTGAGGTGGGAAGCATCCCATATTCCAAATGTGGCGGTGGCTAAAGTTGATAGAAGTAACTTCAAGAAA
GAGCGAAGTGTTTATATGCCAGTGATTCCTGCCATTGCCAATTGCCCCCAGAATTTACTGCCTTCAAAAGAGTGGGAGGATGCATTTCTTGCCGATTTTTCTAAG
CTGCGTCAGGTTCTGTCGTGCCCCGAGGGACTTATGCAGTCTGATTTCATCTTCCATGAAAAGATCGATTCTGTGAGTCCAGACTCGATCGATCAGCCGAGCGTT
GTCTTGCCTGCCAATGATATCGACTCGCAGCAACCCGAGGAACCGAATGCCAGCACTTCTTCAAAGGAAAACAGTTGCAACAACTATCCATCTTTATCAGCAATT
TCAAAGATGAACTTGGTGTTTCGTGTTTCTTCGTTGAGGAAACGTATAAACTCGTTAGAAACACAAACAACACTCTCGAGGACGGACTGTCTTTGGCTATTTGCG
TTAAGTGCAGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCTTCTTTCAGGAGTCTGCTTCGGAAATGTGCCAGTTTGCGGGCCGAGAAGTCCGAGCTCGAC
GACGAGGTGATAATGCTCAATATTCTAGCCACCATTTCAGGAAGATACTTTGGACAGTCGGAAAATTGAAGCTATGGTGAAGTTAATAGTATCGCTTAGGAGGAT
AAACGTAGATGACTCTGATTGTTCCTTTTGTTTTGTAAGAGATTCATAGATACATACATATATACATACATATATTTGATCATTTATAATTTAGTGGAACAATTA
ACTTATAAATTGATGGTTTTTAAT
Protein sequenceShow/hide protein sequence
MANGLSSGDGDGFSRKFSASEDHARSTFHPDDSVAKISDDEMKFPLIVSNPSLQCEVRMNSSSSASPEENAETSVEKMVVRDWISASSENGGNMGSLVDDARILD
VELGEESFKVDSVHDFEMIGAVEDGNQEVAMDEVEAKDFVTISVPSFDGNQDCAKKEIVQEVQFSTAMEADSKEAFERTEELLRKEADTESILEMKKKLLLEELE
ALLVPGEEILLEKDNCGKPMLIDEEKIAGQQNDSENTSVLRQSHLSLGNSLKIEVIDETAIVEPVHVSKIGNGEEIDIICPTRSMQINVSKSHEPERVGKKARRS
RRRAREAKISEVHWNLGNMNELDKKNAEGNKIVYSRKDMEALRFVNVSEQSRLWEAICKELMPVVAREYSSLTSSNYPMKTGSSSGPRQHFEKGEEASSFIRDGC
SESLDAEIEDMEGDNEITNFEFPKPSCGLSVSEDSEDDRYYNSIQRPAFLVEGEPNFESGPPEDGLEYLRRVRWEASHIPNVAVAKVDRSNFKKERSVYMPVIPA
IANCPQNLLPSKEWEDAFLADFSKLRQVLSCPEGLMQSDFIFHEKIDSVSPDSIDQPSVVLPANDIDSQQPEEPNASTSSKENSCNNYPSLSAISKMNLVFRVSS
LRKRINSLETQTTLSRTDCLWLFALSAAVDTPLDADTCASFRSLLRKCASLRAEKSELDDEVIMLNILATISGRYFGQSEN