; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020315 (gene) of Snake gourd v1 genome

Gene IDTan0020315
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationLG10:14094662..14102797
RNA-Seq ExpressionTan0020315
SyntenyTan0020315
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597639.1 Gem-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]6.1e-30778.68Show/hide
Query:  MADEISSDYGDGFSR--TASED-PQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK
        MA+ +SS  GDGFSR   ASED  + P  P  S +KISA+E KFPLI+SN + +C+V M+ + SASPE N ETSV++MVV D   ASSENGGNM SLVD+
Subjt:  MADEISSDYGDGFSR--TASED-PQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM
         + LDV+L +E  KVD+VHDFE IG+VEDG +E A+DEVE KD  + SV SFDGNQDCAKKE+ QEV+ S +MEAD+ EAF  T+ LLRKEA++ESILEM
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV
        KKKLLLEELEAMLVPGEEIHLEK           DNCGK ML+DEEKI+ QQNDSENTNVLR+SHLSL NSLKIEVIDETALVEPVHVSKIGN EEI I 
Subjt:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV

Query:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY
        CPTRSMQI V+KSHEPER GKK +RSRRR REAKI+EVHWN+GNVNELD     +KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AI KELMPVVAREY
Subjt:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY

Query:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED
        SSLT SNYPMK GSTS  RQH EK EEASS IR+GCSESLD EIEDMEGDNEI NF F +PSC LS+SEDS+DDKYYNSIQRPAFLVEGEPNFDSGPPED
Subjt:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED

Query:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV
        GLEYLRRVRWEASHIPNV+VAKVDRSNFKKERSVYMPVIPAI  CP++LLPSKEWED+FLADFS LRQ LSC EG MQSDFI HEKI+SV PD IDQPSV
Subjt:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPANDI+SQQPE+PNASTS+KENS N+YPSLSAI KMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD PLDADTCA+FRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGRYFGQSEN
        K+EL+DEVIMLNILATISGRYFGQSEN
Subjt:  KTELDDEVIMLNILATISGRYFGQSEN

KAG7029080.1 Gem-associated protein 2 [Cucurbita argyrosperma subsp. argyrosperma]3.0e-30678.54Show/hide
Query:  MADEISSDYGDGFSR--TASED-PQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK
        MA+ +SS  GDGFSR   ASED  + P  P  S +KISA+E KFPLI+SN + +C+V M+ + SASPE   ETSV++MVV D   ASSENGGNM SLVD+
Subjt:  MADEISSDYGDGFSR--TASED-PQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM
         + LDV+L +E  KVD+VHDFE IG+VEDG +E A+DEVE KD  + SV SFDGNQDCAKKE+ QEV+ S +MEAD+ EAF  T+ LLRKEA++ESILEM
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV
        KKKLLLEELEAMLVPGEEIHLEK           DNCGK ML+DEEKI+ Q+NDSENTNVLR+SHLSL NSLKIEVIDETALVEPVHVSKIGN EEI I 
Subjt:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV

Query:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY
        CPTRSMQI V+KSHEPER GKK +RSRRR REAKI+EVHWN+GNVNELD     +KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AI KELMPVVAREY
Subjt:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY

Query:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED
        SSLT SNYPMK GSTS  RQH EK EEASS IR+GCSESLD EIEDMEGDNEI NF F +PSC LS+SEDS+DDKYYNSIQRPAFLVEGEPNFDSGPPED
Subjt:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED

Query:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV
        GLEYLRRVRWEASHIPNV VAKVDRSNFKKERSVYMPVIPAI  CP++LLPSKEWED+FLADFS LRQ LSC EG MQSDFI HEKI+SV PD IDQPSV
Subjt:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPANDI+SQQPE+PNASTS+KENS N+YPSLSAI KMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD PLDADTCA+FRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGRYFGQSEN
        K+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  KTELDDEVIMLNILATISGRYFGQSEN

XP_022932760.1 uncharacterized protein LOC111439213 [Cucurbita moschata]2.5e-30578.4Show/hide
Query:  MADEISSDYGDGFSR--TASE-DPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK
        MA+ +SS  GDGFSR  +ASE   +SP  P         DE KFPLI+SN + +C+V M+ + SASPE N ETSV++MVV D   ASSENGGNM SLVD+
Subjt:  MADEISSDYGDGFSR--TASE-DPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM
         + LDVEL +E  KVD+VHDFE IGAVEDG +E A+DEVE KD  + SV SFDGNQDCAKKE+ QEV+ S +MEAD+ EAF  T+ LLRKEA++ESILEM
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV
        KKKLLLEELEAMLVPGEEIHLEK           DNCGK MLIDEEKI  QQNDSENT+VLR+SHLSL NSLKIEVIDETALVEPVHVSKIGN EEI I+
Subjt:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV

Query:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY
        CPTRSMQI V+KSHEPER GKK +RSRRR REAKI+EVHWN+GNVNELD     +KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AICKELMPVVAREY
Subjt:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY

Query:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED
        SSLT SNYPMK GSTS  RQH EK EEASS IR+GCSESLD EIEDMEGDNEI NF F +PSC LS+SEDS+DD+YYNSIQRPAFLVEGEPNF+SGPPED
Subjt:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED

Query:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV
        GLEYLRRVRWEASHIPNV VAKVDRSNFKKERSVYMPVIPAI  CP++LLPSKEWED+FLADFSKLRQ LSC EG MQSDFI HEKIDSV PD IDQPS+
Subjt:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPAN+I+SQQPE+PNASTS+KENS N+YPSLSAI KMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD PLDADTCA+FRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGRYFGQSEN
        K+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  KTELDDEVIMLNILATISGRYFGQSEN

XP_023540169.1 uncharacterized protein LOC111800622 [Cucurbita pepo subsp. pepo]1.3e-30478.4Show/hide
Query:  MADEISSDYGDGFSR--TASED-PQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK
        MA+ +SS  GDGFSR  +ASED  +SP  P         DE KFPLI+SN + +C+V M+ + SASPE N ETSV++MVV D    SSENGGNM SLVD+
Subjt:  MADEISSDYGDGFSR--TASED-PQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM
         + LDVEL +E  KVD+VHDFE IGAVEDG +E A+DEVE KD  +  V SFDGNQDCAKKE+ QEV+ S +MEAD+ EAF  T+ LLRKEA++ESILEM
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV
        KKKLLLEELEAMLVPGEEIHLEK           DNCGK ML+DEEKI+ QQNDSENTNVLR+SHLSL NSLKIEVIDETALVEPVHVSKIGN EEI I 
Subjt:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV

Query:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY
        CPTRSMQI V+KSHEPER GKK +RSRRR REAKI+EVHWN+GNVNELD     +KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AICKELMPVVAREY
Subjt:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY

Query:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED
        SSLT SNYPMK GSTS  RQH EK EEASS IR+GCSESLD EIEDMEGDNEI N  F +PSC LS+SEDS+DDKYYNSIQRPAFLVEGEPNFDSGPPED
Subjt:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED

Query:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV
        GLEYLRRVRWEASHIPNV VAKVDRSNFKKERSVYMPVIPAI  CP++LLPSKEWED+FLADFSKLRQ LSC EG MQSDFI HEKIDSV PD IDQPSV
Subjt:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        +LPANDI+SQ PE+PNASTS+KENS N+YPSLSAI KMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD PLDADTCA+FRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGRYFGQSEN
        K+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  KTELDDEVIMLNILATISGRYFGQSEN

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]0.0e+0079.75Show/hide
Query:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK
        MADEISSDYGDGF+   + SE+ QS  +PI SA KISAD+K FPLI+SNQNQ+ +V ++ A SAS + NPETSV     + SAC SSENGGNM SL V K
Subjt:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK
        IQNLDVEL KE LKVD+VHDFET+ AVEDGK++ A+DEV EKD +RSVLSFDGN DC+K+EL QEV+L+A       EAFA T+ LL+KE + ESILE+K
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK

Query:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC
        KKLLLEEL+AMLVPG++IHLEKGNNPPSS G++D+C K +LIDEEKI D+QNDSE  NVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGN E IGIVC
Subjt:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC

Query:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS
        P RSMQ+KVNKSHEP+RGGKK KRSRR+ REAK++E++WN+GNVNELD VN RQK AEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKEL+P VAREYS
Subjt:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS

Query:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG
        SLT SNYPMKIGSTSD RQ   KREEASSIIREGCSESLDGEIEDMEGDNE  NFV  EPSCS S+SED D+DKYY SIQRPAFLVEGEPNFDSGPPEDG
Subjt:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG

Query:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVV
        LEYLRRVRWEASHIPNVT+AKVDRSNFKKE+SVYMPVIP I KCP+HLLPSKEWE++FLADFS LR+ALS SE F QSDFILHEKIDS IPDLI QP  V
Subjt:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVV

Query:  LPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEK
        LPA +I+S Q E+ N STSAKENSCNDYPSLSAI KMNSVFRVSSL+KRINSLETQTTLS+TDCLWLFALSAAVD PLDADTCAAFRSLLRKCASLRA+K
Subjt:  LPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEK

Query:  TELDDEVIMLNILATISGRYFGQSEN
        TELDDEVIMLNIL+TISGRYFGQSEN
Subjt:  TELDDEVIMLNILATISGRYFGQSEN

TrEMBL top hitse value%identityAlignment
A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948751.6e-30076.89Show/hide
Query:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK
        MADEI+SDY DGF+    +SE+PQSP +P+ SA  ISAD   FPLI+SN+N +C+V ++   SASP+ NPE+SV +MV+ DSAC SSENGG+M SL V K
Subjt:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK
        IQNLDVEL KE LKVD+VHDFET+   ED K+E AVDEV+ KD +RSVLSFDGNQDCAK+EL QE +L+A       EAFA T+ LL+KE +SESILEMK
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK

Query:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC
        KKLLLE+++AMLVPG+EIHL++G+NPPSSGG +D C K ML+DEEKI DQQNDSE  NVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGN + IGIVC
Subjt:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC

Query:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS
        PTRSMQ++V KSHEP++GGKK  +SRR+ RE K++E+HWN+ NVNE+D V+ RQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKEL+PVVAREYS
Subjt:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS

Query:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG
        SLT     +K GSTSD RQ   KREEASSIIREGCSESLDGEIEDMEGDNEI NFV SEPSCSL  S+DSDDDKYY+SIQRPAFLVEGEPNFDSGPPEDG
Subjt:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG

Query:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCS-EGFMQSDFILHEKIDSVIPDLIDQPSV
        LEYLRRVRWEASHIPNVTVAKVDRSNFKKE+SVYMPVIPAI +CPEHLLPSKEWE++FLADFSKLRQALS S E  M+SDFILHEKID ++P+LI QPS 
Subjt:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCS-EGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPA+D +  QPE+ N STSAKE SCNDYPSLSAI KMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVD PLD DTCAAFRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGRYFGQSEN
        KTE+DDEVIMLNIL+TISGRYFGQSEN
Subjt:  KTELDDEVIMLNILATISGRYFGQSEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X18.0e-29776.81Show/hide
Query:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK
        MADEI+SDY DGF+    +SE+PQSP +P+ SA  ISAD   FPLI+SN+N +C+V ++   SASP+ NPE+SV +MV+ DSAC SSENGG+M SL V K
Subjt:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK
        IQNLDVEL KE LKVD+VHDFET+   ED K+E AVDEV+ KD +RSVLSFDGNQDCAK+EL QE +L+A       EAFA T+ LL+KE +SESILEMK
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK

Query:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC
        KKLLLE+++AMLVPG+EIHL++G+NPPSSGG +D C K ML+DEEKI DQQNDSE  NVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGN + IGIVC
Subjt:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC

Query:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS
        PTRSMQ++V KSHEP++GGKK K+SRR+ RE K++E+HWN+ NVNE+D V+ RQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKEL+PVVAREYS
Subjt:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS

Query:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG
        SLT     +K GSTSD RQ   KREEASSIIREGCSESLDGEIEDMEGDNEI NFV SEPSCSL  S+DSDDDKYY+SIQRPAFLVEGEPNFDSGPPEDG
Subjt:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG

Query:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCS-EGFMQSDFILHEKIDSVIPDLIDQPSV
        LEYLRRVRWEASHIPNVTVAKVDRSNFKKE+SVYMPVIPAI +CPEHLLPSKEWE++FLADFSKLRQALS S E  M+SDFILHEKID ++P+LI QPS 
Subjt:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCS-EGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPA+D +  QPE+ N STSAKE SCNDYPSLSAI KMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVD PLD DTCAAFRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGR
        KTE+DDEVIMLNIL+TISGR
Subjt:  KTELDDEVIMLNILATISGR

A0A5D3CZJ0 Mis18-binding protein 1-like isoform X18.0e-29776.81Show/hide
Query:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK
        MADEI+SDY DGF+    +SE+PQSP +P+ SA  ISAD   FPLI+SN+N +C+V ++   SASP+ NPE+SV +MV+ DSAC SSENGG+M SL V K
Subjt:  MADEISSDYGDGFSR--TASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSL-VDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK
        IQNLDVEL KE LKVD+VHDFET+   ED K+E AVDEV+ KD +RSVLSFDGNQDCAK+EL QE +L+A       EAFA T+ LL+KE +SESILEMK
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMK

Query:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC
        KKLLLE+++AMLVPG+EIHL++G+NPPSSGG +D C K ML+DEEKI DQQNDSE  NVLRRSHLSLRNSLKIEVIDETALVEPVHVS+IGN + IGIVC
Subjt:  KKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVC

Query:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS
        PTRSMQ++V KSHEP++GGKK K+SRR+ RE K++E+HWN+ NVNE+D V+ RQ+NAEGNKI+YSRKDMEALRFVNVAEQ+RLWKAICKEL+PVVAREYS
Subjt:  PTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYS

Query:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG
        SLT     +K GSTSD RQ   KREEASSIIREGCSESLDGEIEDMEGDNEI NFV SEPSCSL  S+DSDDDKYY+SIQRPAFLVEGEPNFDSGPPEDG
Subjt:  SLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDG

Query:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCS-EGFMQSDFILHEKIDSVIPDLIDQPSV
        LEYLRRVRWEASHIPNVTVAKVDRSNFKKE+SVYMPVIPAI +CPEHLLPSKEWE++FLADFSKLRQALS S E  M+SDFILHEKID ++P+LI QPS 
Subjt:  LEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCS-EGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPA+D +  QPE+ N STSAKE SCNDYPSLSAI KMN +FRVSSLRKRINS ETQTTLSR DCLWLFALSAAVD PLD DTCAAFRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGR
        KTE+DDEVIMLNIL+TISGR
Subjt:  KTELDDEVIMLNILATISGR

A0A6J1F307 uncharacterized protein LOC1114392131.2e-30578.4Show/hide
Query:  MADEISSDYGDGFSR--TASE-DPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK
        MA+ +SS  GDGFSR  +ASE   +SP  P         DE KFPLI+SN + +C+V M+ + SASPE N ETSV++MVV D   ASSENGGNM SLVD+
Subjt:  MADEISSDYGDGFSR--TASE-DPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDK

Query:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM
         + LDVEL +E  KVD+VHDFE IGAVEDG +E A+DEVE KD  + SV SFDGNQDCAKKE+ QEV+ S +MEAD+ EAF  T+ LLRKEA++ESILEM
Subjt:  IQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEKD-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEM

Query:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV
        KKKLLLEELEAMLVPGEEIHLEK           DNCGK MLIDEEKI  QQNDSENT+VLR+SHLSL NSLKIEVIDETALVEPVHVSKIGN EEI I+
Subjt:  KKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIV

Query:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY
        CPTRSMQI V+KSHEPER GKK +RSRRR REAKI+EVHWN+GNVNELD     +KNAEG+KIVYSRKDMEALRFVNV+EQ RLW+AICKELMPVVAREY
Subjt:  CPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREY

Query:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED
        SSLT SNYPMK GSTS  RQH EK EEASS IR+GCSESLD EIEDMEGDNEI NF F +PSC LS+SEDS+DD+YYNSIQRPAFLVEGEPNF+SGPPED
Subjt:  SSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPED

Query:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV
        GLEYLRRVRWEASHIPNV VAKVDRSNFKKERSVYMPVIPAI  CP++LLPSKEWED+FLADFSKLRQ LSC EG MQSDFI HEKIDSV PD IDQPS+
Subjt:  GLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSV

Query:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE
        VLPAN+I+SQQPE+PNASTS+KENS N+YPSLSAI KMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD PLDADTCA+FRSLLRKCASLRAE
Subjt:  VLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAE

Query:  KTELDDEVIMLNILATISGRYFGQSEN
        K+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  KTELDDEVIMLNILATISGRYFGQSEN

A0A6J1IAH4 uncharacterized protein LOC1114707334.9e-30280.58Show/hide
Query:  KFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDKIQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEK
        KFPLI+SN + +C+V M+ + SASPE N ETSV++MVVRD   ASSENGGNM SLVD  + LDVEL +E  KVDSVHDFE IGAVEDG +E A+DEVE K
Subjt:  KFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDKIQNLDVELEKELLKVDSVHDFETIGAVEDGKREFAVDEVEEK

Query:  D-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMML
        D  + SV SFDGNQDCAKKE+ QEV+ S +MEAD+ EAF  T+ LLRKEA++ESILEMKKKLLLEELEA+LVPGEEI LEK           DNCGK ML
Subjt:  D-SSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGTMDNCGKMML

Query:  IDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNV
        IDEEKI  QQNDSENT+VLR+SHLSL NSLKIEVIDETA+VEPVHVSKIGN EEI I+CPTRSMQI V+KSHEPER GKK +RSRRR REAKI+EVHWN+
Subjt:  IDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNV

Query:  GNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDG
        GN+NELD     +KNAEGNKIVYSRKDMEALRFVNV+EQ RLW+AICKELMPVVAREYSSLT SNYPMK GS+S  RQH EK EEASS IR+GCSESLD 
Subjt:  GNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDG

Query:  EIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAI
        EIEDMEGDNEI NF F +PSC LS+SEDS+DD+YYNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEASHIPNV VAKVDRSNFKKERSVYMPVIPAI
Subjt:  EIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAI

Query:  VKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVF
          CP++LLPSKEWED+FLADFSKLRQ LSC EG MQSDFI HEKIDSV PD IDQPSVVLPANDI+SQQPE+PNASTS+KENSCN+YPSLSAI KMN VF
Subjt:  VKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVF

Query:  RVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATISGRYFGQSEN
        RVSSLRKRINSLETQTTLSRTDCLWLFALSAAVD PLDADTCA+FRSLLRKCASLRAEK+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  RVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATISGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 22.5e-1329.12Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      PE   P+ +W+   +A FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDF

Query:  ILHEKIDSVIPDLIDQPSVVLPANDIESQQP-----EKPNA----STSAKENSCNDY------PSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCL
           +++DS         +V +P ++ E         EK  A      +  E+   DY      P LS + +MN    V+S+ + +++   +   +     
Subjt:  ILHEKIDSVIPDLIDQPSVVLPANDIESQQP-----EKPNA----STSAKENSCNDY------PSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCL

Query:  WLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATISGRYFGQSE
        WL+AL A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  WLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATISGRYFGQSE

O42260 Gem-associated protein 21.3e-1427.71Show/hide
Query:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDL
        S PP    EYLRRV+ EA+  P+V +A++D    +K+++V +  +      P+   PS  W+   +A FS +RQ+L    G  +S     + +DS     
Subjt:  SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDL

Query:  IDQPSVVLPANDIESQQP-----EKPNASTSAKENSCNDYPS-----------LSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMP
            +V +P+ + E         E+  +  +A  NS + +P            LS + +M+     S L   +N  E +         WL+AL A ++ P
Subjt:  IDQPSVVLPANDIESQQP-----EKPNASTSAKENSCNDYPS-----------LSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMP

Query:  LDADTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILATISGRYFGQSE
        L  +  +  R L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  LDADTCAAFRSLLRKCASLRA-EKTELDDEVIMLNILATISGRYFGQSE

Q54KN2 Gem-associated protein 24.2e-1623.46Show/hide
Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK--ERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEG
        +  Q  AF V  E   D   P  G EYL+RV+W ++  P+V VA +D S  K     + Y  + P+I KC + LLP+  WE  FL DFS+ RQ L   + 
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKK--ERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEG

Query:  FMQS------------------------------------------------------------------------DFILHEKIDSVIPDLIDQPSVVLP
           S                                                                        D+  +E  +    +  ++      
Subjt:  FMQS------------------------------------------------------------------------DFILHEKIDSVIPDLIDQPSVVLP

Query:  ANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTE
          + E ++ E+     S K+ +  + P++  + +++ V  V+ +   I  LE +   ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T 
Subjt:  ANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTE

Query:  LDD-EVIMLNILATISGRYFGQSE
        L+D  +  +NIL TI  +YF Q E
Subjt:  LDD-EVIMLNILATISGRYFGQSE

Q9CQQ4 Gem-associated protein 23.3e-1327.59Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      PE   P+ +W+   +A FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDF

Query:  ILHEKIDSVIPDLIDQPSVVLPANDIESQQPE---------KPNASTSAKENSCNDY------PSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCL
           +++DS         +V +P ++ E    +         +     S +E+   DY      P LS + +MN    ++S+ + +++   +   +     
Subjt:  ILHEKIDSVIPDLIDQPSVVLPANDIESQQPE---------KPNASTSAKENSCNDY------PSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCL

Query:  WLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATISGRYFGQSE
        W +AL A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  WLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATISGRYFGQSE

Q9QZP1 Gem-associated protein 21.6e-1227.2Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDF
        P  L EG   FD S PP    EYLRRV+ EA+  P+V VA++D    K+++SV +  +      PE   P+ +W+   +  FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDF

Query:  ILHEKIDSVIPDLIDQPSVVLPANDIESQQPE---------KPNASTSAKENSCNDY------PSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCL
           +++DS         +V +P ++ E    +         +     S  E+   DY      P LS + +MN    ++S+ + +++   +   +     
Subjt:  ILHEKIDSVIPDLIDQPSVVLPANDIESQQPE---------KPNASTSAKENSCNDY------PSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTDCL

Query:  WLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATISGRYFGQSE
        W +AL A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  WLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATISGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related3.1e-6734.06Show/hide
Query:  SRSVLSFDG--NQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMKKKLLLEELEAMLVPG---EEIHLEKGNNPPSSGGTMDNCGKM
        +++ +S DG   +   KK+L+ E+   AS  +  ++A     G +    N+  + E+K+    +  E   V G   E ++ E+     S    +    K 
Subjt:  SRSVLSFDG--NQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMKKKLLLEELEAMLVPG---EEIHLEKGNNPPSSGGTMDNCGKM

Query:  MLIDEEKITDQQNDSENTNVLRRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVH
         L+ E ++      S + + L +   ++ N + KIE++D TALV+ V                           H P+R G   +  +  PR+ K     
Subjt:  MLIDEEKITDQQNDSENTNVLRRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVH

Query:  WNVG---NVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLT-GSNYPMKIGSTSDHRQHSEKREEASSIIREG
          +G   NV E  GV   + N +  + +Y+RK +E++RF ++  Q+ LW  +   ++P V  EY SL    NY             S K          G
Subjt:  WNVG---NVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLT-GSNYPMKIGSTSDHRQHSEKREEASSIIREG

Query:  CSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKERSV
          E+L  E    EG  + +++           ++D+DD   YNSI RPAF V+GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKE+SV
Subjt:  CSESLDGEIEDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNF-KKERSV

Query:  YMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSA
        YMP+IP I KCPE+LLP KEWEDS L DF  LRQ L+ S    + + I  + ++ ++ ++ ++         + +++ E                  ++ 
Subjt:  YMPVIPAIVKCPEHLLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSA

Query:  ILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILATISGRYFGQ
        I  M+SV RVS L+KRI  +E ++ L  +DC W+ AL A+++ PLDADTCA  R LLRKCAS+RAE + E+ DE  + M N+L TI+GRYFGQ
Subjt:  ILKMNSVFRVSSLRKRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILATISGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown3.6e-2371.62Show/hide
Query:  SEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KERSVYMPVIP
        +ED+DD   YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KE+SVYMP IP
Subjt:  SEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KERSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown5.5e-3229.49Show/hide
Query:  DQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELD
        DQ+   E   V  + +  +  S+ I+++D+TAL + V   K G               +  +   +  R  KKV   +   +    + V  N  +    D
Subjt:  DQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAKIAEVHWNVGNVNELD

Query:  GVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEG
          N  + N +  +I+YSR  ME++R+ ++A Q++LW  +   L+P +  EY                                 EG   ++    +D   
Subjt:  GVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEIEDMEG

Query:  DNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KERSVYMPVIPAIVKCPEH
              +V  E       +ED+DD   YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA  IPNV VAKV  S ++ KE+SVYMP IP      E 
Subjt:  DNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFK-KERSVYMPVIPAIVKCPEH

Query:  LLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLR
        +          L  F  L    S    F Q  ++                SV L    +  +      + T  K    + +                   
Subjt:  LLPSKEWEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLR

Query:  KRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRA
              E ++ L  +DC W+ AL A+VD P DADT A  R+L+RKCASLRA
Subjt:  KRINSLETQTTLSRTDCLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATGAGATAAGTTCCGACTATGGCGATGGGTTCAGTCGAACAGCATCTGAGGACCCTCAATCTCCTTCTCAACCGATCATTTCTGCCTCTAAGATCTCTGCCGA
CGAGAAGAAGTTCCCTTTGATCCTCTCGAATCAAAACCAGGAGTGTGACGTAATGATGGACGGTGCGGGTTCTGCTTCTCCAGAAAATAACCCAGAAACTTCTGTCAAAG
AGATGGTCGTGCGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGGAAACATGAGAAGTCTTGTGGACAAGATTCAGAATCTTGATGTGGAGCTCGAAAAAGAACTTCTC
AAGGTGGATTCTGTCCACGATTTTGAAACGATAGGTGCCGTGGAAGATGGTAAGCGAGAATTTGCGGTGGATGAAGTAGAAGAGAAAGATTCTTCAAGAAGTGTACTGAG
TTTTGATGGAAATCAAGATTGTGCGAAGAAAGAACTTGATCAAGAAGTACGGTTGTCTGCTTCTATGGAAGCCGATGCCAGTGAAGCCTTTGCGGGAACTCAGGGTTTGT
TGAGGAAAGAAGCTAATTCTGAGAGCATTCTGGAAATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAGAAGAAATTCATCTAGAGAAGGGA
AATAATCCTCCTAGCTCAGGAGGGACCATGGATAATTGCGGCAAAATGATGCTTATTGATGAGGAGAAGATTACTGATCAGCAAAATGATTCCGAGAACACGAATGTTCT
TAGACGAAGCCATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTCGAACCGGTTCATGTTTCCAAAATTGGAAATAGAGAAGAGATTG
GTATAGTTTGTCCAACAAGGTCAATGCAGATCAAGGTGAACAAATCCCATGAACCCGAAAGAGGGGGGAAAAAGGTTAAAAGATCGAGGAGGAGGCCAAGGGAAGCGAAG
ATTGCTGAGGTGCATTGGAATGTGGGGAATGTGAATGAACTTGATGGAGTCAATCGACGTCAGAAAAATGCGGAGGGAAATAAGATAGTGTATTCGAGGAAGGATATGGA
AGCACTGAGGTTTGTAAATGTTGCAGAGCAAAGGAGATTGTGGAAAGCTATATGCAAGGAACTTATGCCCGTTGTGGCAAGGGAATACAGTAGCTTAACAGGCTCGAATT
ACCCAATGAAGATAGGTTCCACCTCTGATCATAGGCAGCATTCGGAGAAGAGGGAAGAAGCGTCTTCAATTATAAGGGAAGGATGTTCGGAAAGCTTGGATGGTGAGATA
GAGGACATGGAAGGCGATAATGAAATTAAGAATTTTGTATTCTCGGAACCCTCTTGCAGTCTTAGTATCAGCGAAGATAGCGATGATGATAAATATTACAACAGTATACA
GAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGATTAGAGTATCTTAGACGTGTCAGGTGGGAAGCTTCCCATATCCCAAATG
TGACTGTGGCTAAAGTTGATAGAAGTAATTTTAAGAAAGAGCGAAGTGTTTATATGCCAGTGATTCCTGCAATTGTCAAGTGCCCCGAGCATTTACTGCCTTCAAAAGAG
TGGGAGGATTCTTTTCTTGCTGATTTTTCTAAGCTGCGACAGGCTCTGTCATGCTCCGAGGGATTTATGCAGTCTGATTTCATCCTCCATGAAAAGATCGATTCTGTAAT
TCCGGACTTGATTGATCAGCCAAGTGTTGTCTTGCCTGCTAACGACATTGAATCACAGCAACCTGAGAAACCGAATGCCAGCACTTCTGCAAAGGAAAACAGTTGCAACG
ATTATCCATCTTTATCAGCAATCTTAAAGATGAACTCTGTGTTCCGTGTTTCGTCGTTGAGGAAACGTATAAACTCGTTAGAAACACAGACAACACTGTCAAGGACCGAT
TGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATATGCCTCTGGATGCAGATACTTGTGCTGCTTTCAGGAGTCTGCTTCGGAAATGTGCCAGCTTGCGGGCCGAGAA
GACCGAGCTCGACGATGAGGTGATAATGCTCAATATTCTTGCCACCATTTCTGGAAGGTACTTTGGACAGTCGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
CGCTGCTAAAACCCTTCTGCTTCTCCTCGAGCAATGGCGGATGAGATAAGTTCCGACTATGGCGATGGGTTCAGTCGAACAGCATCTGAGGACCCTCAATCTCCTTCTCA
ACCGATCATTTCTGCCTCTAAGATCTCTGCCGACGAGAAGAAGTTCCCTTTGATCCTCTCGAATCAAAACCAGGAGTGTGACGTAATGATGGACGGTGCGGGTTCTGCTT
CTCCAGAAAATAACCCAGAAACTTCTGTCAAAGAGATGGTCGTGCGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGGAAACATGAGAAGTCTTGTGGACAAGATTCAG
AATCTTGATGTGGAGCTCGAAAAAGAACTTCTCAAGGTGGATTCTGTCCACGATTTTGAAACGATAGGTGCCGTGGAAGATGGTAAGCGAGAATTTGCGGTGGATGAAGT
AGAAGAGAAAGATTCTTCAAGAAGTGTACTGAGTTTTGATGGAAATCAAGATTGTGCGAAGAAAGAACTTGATCAAGAAGTACGGTTGTCTGCTTCTATGGAAGCCGATG
CCAGTGAAGCCTTTGCGGGAACTCAGGGTTTGTTGAGGAAAGAAGCTAATTCTGAGAGCATTCTGGAAATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTG
GTTCCTGGAGAAGAAATTCATCTAGAGAAGGGAAATAATCCTCCTAGCTCAGGAGGGACCATGGATAATTGCGGCAAAATGATGCTTATTGATGAGGAGAAGATTACTGA
TCAGCAAAATGATTCCGAGAACACGAATGTTCTTAGACGAAGCCATTTGTCTCTCAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTCGAACCGGTTC
ATGTTTCCAAAATTGGAAATAGAGAAGAGATTGGTATAGTTTGTCCAACAAGGTCAATGCAGATCAAGGTGAACAAATCCCATGAACCCGAAAGAGGGGGGAAAAAGGTT
AAAAGATCGAGGAGGAGGCCAAGGGAAGCGAAGATTGCTGAGGTGCATTGGAATGTGGGGAATGTGAATGAACTTGATGGAGTCAATCGACGTCAGAAAAATGCGGAGGG
AAATAAGATAGTGTATTCGAGGAAGGATATGGAAGCACTGAGGTTTGTAAATGTTGCAGAGCAAAGGAGATTGTGGAAAGCTATATGCAAGGAACTTATGCCCGTTGTGG
CAAGGGAATACAGTAGCTTAACAGGCTCGAATTACCCAATGAAGATAGGTTCCACCTCTGATCATAGGCAGCATTCGGAGAAGAGGGAAGAAGCGTCTTCAATTATAAGG
GAAGGATGTTCGGAAAGCTTGGATGGTGAGATAGAGGACATGGAAGGCGATAATGAAATTAAGAATTTTGTATTCTCGGAACCCTCTTGCAGTCTTAGTATCAGCGAAGA
TAGCGATGATGATAAATATTACAACAGTATACAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGATTAGAGTATCTTAGAC
GTGTCAGGTGGGAAGCTTCCCATATCCCAAATGTGACTGTGGCTAAAGTTGATAGAAGTAATTTTAAGAAAGAGCGAAGTGTTTATATGCCAGTGATTCCTGCAATTGTC
AAGTGCCCCGAGCATTTACTGCCTTCAAAAGAGTGGGAGGATTCTTTTCTTGCTGATTTTTCTAAGCTGCGACAGGCTCTGTCATGCTCCGAGGGATTTATGCAGTCTGA
TTTCATCCTCCATGAAAAGATCGATTCTGTAATTCCGGACTTGATTGATCAGCCAAGTGTTGTCTTGCCTGCTAACGACATTGAATCACAGCAACCTGAGAAACCGAATG
CCAGCACTTCTGCAAAGGAAAACAGTTGCAACGATTATCCATCTTTATCAGCAATCTTAAAGATGAACTCTGTGTTCCGTGTTTCGTCGTTGAGGAAACGTATAAACTCG
TTAGAAACACAGACAACACTGTCAAGGACCGATTGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATATGCCTCTGGATGCAGATACTTGTGCTGCTTTCAGGAGTCT
GCTTCGGAAATGTGCCAGCTTGCGGGCCGAGAAGACCGAGCTCGACGATGAGGTGATAATGCTCAATATTCTTGCCACCATTTCTGGAAGGTACTTTGGACAGTCGGAAA
ATTGAAGCTATGGTGTGATGATGGCTGCCTCATTTTCTTTTTTCTCCCATGTTATGTAGTATCTGTTAAGAGGATAATGTTCACTACTGAATTTGAAACGTAGATGAGTC
TGATTGACAATAAAACTTGTAAATTAGTTCCTTTTGTTTTTGTGAAGAATCGAAGTGTGTTAATGATTTTTTTCCTTTTTTGTGGATATGTTATCAATCTATATTTGATG
TTTTAGTGAAA
Protein sequenceShow/hide protein sequence
MADEISSDYGDGFSRTASEDPQSPSQPIISASKISADEKKFPLILSNQNQECDVMMDGAGSASPENNPETSVKEMVVRDSACASSENGGNMRSLVDKIQNLDVELEKELL
KVDSVHDFETIGAVEDGKREFAVDEVEEKDSSRSVLSFDGNQDCAKKELDQEVRLSASMEADASEAFAGTQGLLRKEANSESILEMKKKLLLEELEAMLVPGEEIHLEKG
NNPPSSGGTMDNCGKMMLIDEEKITDQQNDSENTNVLRRSHLSLRNSLKIEVIDETALVEPVHVSKIGNREEIGIVCPTRSMQIKVNKSHEPERGGKKVKRSRRRPREAK
IAEVHWNVGNVNELDGVNRRQKNAEGNKIVYSRKDMEALRFVNVAEQRRLWKAICKELMPVVAREYSSLTGSNYPMKIGSTSDHRQHSEKREEASSIIREGCSESLDGEI
EDMEGDNEIKNFVFSEPSCSLSISEDSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASHIPNVTVAKVDRSNFKKERSVYMPVIPAIVKCPEHLLPSKE
WEDSFLADFSKLRQALSCSEGFMQSDFILHEKIDSVIPDLIDQPSVVLPANDIESQQPEKPNASTSAKENSCNDYPSLSAILKMNSVFRVSSLRKRINSLETQTTLSRTD
CLWLFALSAAVDMPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATISGRYFGQSEN