; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005270 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005270
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationscaffold11:34465508..34472863
RNA-Seq ExpressionSpg005270
SyntenySpg005270
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597639.1 Gem-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]8.3e-28975.65Show/hide
Query:  DEISSEFSASED-PQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE
        D  S +F ASED  + P  P  S +KISA+E K PLIVSN   +C+V +NS+SSAS EEN   +ETSV KMVVCD   ASSENGGN+ SLV + + LDV+
Subjt:  DEISSEFSASED-PQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE

Query:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQSCTKKEPVQEVRLAAAMEADGN---------LKKEADSESILEMKKKLLLE
        L +ESFKVD VH FE IG+VEDGNQEVAMDEVE +D    SV SFDGNQ C KKE VQEV+ + AMEAD           L+KEAD+ESILEMKKKLLLE
Subjt:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQSCTKKEPVQEVRLAAAMEADGN---------LKKEADSESILEMKKKLLLE

Query:  ELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQ
        ELEAMLVPGEEIHLEK           DNCGKTM+ DEEKI+ QQN+SEN NVL +SHLSL NSLKIEVIDETALVEPVHVSKIGNGEEI I CPTRSMQ
Subjt:  ELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQ

Query:  IKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSN
        I V+KSHEPER GKKA+RSRRRAREAKI E+HWNLGNVNELDK     KNA G+KIVYSRKDMEALRFVNVSEQ RLW+AI  ELMP+VAREYSSL  SN
Subjt:  IKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSN

Query:  YPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSG
        YPMK GSTS PRQH EK EEASS IR+GCSESLD EI+D+EGDNE  NF F + SC LSV           SE S+DDKYYNSIQRPAFLVEGEPNFDSG
Subjt:  YPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSG

Query:  PPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLID
        PPEDGLEYLRRVRWEAS IPNV+VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFS LRQ LSC +G MQSDFI HEK +SV  D ID
Subjt:  PPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLID

Query:  QPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCAS
        QPSVVLPAND+DSQQ EEPNASTS+KENS N+YPSLS ISKMNSVFRVSSLRKRIN LETQ +LSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCAS
Subjt:  QPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCAS

Query:  LRAEKTELDDEVIMLNILATICGRYFGQSEN
        LRAEK+EL+DEVIMLNILATI GRYFGQSEN
Subjt:  LRAEKTELDDEVIMLNILATICGRYFGQSEN

KAG7029080.1 Gem-associated protein 2 [Cucurbita argyrosperma subsp. argyrosperma]4.1e-28875.51Show/hide
Query:  DEISSEFSASED-PQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE
        D  S +F ASED  + P  P  S +KISA+E K PLIVSN   +C+V +NS+SSAS EE    +ETSV KMVVCD   ASSENGGN+ SLV + + LDV+
Subjt:  DEISSEFSASED-PQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE

Query:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQSCTKKEPVQEVRLAAAMEADGN---------LKKEADSESILEMKKKLLLE
        L +ESFKVD VH FE IG+VEDGNQEVAMDEVE +D    SV SFDGNQ C KKE VQEV+ + AMEAD           L+KEAD+ESILEMKKKLLLE
Subjt:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQSCTKKEPVQEVRLAAAMEADGN---------LKKEADSESILEMKKKLLLE

Query:  ELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQ
        ELEAMLVPGEEIHLEK           DNCGKTM+ DEEKI+ Q+N+SEN NVL +SHLSL NSLKIEVIDETALVEPVHVSKIGNGEEI I CPTRSMQ
Subjt:  ELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQ

Query:  IKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSN
        I V+KSHEPER GKKA+RSRRRAREAKI E+HWNLGNVNELDK     KNA G+KIVYSRKDMEALRFVNVSEQ RLW+AI  ELMP+VAREYSSL  SN
Subjt:  IKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSN

Query:  YPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSG
        YPMK GSTS PRQH EK EEASS IR+GCSESLD EI+D+EGDNE  NF F + SC LSV           SE S+DDKYYNSIQRPAFLVEGEPNFDSG
Subjt:  YPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSG

Query:  PPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLID
        PPEDGLEYLRRVRWEAS IPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFS LRQ LSC +G MQSDFI HEK +SV  D ID
Subjt:  PPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLID

Query:  QPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCAS
        QPSVVLPAND+DSQQ EEPNASTS+KENS N+YPSLS ISKMNSVFRVSSLRKRIN LETQ +LSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCAS
Subjt:  QPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCAS

Query:  LRAEKTELDDEVIMLNILATICGRYFGQSEN
        LRAEK+ELDDEVIMLNILATI GRYFGQSEN
Subjt:  LRAEKTELDDEVIMLNILATICGRYFGQSEN

XP_008454478.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103494875 [Cucumis melo]1.2e-28774.32Show/hide
Query:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-
        MADEI+S+        F +SE+PQSP +P+ SA  ISAD    PLIVSN++ +C+V IN+ +SAS +EN    E+SV KMV+CDSAC SSENGG++ SL 
Subjt:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK
        VGKIQNLDVEL KES KVD VH FET+   ED  QEVA+DEV+ +D ARSVLSFDGNQ C K+E VQE +LAA  EA    +  LKKE DSESILEMKKK
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK

Query:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT
        LLLE+++AMLVPG+EIHL++G NPPSSGGIVD C KTM+ DEEKIA QQN+SE MNVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+ IGIVCPT
Subjt:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT

Query:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL
        RSMQ++V KSHEP++GGKK  +SRR+ARE K+ EMHWN+ NVNE+DKV+GRQ+NA GNKI+YSRKDMEALRFVNV+EQKRLWKAIC EL+P+VAREYSSL
Subjt:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL

Query:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN
              +K GSTS+PRQ L KREEASSIIREGCSESLDGEI+D+EGDNE  NFV SE SCSL             S+ SDDDKYY+SIQRPAFLVEGEPN
Subjt:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN

Query:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSVI
        FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSKLRQALS S+   M+SDFILHEK D ++
Subjt:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSVI

Query:  RDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLL
         +LI QPS VLPA+D D  Q EE N STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ +LSR DCLWLFALSAAVDTPLD DTCAAFRSLL
Subjt:  RDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLL

Query:  RKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        RKCASLRAEKTE+DDEVIMLNIL+TI GRYFGQSEN
Subjt:  RKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

XP_022145184.1 uncharacterized protein LOC111014690 isoform X1 [Momordica charantia]1.8e-28874.2Show/hide
Query:  MADEISS---------EFSASEDPQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSAS----------AEENQEISETSVAKMVVCDSACAS
        MADEISS          FSA+E+PQ P+QPI SAS+IS DE K P I SNQ Q+ +VMIN  +SAS          AEEN EI+ETSV KMVVCDSA   
Subjt:  MADEISS---------EFSASEDPQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSAS----------AEENQEISETSVAKMVVCDSACAS

Query:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEADGN--------
        SENGG+++SLV +++NLD  +E+EKES KVD V  FET  AVEDGNQEVA+DEVEE+D ARS+ S D NQ   KKE VQEV+L+AA+EADG         
Subjt:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEADGN--------

Query:  -LKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPV
         L+KEA S S LEMKKKLLLEELEAMLV       EKG NPP+S GIVDNC    +  + KIA QQN SENMNVL RS+LSLRNSLKIEVIDETALVEPV
Subjt:  -LKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPV

Query:  HVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWK
        HVS+IGNGEE+GIVCP+RSMQIK+NK  EPE+GGKKAKRSRRRAREAKIPE+H  L NVNELDK N RQKN  GNKIVYSRKDMEALRFVNV+EQ+RLWK
Subjt:  HVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWK

Query:  AICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDK
        AIC ELMP+VAREYSSL  SN  MKIGSTS+ +QHLEKR+EASSIIREGCSESLDGEI+D+EGD+E KNFV S+ SCSLSV+E GNTIL+ECS  SDDDK
Subjt:  AICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDK

Query:  YYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG
        YY+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WEDAFL DFS+LRQALS S+G
Subjt:  YYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG

Query:  SMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAV
         +QSDFILHEK D +  +LIDQPS VLPAN++DSQQ E+P A+T AKE++C D+PS+S ISKM+SVFRVSSLRKRINL+ET  +LSR DC WLFALSAA+
Subjt:  SMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAV

Query:  DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILA I GRYFGQSEN
Subjt:  DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]2.0e-29577.41Show/hide
Query:  MADEISSE--------FSASEDPQSPIQPISSASKISADEK-LPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-
        MADEISS+        FS SE+ QS  +PI SA KISAD+K  PLIVSNQ+Q+ +V INSA+SAS +EN    ETSV K     SAC SSENGGN+ SL 
Subjt:  MADEISSE--------FSASEDPQSPIQPISSASKISADEK-LPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK
        VGKIQNLDVEL KE  KVD VH FET+ AVEDG Q+VA+DEV E+D ARSVLSFDGN  C+K+E VQEV+LAA  EA    +  LKKE D ESILE+KKK
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK

Query:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT
        LLLEEL+AMLVPG++IHLEKG+NPPSS G VD+C KT++ DEEKIA +QN+SE MNVL RSHLSLRNSLKIEVIDETALVEPVHVSKIGNGE IGIVCP 
Subjt:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT

Query:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL
        RSMQ+KVNKSHEP+RGGKKAKRSRR+AREAK+ EM+WNLGNVNELDKVNGRQK A GNKIVYSRKDMEALRFVNV+EQ+RLWKAIC EL+P VAREYSSL
Subjt:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL

Query:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN
          SNYPMKIGSTS+PRQ L KREEASSIIREGCSESLDGEI+D+EGDNES NFV  E SCS SV           SE  D+DKYY SIQRPAFLVEGEPN
Subjt:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN

Query:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIR
        FDSGPPEDGLEYLRRVRWEAS IPNVT+AKVDRSNFKKEQSVYMPVIP IAKCP+HLLPSKEWE+AFL DFS LR+ALS S+   QSDFILHEK DS I 
Subjt:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIR

Query:  DLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLR
        DLI QP  VLPA ++DS Q EE N STSAKENSCNDYPSLS ISKMNSVFRVSSL+KRIN LETQ +LS+TDCLWLFALSAAVDTPLDADTCAAFRSLLR
Subjt:  DLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLR

Query:  KCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        KCASLRA+KTELDDEVIMLNIL+TI GRYFGQSEN
Subjt:  KCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

TrEMBL top hitse value%identityAlignment
A0A0A0KXG5 Uncharacterized protein2.1e-28574.22Show/hide
Query:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-
        MADEISS+        F +SE PQSP + + SA +ISAD    PLIVSNQ+ + +V INS +SASA+E+    ETSV KMV+CDSAC SSENGGN+ SL 
Subjt:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK
        VGKIQNLD+EL KE  KVD VH F T+   EDG Q+VA+DEV+ +D ARSVLS DGNQ C K+E V+E +LAA  EA    +  LKKE DSESILEMKKK
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK

Query:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNN-SENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCP
        LLLE+++AMLVPG+EIHL++G NPPSSGGIVD C KTM+  EEKIA QQNN SE MNVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNGE IGIVCP
Subjt:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNN-SENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCP

Query:  TRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSS
        TRSMQ+KVNKSHEP++GGKKAK+SRR+ARE K+ EMHWN+GN+NE+DKVNGRQ+NA GNKIVYSRKDMEALRFVNV+EQKRLWKAIC EL+P+VAREYSS
Subjt:  TRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSS

Query:  LAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEP
        L      +K GSTS+PRQ L KREEASSIIREGCSESLDGEI+D+ GD+E  NFV SE SCSL             S+ SDDDKYY+SIQRPAF VEGEP
Subjt:  LAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEP

Query:  NFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSV
        NFDSGPPEDGLEYLRRVRWEAS IPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSKLRQALS S+   M+SDFILHEK D +
Subjt:  NFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSV

Query:  IRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSL
        + +LI QPS VLPAND DS Q +E + STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ +LSR DCLWLFALSAAVDTPLD DTCAAFRSL
Subjt:  IRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSL

Query:  LRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        LRKCASLRAEKTE+D+EVIMLNIL+TI GRYF QSEN
Subjt:  LRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948755.8e-28874.32Show/hide
Query:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-
        MADEI+S+        F +SE+PQSP +P+ SA  ISAD    PLIVSN++ +C+V IN+ +SAS +EN    E+SV KMV+CDSAC SSENGG++ SL 
Subjt:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK
        VGKIQNLDVEL KES KVD VH FET+   ED  QEVA+DEV+ +D ARSVLSFDGNQ C K+E VQE +LAA  EA    +  LKKE DSESILEMKKK
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK

Query:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT
        LLLE+++AMLVPG+EIHL++G NPPSSGGIVD C KTM+ DEEKIA QQN+SE MNVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+ IGIVCPT
Subjt:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT

Query:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL
        RSMQ++V KSHEP++GGKK  +SRR+ARE K+ EMHWN+ NVNE+DKV+GRQ+NA GNKI+YSRKDMEALRFVNV+EQKRLWKAIC EL+P+VAREYSSL
Subjt:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL

Query:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN
              +K GSTS+PRQ L KREEASSIIREGCSESLDGEI+D+EGDNE  NFV SE SCSL             S+ SDDDKYY+SIQRPAFLVEGEPN
Subjt:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN

Query:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSVI
        FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSKLRQALS S+   M+SDFILHEK D ++
Subjt:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSVI

Query:  RDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLL
         +LI QPS VLPA+D D  Q EE N STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ +LSR DCLWLFALSAAVDTPLD DTCAAFRSLL
Subjt:  RDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLL

Query:  RKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        RKCASLRAEKTE+DDEVIMLNIL+TI GRYFGQSEN
Subjt:  RKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A5A7TRY3 Mis18-binding protein 1-like isoform X13.0e-28474.35Show/hide
Query:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-
        MADEI+S+        F +SE+PQSP +P+ SA  ISAD    PLIVSN++ +C+V IN+ +SAS +EN    E+SV KMV+CDSAC SSENGG++ SL 
Subjt:  MADEISSE--------FSASEDPQSPIQPISSASKISAD-EKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK
        VGKIQNLDVEL KES KVD VH FET+   ED  QEVA+DEV+ +D ARSVLSFDGNQ C K+E VQE +LAA  EA    +  LKKE DSESILEMKKK
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEA----DGNLKKEADSESILEMKKK

Query:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT
        LLLE+++AMLVPG+EIHL++G NPPSSGGIVD C KTM+ DEEKIA QQN+SE MNVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+ IGIVCPT
Subjt:  LLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPT

Query:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL
        RSMQ++V KSHEP++GGKKAK+SRR+ARE K+ EMHWN+ NVNE+DKV+GRQ+NA GNKI+YSRKDMEALRFVNV+EQKRLWKAIC EL+P+VAREYSSL
Subjt:  RSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSL

Query:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN
              +K GSTS+PRQ L KREEASSIIREGCSESLDGEI+D+EGDNE  NFV SE SCSL             S+ SDDDKYY+SIQRPAFLVEGEPN
Subjt:  AGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPN

Query:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSVI
        FDSGPPEDGLEYLRRVRWEAS IPNVTVAKVDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSKLRQALS S+   M+SDFILHEK D ++
Subjt:  FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS-MQSDFILHEKTDSVI

Query:  RDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLL
         +LI QPS VLPA+D D  Q EE N STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ +LSR DCLWLFALSAAVDTPLD DTCAAFRSLL
Subjt:  RDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLL

Query:  RKCASLRAEKTELDDEVIMLNILATICGR
        RKCASLRAEKTE+DDEVIMLNIL+TI GR
Subjt:  RKCASLRAEKTELDDEVIMLNILATICGR

A0A6J1CUG9 uncharacterized protein LOC111014690 isoform X18.9e-28974.2Show/hide
Query:  MADEISS---------EFSASEDPQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSAS----------AEENQEISETSVAKMVVCDSACAS
        MADEISS          FSA+E+PQ P+QPI SAS+IS DE K P I SNQ Q+ +VMIN  +SAS          AEEN EI+ETSV KMVVCDSA   
Subjt:  MADEISS---------EFSASEDPQSPIQPISSASKISADE-KLPLIVSNQDQECQVMINSASSAS----------AEENQEISETSVAKMVVCDSACAS

Query:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEADGN--------
        SENGG+++SLV +++NLD  +E+EKES KVD V  FET  AVEDGNQEVA+DEVEE+D ARS+ S D NQ   KKE VQEV+L+AA+EADG         
Subjt:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQEVRLAAAMEADGN--------

Query:  -LKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPV
         L+KEA S S LEMKKKLLLEELEAMLV       EKG NPP+S GIVDNC    +  + KIA QQN SENMNVL RS+LSLRNSLKIEVIDETALVEPV
Subjt:  -LKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPV

Query:  HVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWK
        HVS+IGNGEE+GIVCP+RSMQIK+NK  EPE+GGKKAKRSRRRAREAKIPE+H  L NVNELDK N RQKN  GNKIVYSRKDMEALRFVNV+EQ+RLWK
Subjt:  HVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWK

Query:  AICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDK
        AIC ELMP+VAREYSSL  SN  MKIGSTS+ +QHLEKR+EASSIIREGCSESLDGEI+D+EGD+E KNFV S+ SCSLSV+E GNTIL+ECS  SDDDK
Subjt:  AICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDK

Query:  YYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG
        YY+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEASRIPNVTVAKVD SN KKEQSVYMPVIPAIA CP+HLLPSK+WEDAFL DFS+LRQALS S+G
Subjt:  YYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG

Query:  SMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAV
         +QSDFILHEK D +  +LIDQPS VLPAN++DSQQ E+P A+T AKE++C D+PS+S ISKM+SVFRVSSLRKRINL+ET  +LSR DC WLFALSAA+
Subjt:  SMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAV

Query:  DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILA I GRYFGQSEN
Subjt:  DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A6J1F307 uncharacterized protein LOC1114392131.7e-28775.21Show/hide
Query:  DEISSEFSASE-DPQSPIQPISSASKISADEKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVEL
        D  S +FSASE   +SP  P         + K PLIVSN   +C+V +NS+SSAS EEN   +ETSV KMVVCD   ASSENGGN+ SLV + + LDVEL
Subjt:  DEISSEFSASE-DPQSPIQPISSASKISADEKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVEL

Query:  EKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQSCTKKEPVQEVRLAAAMEADGN---------LKKEADSESILEMKKKLLLEE
         +ESFKVD VH FE IGAVEDGNQEVAMDEVE +D    SV SFDGNQ C KKE VQEV+ + AMEAD           L+KEAD+ESILEMKKKLLLEE
Subjt:  EKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQSCTKKEPVQEVRLAAAMEADGN---------LKKEADSESILEMKKKLLLEE

Query:  LEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQI
        LEAMLVPGEEIHLEK           DNCGK M+ DEEKIA QQN+SEN +VL +SHLSL NSLKIEVIDETALVEPVHVSKIGNGEEI I+CPTRSMQI
Subjt:  LEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQI

Query:  KVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSNY
         V+KSHEPER GKKA+RSRRRAREAKI E+HWNLGNVNELDK     KNA G+KIVYSRKDMEALRFVNVSEQ RLW+AIC ELMP+VAREYSSL  SNY
Subjt:  KVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSNY

Query:  PMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGP
        PMK GSTS PRQH EK EEASS IR+GCSESLD EI+D+EGDNE  NF F + SC LSV           SE S+DD+YYNSIQRPAFLVEGEPNF+SGP
Subjt:  PMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGP

Query:  PEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLIDQ
        PEDGLEYLRRVRWEAS IPNV VAKVDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFSKLRQ LSC +G MQSDFI HEK DSV  D IDQ
Subjt:  PEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLIDQ

Query:  PSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASL
        PS+VLPAN++DSQQ EEPNASTS+KENS N+YPSLS ISKMNSVFRVSSLRKRIN LETQ +LSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCASL
Subjt:  PSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASL

Query:  RAEKTELDDEVIMLNILATICGRYFGQSEN
        RAEK+ELDDEVIMLNILATI GRYFGQSEN
Subjt:  RAEKTELDDEVIMLNILATICGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 21.3e-1330.4Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D    K++QSV +  +      PE   P+ +W+   +  FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDF

Query:  ILHEKTDSVIRDLIDQPSVVLPAN-DVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLL---ETQPSLSRTDCLWLFALSAAVDT
        +    T     D        L      D       N S           P LS +S+MN     S L    N     +  P L R    WL+AL A ++ 
Subjt:  ILHEKTDSVIRDLIDQPSVVLPAN-DVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLL---ETQPSLSRTDCLWLFALSAAVDT

Query:  PLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE
        PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  PLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE

O42260 Gem-associated protein 23.9e-1529.75Show/hide
Query:  SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDL
        S PP    EYLRRV+ EA+R P+V +A++D    +K+Q+V +  +      P+   PS  W+   +  FS +RQ+L    G  +S  +    T     D 
Subjt:  SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDL

Query:  IDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLLETQ---PSLSRTDCLWLFALSAAVDTPLDADTCA
               L     +    +   A  S  ++   DY      P LS +S+M+     S L   +N  E +   P L R    WL+AL A ++ PL  +  +
Subjt:  IDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLLETQ---PSLSRTDCLWLFALSAAVDTPLDADTCA

Query:  AFRSLLRKCASLRA-EKTELDDEVIMLNILATICGRYFGQSE
          R L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  AFRSLLRKCASLRA-EKTELDDEVIMLNILATICGRYFGQSE

Q54KN2 Gem-associated protein 22.2e-1824.07Show/hide
Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKK--EQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALS----
        +  Q  AF V  E   D   P  G EYL+RV+W ++R P+V VA +D S  K     + Y  + P+I KC + LLP+  WE  FL+DFS+ RQ L     
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKK--EQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALS----

Query:  ------------------------------------------C--------------------------SDGSMQSDFILHEKTDSVIRDLIDQPSVVLP
                                                  C                           D     D+  +E  +    +  ++      
Subjt:  ------------------------------------------C--------------------------SDGSMQSDFILHEKTDSVIRDLIDQPSVVLP

Query:  ANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTE
          + + ++ EE     S K+ +  + P++  + +++ V  V+ +   I  LE +   ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T 
Subjt:  ANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTE

Query:  LDD-EVIMLNILATICGRYFGQSE
        L+D  +  +NIL TI  +YF Q E
Subjt:  LDD-EVIMLNILATICGRYFGQSE

Q9CQQ4 Gem-associated protein 29.6e-1429.8Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D    K++QSV +  +      PE   P+ +W+   +  FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDF

Query:  ILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQPSLSRTDCLWLFALS
           ++ DS +     +         +  +   E     S +E+   DY      P LS +S+MN     S L    N     +  P L R    W +AL 
Subjt:  ILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQPSLSRTDCLWLFALS

Query:  AAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE
        A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  AAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE

Q9QZP1 Gem-associated protein 21.6e-1329.8Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D    K++QSV +  +      PE   P+ +W+   +  FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDF

Query:  ILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQPSLSRTDCLWLFALS
           ++ DS +     +         +  +   E     S  E+   DY      P LS +S+MN     S L    N     +  P L R    W +AL 
Subjt:  ILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQPSLSRTDCLWLFALS

Query:  AAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE
        A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  AAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related8.2e-6933.72Show/hide
Query:  ARSVLSFDG--NQSCTKKEPVQEVRLAAAMEADGNLKKE-----ADSESILEMKKKLLLEELEAMLVPG---EEIHLEKGSNPPSSGGIVDNCGKTMVTD
        A++ +S DG   +S  KK+   E+   A+   +  +  E      ++  + E+K+    +  E   V G   E ++ E+      +  +++   K  +  
Subjt:  ARSVLSFDG--NQSCTKKEPVQEVRLAAAMEADGNLKKE-----ADSESILEMKKKLLLEELEAMLVPG---EEIHLEKGSNPPSSGGIVDNCGKTMVTD

Query:  EEKIAAQQNNSENMNVLGRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLG
        E ++ +    S +++ L +   ++ N + KIE++D TALV+ V                           H P+R G  A+  +   R+ K       +G
Subjt:  EEKIAAQQNNSENMNVLGRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLG

Query:  NVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGE
        +   + + +G + N    + +Y+RK +E++RF ++  QK LW  + + ++P V  EY SL    Y     S+   R                        
Subjt:  NVNELDKVNGRQKNAGGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGE

Query:  IKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNF-KKE
           V G  ES N        +L   EG     E+  + +DD+  YNSI RPAF V+GEP+F +GPPEDGLEYLRRVRWEA  IPNV VAK+D S + KKE
Subjt:  IKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNF-KKE

Query:  QSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPS
        QSVYMP+IP I KCPE+LLP KEWED+ L DF  LRQ L+ S  S + + I  +  + ++ ++ ++              H E + S             
Subjt:  QSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPS

Query:  LSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILATICGRYFGQ
        ++ I  M+SV RVS L+KRI L+E +  L  +DC W+ AL A+++TPLDADTCA  R LLRKCAS+RAE + E+ DE  + M N+L TI GRYFGQ
Subjt:  LSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKT-ELDDE--VIMLNILATICGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown3.6e-2470.89Show/hide
Query:  ILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFK-KEQSVYMPVIP
        + EE +E +DD   YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA RIPNV VAKV  S ++ KEQSVYMP IP
Subjt:  ILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFK-KEQSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown3.4e-3024.81Show/hide
Query:  EENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQ
        E    I E +     + +S+  S+  G  +   V KI+++   +E + F V +    E++ + + G        VE           DG+    + + +Q
Subjt:  EENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQSCTKKEPVQ

Query:  EVRLAAAMEADGN------LKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENM-----NVLGR
        +++  + +  + +       K E D    +   K++L  + +       ++ +   +   + G  +        +D  KI    N S++       V  +
Subjt:  EVRLAAAMEADGN------LKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENM-----NVLGR

Query:  SHLSLRNSLKIEVIDETALVEPVHVSKIG-NGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNK
         +  +  S+ I+++D+TAL + V   K G +  +  +   T     + +K    E+   K   S    R A        + +     ++NG+Q      +
Subjt:  SHLSLRNSLKIEVIDETALVEPVHVSKIG-NGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNK

Query:  IVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESS
        I+YSR  ME++R+ +++ QK+LW  +   L+P +  EY                                        +G I  V     ++++V  E  
Subjt:  IVYSRKDMEALRFVNVSEQKRLWKAICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESS

Query:  CSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFK-KEQSVYMPVIPAIAKCPEHLL
                         E ++D+  YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA RIPNV VAKV  S ++ KEQSVYMP IP      E + 
Subjt:  CSLSVNEGGNTILEECSEVSDDDKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFK-KEQSVYMPVIPAIAKCPEHLL

Query:  PSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKR
                                G + +  +L   +      L      +L   D                 N  +D    +GI             K 
Subjt:  PSKEWEDAFLDDFSKLRQALSCSDGSMQSDFILHEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKR

Query:  INLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRA
            E +  L  +DC W+ AL A+VDTP DADT A  R+L+RKCASLRA
Subjt:  INLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGTTTTAGGCATTCGGAGGCGATTTGGGCTGAACCGAGTGGAACCGAGGCAGCTGGGGGCACCGGTGGATCGAACGGAAGCGGAAGAACTCGGCCCGCGCAAGCGGGCCG
AGCAGGGGGTCGGGCCAAAAGCCCGATCCCTTCGGCTTTGGCTCGACCCTCTGTCCTGTTCTTCCTCCGGATCCCGTTTTCTGGCTGCCTCCTTAGGCTGGTATCGCGCT
GTCTTCGTCAACTTCTTGCATATCGGACAATGGCGGATGAGATAAGTTCTGAATTCTCAGCATCTGAGGACCCCCAATCTCCTATTCAACCGATTAGTTCTGCCTCTAAG
ATCTCTGCCGACGAGAAGCTCCCTTTGATCGTCTCGAATCAAGACCAGGAGTGTCAAGTAATGATAAACAGTGCGAGTTCCGCTTCTGCAGAAGAGAACCAAGAAATTTC
AGAAACTTCTGTCGCAAAGATGGTCGTGTGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGGAAACGTGAGCAGTCTAGTGGGCAAGATTCAGAATCTTGATGTGGAGC
TCGAAAAAGAATCTTTCAAGGTCGATGTTGTCCATGGTTTTGAAACGATTGGTGCCGTGGAAGACGGTAATCAAGAAGTTGCGATGGATGAAGTAGAAGAGAGAGATTCT
GCAAGAAGTGTACTGAGTTTTGATGGAAATCAATCTTGTACGAAGAAAGAACCTGTTCAAGAAGTTCGGTTGGCTGCTGCTATGGAAGCCGACGGCAACTTGAAGAAAGA
AGCTGATTCTGAGAGCATTCTGGAAATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAGAAGAAATTCATCTAGAAAAGGGAAGTAATCCCC
CTAGCTCAGGAGGGATCGTGGATAATTGCGGCAAAACGATGGTTACTGATGAGGAGAAGATTGCTGCTCAGCAAAATAATTCCGAGAACATGAATGTTCTTGGGCGAAGT
CATTTGTCTCTTAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAACCGGTTCATGTTTCTAAAATTGGAAATGGAGAAGAGATTGGTATAGTTTG
TCCAACAAGGTCAATGCAGATCAAGGTGAACAAATCCCATGAACCCGAAAGAGGGGGGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAAGCAAAGATTCCTGAGA
TGCATTGGAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAAAAAAATGCGGGAGGAAACAAGATAGTGTATTCGAGGAAGGATATGGAAGCACTGAGA
TTTGTGAATGTTTCAGAACAGAAGAGATTGTGGAAAGCTATATGCAATGAACTTATGCCCATTGTGGCAAGAGAATACAGTAGCTTAGCAGGCTCGAATTACCCAATGAA
GATAGGCTCCACCTCTGAGCCTAGGCAGCATTTGGAGAAGAGAGAAGAAGCCTCTTCAATTATAAGGGAGGGATGTTCAGAAAGCTTGGATGGTGAGATAAAGGACGTGG
AAGGTGATAATGAAAGTAAAAATTTTGTATTTTCGGAATCCTCTTGCAGTCTTAGTGTCAACGAGGGTGGCAATACAATTTTAGAAGAGTGCAGTGAAGTTAGTGATGAT
GATAAATATTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTG
GGAAGCTTCCCGTATTCCAAACGTGACAGTGGCAAAAGTTGATAGAAGTAATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAGTGCCCCG
AGCATTTACTGCCTTCAAAAGAATGGGAGGATGCATTTCTTGATGATTTTTCTAAGCTACGTCAGGCTCTGTCATGCTCCGATGGATCTATGCAGTCTGATTTCATCCTC
CATGAAAAGACCGATTCTGTAATTCGAGACTTGATTGATCAGCCAAGTGTTGTCTTGCCTGCCAACGACGTCGACTCACAGCAACACGAGGAACCGAATGCCAGCACTTC
TGCAAAGGAAAACAGTTGCAATGATTATCCATCTTTATCAGGAATCTCAAAGATGAACTCGGTGTTCCGTGTGTCGTCGTTGAGGAAACGTATAAACTTGTTAGAAACAC
AGCCATCACTGTCAAGGACCGATTGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCCGCTTTCAGGAGTCTGCTTAGGAAA
TGTGCCAGCTTGCGGGCCGAGAAGACCGAGCTTGACGACGAGGTGATAATGCTCAATATTCTTGCCACCATTTGTGGAAGGTACTTTGGACAGTCGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
GGTTTTAGGCATTCGGAGGCGATTTGGGCTGAACCGAGTGGAACCGAGGCAGCTGGGGGCACCGGTGGATCGAACGGAAGCGGAAGAACTCGGCCCGCGCAAGCGGGCCG
AGCAGGGGGTCGGGCCAAAAGCCCGATCCCTTCGGCTTTGGCTCGACCCTCTGTCCTGTTCTTCCTCCGGATCCCGTTTTCTGGCTGCCTCCTTAGGCTGGTATCGCGCT
GTCTTCGTCAACTTCTTGCATATCGGACAATGGCGGATGAGATAAGTTCTGAATTCTCAGCATCTGAGGACCCCCAATCTCCTATTCAACCGATTAGTTCTGCCTCTAAG
ATCTCTGCCGACGAGAAGCTCCCTTTGATCGTCTCGAATCAAGACCAGGAGTGTCAAGTAATGATAAACAGTGCGAGTTCCGCTTCTGCAGAAGAGAACCAAGAAATTTC
AGAAACTTCTGTCGCAAAGATGGTCGTGTGCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGGAAACGTGAGCAGTCTAGTGGGCAAGATTCAGAATCTTGATGTGGAGC
TCGAAAAAGAATCTTTCAAGGTCGATGTTGTCCATGGTTTTGAAACGATTGGTGCCGTGGAAGACGGTAATCAAGAAGTTGCGATGGATGAAGTAGAAGAGAGAGATTCT
GCAAGAAGTGTACTGAGTTTTGATGGAAATCAATCTTGTACGAAGAAAGAACCTGTTCAAGAAGTTCGGTTGGCTGCTGCTATGGAAGCCGACGGCAACTTGAAGAAAGA
AGCTGATTCTGAGAGCATTCTGGAAATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAGAAGAAATTCATCTAGAAAAGGGAAGTAATCCCC
CTAGCTCAGGAGGGATCGTGGATAATTGCGGCAAAACGATGGTTACTGATGAGGAGAAGATTGCTGCTCAGCAAAATAATTCCGAGAACATGAATGTTCTTGGGCGAAGT
CATTTGTCTCTTAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAACCGGTTCATGTTTCTAAAATTGGAAATGGAGAAGAGATTGGTATAGTTTG
TCCAACAAGGTCAATGCAGATCAAGGTGAACAAATCCCATGAACCCGAAAGAGGGGGGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAAGCAAAGATTCCTGAGA
TGCATTGGAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAAAAAAATGCGGGAGGAAACAAGATAGTGTATTCGAGGAAGGATATGGAAGCACTGAGA
TTTGTGAATGTTTCAGAACAGAAGAGATTGTGGAAAGCTATATGCAATGAACTTATGCCCATTGTGGCAAGAGAATACAGTAGCTTAGCAGGCTCGAATTACCCAATGAA
GATAGGCTCCACCTCTGAGCCTAGGCAGCATTTGGAGAAGAGAGAAGAAGCCTCTTCAATTATAAGGGAGGGATGTTCAGAAAGCTTGGATGGTGAGATAAAGGACGTGG
AAGGTGATAATGAAAGTAAAAATTTTGTATTTTCGGAATCCTCTTGCAGTCTTAGTGTCAACGAGGGTGGCAATACAATTTTAGAAGAGTGCAGTGAAGTTAGTGATGAT
GATAAATATTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTG
GGAAGCTTCCCGTATTCCAAACGTGACAGTGGCAAAAGTTGATAGAAGTAATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAGTGCCCCG
AGCATTTACTGCCTTCAAAAGAATGGGAGGATGCATTTCTTGATGATTTTTCTAAGCTACGTCAGGCTCTGTCATGCTCCGATGGATCTATGCAGTCTGATTTCATCCTC
CATGAAAAGACCGATTCTGTAATTCGAGACTTGATTGATCAGCCAAGTGTTGTCTTGCCTGCCAACGACGTCGACTCACAGCAACACGAGGAACCGAATGCCAGCACTTC
TGCAAAGGAAAACAGTTGCAATGATTATCCATCTTTATCAGGAATCTCAAAGATGAACTCGGTGTTCCGTGTGTCGTCGTTGAGGAAACGTATAAACTTGTTAGAAACAC
AGCCATCACTGTCAAGGACCGATTGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCCGCTTTCAGGAGTCTGCTTAGGAAA
TGTGCCAGCTTGCGGGCCGAGAAGACCGAGCTTGACGACGAGGTGATAATGCTCAATATTCTTGCCACCATTTGTGGAAGGTACTTTGGACAGTCGGAAAATTGA
Protein sequenceShow/hide protein sequence
GFRHSEAIWAEPSGTEAAGGTGGSNGSGRTRPAQAGRAGGRAKSPIPSALARPSVLFFLRIPFSGCLLRLVSRCLRQLLAYRTMADEISSEFSASEDPQSPIQPISSASK
ISADEKLPLIVSNQDQECQVMINSASSASAEENQEISETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDS
ARSVLSFDGNQSCTKKEPVQEVRLAAAMEADGNLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGSNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENMNVLGRS
HLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAGGNKIVYSRKDMEALR
FVNVSEQKRLWKAICNELMPIVAREYSSLAGSNYPMKIGSTSEPRQHLEKREEASSIIREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEVSDD
DKYYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVTVAKVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSMQSDFIL
HEKTDSVIRDLIDQPSVVLPANDVDSQQHEEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQPSLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRK
CASLRAEKTELDDEVIMLNILATICGRYFGQSEN