; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021691 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021691
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationchr7:10819486..10825832
RNA-Seq ExpressionLag0021691
SyntenyLag0021691
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597639.1 Gem-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]6.3e-29174.63Show/hide
Query:  DEISSEFSASED-PQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE
        D  S +F ASED  +    P  S +KISA+E K PLIVSN + +C+V +NS+SSAS +EN   +ETSV KMVVCD   ASSENGGN+ SLV + + LDV+
Subjt:  DEISSEFSASED-PQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE

Query:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLL
        L +ESFKVD VH FE IG+VEDGNQEVAMDEVE +D    SV SFDGNQ C KKE VQEV+ + AMEAD   KEAF RTE+LL+KEAD+ESILEMKKKLL
Subjt:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLL

Query:  LEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRS
        LEELEAMLVPGEEIHLEK           DNCGKTM+ DEEKI+ QQN+SEN NVL +SHLSL NSLKIEVIDETALVEPVHVSKIGNGEEI I CPTRS
Subjt:  LEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRS

Query:  MQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTG
        MQI V+KSHEPER GKKA+RSRRRAREAKI E+HWNLGNVNELDK     KNAEG+KIVYSRKDMEALRFVNVSEQ RLW+AI  ELMPVVAREYSSLT 
Subjt:  MQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTG

Query:  SNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDK
        SNYPMK GSTS PRQH EK EEASS I                   R+GCSESLD EI+D+EGDNE  NF F + SC LSV           SEDS+DDK
Subjt:  SNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDK

Query:  FYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG
        +YNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEAS IPNV+VA+VDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFS LRQ LSC +G
Subjt:  FYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG

Query:  SIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAA
         +QSDF+ HEK +SV PD IDQPSVV LPANDIDSQQ EEPNASTS+KENS N+YPSLS ISKMNSVFRVSSLRKRIN LETQ TLSRTDCLWLFALSAA
Subjt:  SIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAA

Query:  VDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        VDTPLDADTCA+FRSLLRKCASLRAEK+EL+DEVIMLNILATI GRYFGQSEN
Subjt:  VDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

KAG7029080.1 Gem-associated protein 2 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-29074.63Show/hide
Query:  DEISSEFSASED-PQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE
        D  S +F ASED  +    P  S +KISA+E K PLIVSN + +C+V +NS+SSAS +E    +ETSV KMVVCD   ASSENGGN+ SLV + + LDV+
Subjt:  DEISSEFSASED-PQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVE

Query:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLL
        L +ESFKVD VH FE IG+VEDGNQEVAMDEVE +D    SV SFDGNQ C KKE VQEV+ + AMEAD   KEAF RTE+LL+KEAD+ESILEMKKKLL
Subjt:  LEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLL

Query:  LEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRS
        LEELEAMLVPGEEIHLEK           DNCGKTM+ DEEKI+ Q+N+SEN NVL +SHLSL NSLKIEVIDETALVEPVHVSKIGNGEEI I CPTRS
Subjt:  LEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRS

Query:  MQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTG
        MQI V+KSHEPER GKKA+RSRRRAREAKI E+HWNLGNVNELDK     KNAEG+KIVYSRKDMEALRFVNVSEQ RLW+AI  ELMPVVAREYSSLT 
Subjt:  MQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTG

Query:  SNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDK
        SNYPMK GSTS PRQH EK EEASS I                   R+GCSESLD EI+D+EGDNE  NF F + SC LSV           SEDS+DDK
Subjt:  SNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDK

Query:  FYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG
        +YNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEAS IPNVAVA+VDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFS LRQ LSC +G
Subjt:  FYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDG

Query:  SIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAA
         +QSDF+ HEK +SV PD IDQPSVV LPANDIDSQQ EEPNASTS+KENS N+YPSLS ISKMNSVFRVSSLRKRIN LETQ TLSRTDCLWLFALSAA
Subjt:  SIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAA

Query:  VDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        VDTPLDADTCA+FRSLLRKCASLRAEK+ELDDEVIMLNILATI GRYFGQSEN
Subjt:  VDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

XP_008454478.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103494875 [Cucumis melo]1.4e-29072.87Show/hide
Query:  MADEISSE--------FSASEDPQSLIQPFGSASKISAD-EKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-
        MADEI+S+        F +SE+PQS  +P  SA  ISAD    PLIVSN+N +C+V IN+ +SAS QENP   E+SV KMV+CDSAC SSENGG++ SL 
Subjt:  MADEISSE--------FSASEDPQSLIQPFGSASKISAD-EKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES
        VGKIQNLDVEL KES KVD VH FET+   ED  QEVA+DEV+ +D ARSVLSFDGNQ C K+E VQE +LAA        KEAFARTE LLKKE DSES
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES

Query:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEE
        ILEMKKKLLLE+++AMLVPG+EIHL++G+NPPSSGGIVD C KTM+ DEEKIA QQN+SE +NVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+ 
Subjt:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEE

Query:  IGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVV
        IGIVCPTRSMQ++V KSHEP++GGKK  +SRR+ARE K+ EMHWN+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNV+EQKRLWKAIC EL+PVV
Subjt:  IGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVV

Query:  AREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEE
        AREYSSLT     +K GSTS+PRQ L KREEASSII                   REGCSESLDGEI+D+EGDNE  NFV SE SCSL            
Subjt:  AREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEE

Query:  CSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKL
         S+DSDDDK+Y+SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEAS IPNV VA+VDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSKL
Subjt:  CSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKL

Query:  RQALSCSDGS-IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTD
        RQALS S+   ++SDF+LHEK D ++P+LI QPSV  LPA+D D  Q EE N STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ TLSR D
Subjt:  RQALSCSDGS-IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTD

Query:  CLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        CLWLFALSAAVDTPLD DTCAAFRSLLRKCASLRAEKTE+DDEVIMLNIL+TI GRYFGQSEN
Subjt:  CLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

XP_022145184.1 uncharacterized protein LOC111014690 isoform X1 [Momordica charantia]4.4e-29273.13Show/hide
Query:  MADEISS---------EFSASEDPQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSAS----------AQENPEFSETSVAKMVVCDSACAS
        MADEISS          FSA+E+PQ  +QP  SAS+IS DE K P I SNQ Q+ +VMIN  +SAS          A+ENPE +ETSV KMVVCDSA   
Subjt:  MADEISS---------EFSASEDPQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSAS----------AQENPEFSETSVAKMVVCDSACAS

Query:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFART
        SENGG+++SLV +++NLD  +E+EKES KVD V  FET  AVEDGNQEVA+DEVEE+D ARS+ S D NQ   KKE VQEV+L+AA+EAD  GKEAFAR+
Subjt:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFART

Query:  EDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVE
        E+LL+KEA S S LEMKKKLLLEELEAMLV       EKG NPP+S GIVDNC    +  + KIA QQN SEN+NVL RS+LSLRNSLKIEVIDETALVE
Subjt:  EDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVE

Query:  PVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRL
        PVHVS+IGNGEE+GIVCP+RSMQIK+NK  EPE+GGKKAKRSRRRAREAKIPE+H  L NVNELDK N RQKN EGNKIVYSRKDMEALRFVNV+EQ+RL
Subjt:  PVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRL

Query:  WKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSL
        WKAIC ELMPVVAREYSSLT SN  MKIGSTS+ +QHLEKR+EASSII                   REGCSESLDGEI+D+EGD+E KNFV S+ SCSL
Subjt:  WKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSL

Query:  SVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKE
        SV+E GNTIL+ECS DSDDDK+Y+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEASRIPNV VA+VD SN KKEQSVYMPVIPAIA CP+HLLPSK+
Subjt:  SVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKE

Query:  WEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINL
        WEDAFL DFS+LRQALS S+G IQSDF+LHEK D + P+LIDQPSV  LPAN+IDSQQ E+P A+T AKE++C D+PS+S ISKM+SVFRVSSLRKRINL
Subjt:  WEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINL

Query:  LETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        +ET  TLSR DC WLFALSAA+DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILA I GRYFGQSEN
Subjt:  LETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]6.8e-30176.12Show/hide
Query:  MADEISSE--------FSASEDPQSLIQPFGSASKISADEK-LPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-
        MADEISS+        FS SE+ QS  +P  SA KISAD+K  PLIVSNQNQ+ +V INSA+SAS QENP   ETSV K     SAC SSENGGN+ SL 
Subjt:  MADEISSE--------FSASEDPQSLIQPFGSASKISADEK-LPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES
        VGKIQNLDVEL KE  KVD VH FET+ AVEDG Q+VA+DEV E+D ARSVLSFDGN  C+K+E VQEV+LAA        KEAFARTE+LLKKE D ES
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES

Query:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEE
        ILE+KKKLLLEEL+AMLVPG++IHLEKGNNPPSS G VD+C KT++ DEEKIA +QN+SE +NVL RSHLSLRNSLKIEVIDETALVEPVHVSKIGNGE 
Subjt:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEE

Query:  IGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVV
        IGIVCP RSMQ+KVNKSHEP+RGGKKAKRSRR+AREAK+ EM+WNLGNVNELDKVNGRQK AEGNKIVYSRKDMEALRFVNV+EQ+RLWKAIC EL+P V
Subjt:  IGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVV

Query:  AREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEE
        AREYSSLT SNYPMKIGSTS+PRQ L KREEASSII                   REGCSESLDGEI+D+EGDNES NFV  E SCS SV          
Subjt:  AREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEE

Query:  CSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKL
         SED D+DK+Y SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEAS IPNV +A+VDRSNFKKEQSVYMPVIP IAKCP+HLLPSKEWE+AFL DFS L
Subjt:  CSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKL

Query:  RQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDC
        R+ALS S+   QSDF+LHEK DS IPDLI QP V  LPA +IDS Q EE N STSAKENSCNDYPSLS ISKMNSVFRVSSL+KRIN LETQ TLS+TDC
Subjt:  RQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDC

Query:  LWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        LWLFALSAAVDTPLDADTCAAFRSLLRKCASLRA+KTELDDEVIMLNIL+TI GRYFGQSEN
Subjt:  LWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

TrEMBL top hitse value%identityAlignment
A0A0A0KXG5 Uncharacterized protein2.4e-28872.77Show/hide
Query:  MADEISSE--------FSASEDPQSLIQPFGSASKISAD-EKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-
        MADEISS+        F +SE PQS  +   SA +ISAD    PLIVSNQN + +V INS +SASAQE+P   ETSV KMV+CDSAC SSENGGN+ SL 
Subjt:  MADEISSE--------FSASEDPQSLIQPFGSASKISAD-EKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES
        VGKIQNLD+EL KE  KVD VH F T+   EDG Q+VA+DEV+ +D ARSVLS DGNQ C K+E V+E +LAA        KEAFARTE LLKKE DSES
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES

Query:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNN-SENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGE
        ILEMKKKLLLE+++AMLVPG+EIHL++G+NPPSSGGIVD C KTM+  EEKIA QQNN SE +NVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNGE
Subjt:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNN-SENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGE

Query:  EIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPV
         IGIVCPTRSMQ+KVNKSHEP++GGKKAK+SRR+ARE K+ EMHWN+GN+NE+DKVNGRQ+NAEGNKIVYSRKDMEALRFVNV+EQKRLWKAIC EL+PV
Subjt:  EIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPV

Query:  VAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILE
        VAREYSSLT     +K GSTS+PRQ L KREEASSII                   REGCSESLDGEI+D+ GD+E  NFV SE SCSL           
Subjt:  VAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILE

Query:  ECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSK
          S+DSDDDK+Y+SIQRPAF VEGEPNFDSGPPEDGLEYLRRVRWEAS IPNV VA+VDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSK
Subjt:  ECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSK

Query:  LRQALSCSDGS-IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRT
        LRQALS S+   ++SDF+LHEK D ++P+LI QPSV  LPAND DS Q +E + STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ TLSR 
Subjt:  LRQALSCSDGS-IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRT

Query:  DCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        DCLWLFALSAAVDTPLD DTCAAFRSLLRKCASLRAEKTE+D+EVIMLNIL+TI GRYF QSEN
Subjt:  DCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948756.8e-29172.87Show/hide
Query:  MADEISSE--------FSASEDPQSLIQPFGSASKISAD-EKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-
        MADEI+S+        F +SE+PQS  +P  SA  ISAD    PLIVSN+N +C+V IN+ +SAS QENP   E+SV KMV+CDSAC SSENGG++ SL 
Subjt:  MADEISSE--------FSASEDPQSLIQPFGSASKISAD-EKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSL-

Query:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES
        VGKIQNLDVEL KES KVD VH FET+   ED  QEVA+DEV+ +D ARSVLSFDGNQ C K+E VQE +LAA        KEAFARTE LLKKE DSES
Subjt:  VGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSES

Query:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEE
        ILEMKKKLLLE+++AMLVPG+EIHL++G+NPPSSGGIVD C KTM+ DEEKIA QQN+SE +NVL RSHLSLRNSLKIEVIDETALVEPVHVS+IGNG+ 
Subjt:  ILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEE

Query:  IGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVV
        IGIVCPTRSMQ++V KSHEP++GGKK  +SRR+ARE K+ EMHWN+ NVNE+DKV+GRQ+NAEGNKI+YSRKDMEALRFVNV+EQKRLWKAIC EL+PVV
Subjt:  IGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVV

Query:  AREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEE
        AREYSSLT     +K GSTS+PRQ L KREEASSII                   REGCSESLDGEI+D+EGDNE  NFV SE SCSL            
Subjt:  AREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEE

Query:  CSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKL
         S+DSDDDK+Y+SIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEAS IPNV VA+VDRSNFKKEQSVYMPVIPAIA+CPEHLLPSKEWE+AFL DFSKL
Subjt:  CSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKL

Query:  RQALSCSDGS-IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTD
        RQALS S+   ++SDF+LHEK D ++P+LI QPSV  LPA+D D  Q EE N STSAKE SCNDYPSLS ISKMN +FRVSSLRKRIN  ETQ TLSR D
Subjt:  RQALSCSDGS-IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTD

Query:  CLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        CLWLFALSAAVDTPLD DTCAAFRSLLRKCASLRAEKTE+DDEVIMLNIL+TI GRYFGQSEN
Subjt:  CLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A6J1CUG9 uncharacterized protein LOC111014690 isoform X12.1e-29273.13Show/hide
Query:  MADEISS---------EFSASEDPQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSAS----------AQENPEFSETSVAKMVVCDSACAS
        MADEISS          FSA+E+PQ  +QP  SAS+IS DE K P I SNQ Q+ +VMIN  +SAS          A+ENPE +ETSV KMVVCDSA   
Subjt:  MADEISS---------EFSASEDPQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSAS----------AQENPEFSETSVAKMVVCDSACAS

Query:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFART
        SENGG+++SLV +++NLD  +E+EKES KVD V  FET  AVEDGNQEVA+DEVEE+D ARS+ S D NQ   KKE VQEV+L+AA+EAD  GKEAFAR+
Subjt:  SENGGNVSSLVGKIQNLD--VELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFART

Query:  EDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVE
        E+LL+KEA S S LEMKKKLLLEELEAMLV       EKG NPP+S GIVDNC    +  + KIA QQN SEN+NVL RS+LSLRNSLKIEVIDETALVE
Subjt:  EDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVE

Query:  PVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRL
        PVHVS+IGNGEE+GIVCP+RSMQIK+NK  EPE+GGKKAKRSRRRAREAKIPE+H  L NVNELDK N RQKN EGNKIVYSRKDMEALRFVNV+EQ+RL
Subjt:  PVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRL

Query:  WKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSL
        WKAIC ELMPVVAREYSSLT SN  MKIGSTS+ +QHLEKR+EASSII                   REGCSESLDGEI+D+EGD+E KNFV S+ SCSL
Subjt:  WKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSL

Query:  SVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKE
        SV+E GNTIL+ECS DSDDDK+Y+SIQRPAFLVEGEP+FDSGPPEDGLEYLRRVRWEASRIPNV VA+VD SN KKEQSVYMPVIPAIA CP+HLLPSK+
Subjt:  SVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKE

Query:  WEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINL
        WEDAFL DFS+LRQALS S+G IQSDF+LHEK D + P+LIDQPSV  LPAN+IDSQQ E+P A+T AKE++C D+PS+S ISKM+SVFRVSSLRKRINL
Subjt:  WEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINL

Query:  LETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        +ET  TLSR DC WLFALSAA+DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILA I GRYFGQSEN
Subjt:  LETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A6J1F307 uncharacterized protein LOC1114392132.0e-29074.47Show/hide
Query:  DEISSEFSASEDPQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVEL
        D  S +FSASE          + S    DE K PLIVSN + +C+V +NS+SSAS +EN   +ETSV KMVVCD   ASSENGGN+ SLV + + LDVEL
Subjt:  DEISSEFSASEDPQSLIQPFGSASKISADE-KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVEL

Query:  EKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLL
         +ESFKVD VH FE IGAVEDGNQEVAMDEVE +D    SV SFDGNQ C KKE VQEV+ + AMEAD   KEAF RTE+LL+KEAD+ESILEMKKKLLL
Subjt:  EKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLL

Query:  EELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSM
        EELEAMLVPGEEIHLEK           DNCGK M+ DEEKIA QQN+SEN +VL +SHLSL NSLKIEVIDETALVEPVHVSKIGNGEEI I+CPTRSM
Subjt:  EELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSM

Query:  QIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGS
        QI V+KSHEPER GKKA+RSRRRAREAKI E+HWNLGNVNELDK     KNAEG+KIVYSRKDMEALRFVNVSEQ RLW+AIC ELMPVVAREYSSLT S
Subjt:  QIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGS

Query:  NYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKF
        NYPMK GSTS PRQH EK EEASS I                   R+GCSESLD EI+D+EGDNE  NF F + SC LSV           SEDS+DD++
Subjt:  NYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKF

Query:  YNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS
        YNSIQRPAFLVEGEPNF+SGPPEDGLEYLRRVRWEAS IPNVAVA+VDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFSKLRQ LSC +G 
Subjt:  YNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGS

Query:  IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAV
        +QSDF+ HEK DSV PD IDQPS+V LPAN+IDSQQ EEPNASTS+KENS N+YPSLS ISKMNSVFRVSSLRKRIN LETQ TLSRTDCLWLFALSAAV
Subjt:  IQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAV

Query:  DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN
        DTPLDADTCA+FRSLLRKCASLRAEK+ELDDEVIMLNILATI GRYFGQSEN
Subjt:  DTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN

A0A6J1IAH4 uncharacterized protein LOC1114707331.6e-28775.87Show/hide
Query:  KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEV
        K PLIVSN + +C+V +NS+SSAS +EN   +ETSV KMVV D   ASSENGGN+ SLV   + LDVEL +ESFKVD VH FE IGAVEDGNQEVAMDEV
Subjt:  KLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEV

Query:  EERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNC
        E +D    SV SFDGNQ C KKE VQEV+ + AMEAD   KEAF RTE+LL+KEAD+ESILEMKKKLLLEELEA+LVPGEEI LEK           DNC
Subjt:  EERDSAR-SVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNC

Query:  GKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPE
        GK M+ DEEKIA QQN+SEN +VL +SHLSL NSLKIEVIDETA+VEPVHVSKIGNGEEI I+CPTRSMQI V+KSHEPER GKKA+RSRRRAREAKI E
Subjt:  GKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPE

Query:  MHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFG
        +HWNLGN+NELDK     KNAEGNKIVYSRKDMEALRFVNVSEQ RLW+AIC ELMPVVAREYSSLT SNYPMK GS+S PRQH EK EEASS I     
Subjt:  MHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFG

Query:  FLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRR
                      R+GCSESLD EI+D+EGDNE  NF F + SC LSV           SEDS+DD++YNSIQRPAFLVEGEPNF+SGPPEDGLEYLRR
Subjt:  FLCVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRR

Query:  VRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPAND
        VRWEAS IPNVAVA+VDRSNFKKE+SVYMPVIPAIA CP++LLPSKEWEDAFL DFSKLRQ LSC +G +QSDF+ HEK DSV PD IDQPSVV LPAND
Subjt:  VRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPAND

Query:  IDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDD
        IDSQQ EEPNASTS+KENSCN+YPSLS ISKMN VFRVSSLRKRIN LETQ TLSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCASLRAEK+ELDD
Subjt:  IDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDD

Query:  EVIMLNILATICGRYFGQSEN
        EVIMLNILATI GRYFGQSEN
Subjt:  EVIMLNILATICGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 21.7e-1229.15Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D    K++QSV +  +      PE   P+ +W+   +  FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDF

Query:  VLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLD
        +    T     D        L      D       N S           P LS +S+MN    V+S+ + ++    +   +     WL+AL A ++ PL 
Subjt:  VLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLD

Query:  ADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE
         +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  ADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE

O42260 Gem-associated protein 26.9e-1428.75Show/hide
Query:  SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDL
        S PP    EYLRRV+ EA+R P+V +A++D    +K+Q+V +  +      P+   PS  W+   +  FS +RQ+L    G  +S  +    ++  +P  
Subjt:  SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDL

Query:  IDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAF
         D+ S         +    +   A  S  ++   DY      P LS +S+M+     S L   +N  E +   +     WL+AL A ++ PL  +  +  
Subjt:  IDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAF

Query:  RSLLRKCASLRA-EKTELDDEVIMLNILATICGRYFGQSE
        R L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  RSLLRKCASLRA-EKTELDDEVIMLNILATICGRYFGQSE

Q54KN2 Gem-associated protein 21.3e-1724.38Show/hide
Query:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKK--EQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQAL-----
        +  Q  AF V  E   D   P  G EYL+RV+W ++R P+V VA +D S  K     + Y  + P+I KC + LLP+  WE  FL+DFS+ RQ L     
Subjt:  NSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKK--EQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQAL-----

Query:  ------SCSDGSIQSDFVLHEKTDSVIPDL---------------------IDQPSVVLLPAND------------------------------------
              + ++ +  ++   +   +++IP L                      +   + +   ND                                    
Subjt:  ------SCSDGSIQSDFVLHEKTDSVIPDL---------------------IDQPSVVLLPAND------------------------------------

Query:  ---IDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTE
            + ++ EE     S K+ +  + P++  + +++ V  V+ +   I  LE +   ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T 
Subjt:  ---IDSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTE

Query:  LDD-EVIMLNILATICGRYFGQSE
        L+D  +  +NIL TI  +YF Q E
Subjt:  LDD-EVIMLNILATICGRYFGQSE

Q9CQQ4 Gem-associated protein 21.0e-1229.06Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D    K++QSV +  +      PE   P+ +W+   +  FS +RQ+            
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDF

Query:  VLHEKTDSVIPDLIDQPSVVLLPANDIDSQQRE---------EPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQATLSR
         +H+  +      +D  S V +P ++ +   ++         E     S +E+   DY      P LS +S+MN     S L    N     +    L R
Subjt:  VLHEKTDSVIPDLIDQPSVVLLPANDIDSQQRE---------EPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQATLSR

Query:  TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE
            W +AL A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE

Q9QZP1 Gem-associated protein 21.7e-1229.06Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDF
        P  L EG   FD S PP    EYLRRV+ EA++ P+V VA++D    K++QSV +  +      PE   P+ +W+   +  FS +RQ+            
Subjt:  PAFLVEGEPNFD-SGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDF

Query:  VLHEKTDSVIPDLIDQPSVVLLPANDIDSQQRE---------EPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQATLSR
         +H+  +      +D  S V +P ++ +   ++         E     S  E+   DY      P LS +S+MN     S L    N     +    L R
Subjt:  VLHEKTDSVIPDLIDQPSVVLLPANDIDSQQRE---------EPNASTSAKENSCNDY------PSLSGISKMNSVFRVSSLRKRINLL---ETQATLSR

Query:  TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE
            W +AL A ++ PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDE-VIMLNILATICGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related8.8e-6531.77Show/hide
Query:  ARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPG---EEIHLEKGNNPPSSGGIVDNCGKT
        A++ +S DG     +KE   + +L + ++      +  A T +      ++  + E+K+    +  E   V G   E ++ E+      +  +++   K 
Subjt:  ARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPG---EEIHLEKGNNPPSSGGIVDNCGKT

Query:  MVTDEEKIAAQQNNSENLNVLGRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMH
         +  E ++ +    S +++ L +   ++ N + KIE++D TALV+ V                           H P+R G  A+  +   R+ K     
Subjt:  MVTDEEKIAAQQNNSENLNVLGRSHLSLRNSL-KIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMH

Query:  WNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFL
          +G+   + + +G + N +  + +Y+RK +E++RF ++  QK LW  + + ++P V  EY SL    Y     S+   R                    
Subjt:  WNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFL

Query:  CVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVR
                                  V G  ES N        +L   EG     E+  + +DD+  YNSI RPAF V+GEP+F +GPPEDGLEYLRRVR
Subjt:  CVILMVSISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVR

Query:  WEASRIPNVAVARVDRSNF-KKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDI
        WEA  IPNV VA++D S + KKEQSVYMP+IP I KCPE+LLP KEWED+ L DF  LRQ L+ S  S + + +  +  + ++ ++ ++           
Subjt:  WEASRIPNVAVARVDRSNF-KKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDI

Query:  DSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKT-ELDD
                +  T   E+       + G   M+SV RVS L+KRI L+E ++ L  +DC W+ AL A+++TPLDADTCA  R LLRKCAS+RAE + E+ D
Subjt:  DSQQREEPNASTSAKENSCNDYPSLSGISKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKT-ELDD

Query:  E--VIMLNILATICGRYFGQ
        E  + M N+L TI GRYFGQ
Subjt:  E--VIMLNILATICGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown1.2e-2470.89Show/hide
Query:  ILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFK-KEQSVYMPVIP
        + EE +ED+DD   YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA RIPNV VA+V  S ++ KEQSVYMP IP
Subjt:  ILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFK-KEQSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown1.7e-3125.8Show/hide
Query:  DSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAA--AMEADGKGK
        +S+  S+  G  +   V KI+++   +E + F V +    E++ + + G        VE           DG+    + + +Q+++  +  + E+D  G 
Subjt:  DSACASSENGGNVSSLVGKIQNLDVELEKESFKVDVVHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAA--AMEADGKGK

Query:  EAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENL-----NVLGRSHLSLRNSLK
        + +   +D +   +D + IL  +KK    +++ +         EK N         D       +D  KI    N S++       V  + +  +  S+ 
Subjt:  EAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNNPPSSGGIVDNCGKTMVTDEEKIAAQQNNSENL-----NVLGRSHLSLRNSLK

Query:  IEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEAL
        I+++D+TAL + V   K G           R +    +K    +      ++   +   + I E +    +    D  N  + N +  +I+YSR  ME++
Subjt:  IEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIPEMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEAL

Query:  RFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESK
        R+ +++ QK+LW  +   L+P +  EY                                              IS V R                   ++
Subjt:  RFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVSISLVHREGCSESLDGEIKDVEGDNESK

Query:  NFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFK-KEQSVYMPVIPAI
        ++V                + EE +ED+DD   YNSI RPAF V+GEP+FDSGPPEDG+EYLRRVRWEA RIPNV VA+V  S ++ KEQSVYMP IP  
Subjt:  NFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRSNFK-KEQSVYMPVIPAI

Query:  AKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDL-IDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNS
            E +                         G +    + H    S IP   +   SV LL   D                 N  +D    +GI     
Subjt:  AKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDL-IDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGISKMNS

Query:  VFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRA
                K     E ++ L  +DC W+ AL A+VDTP DADT A  R+L+RKCASLRA
Subjt:  VFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATGAGATAAGTTCTGAATTCTCAGCATCTGAGGACCCCCAATCTCTTATTCAACCGTTTGGTTCTGCCTCTAAGATCTCTGCCGACGAGAAGCTCCCTTTGAT
CGTCTCGAATCAAAACCAGGAGTGTCAAGTAATGATAAACAGTGCGAGTTCCGCTTCTGCACAAGAGAATCCAGAATTTTCAGAAACTTCTGTCGCAAAGATGGTCGTGT
GCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGGAAACGTGAGCAGTCTAGTGGGCAAGATTCAGAATCTTGATGTGGAGCTCGAAAAAGAATCGTTCAAGGTCGATGTT
GTCCATGGTTTTGAAACGATTGGTGCCGTGGAAGACGGTAATCAAGAAGTTGCGATGGATGAAGTAGAAGAGAGAGATTCTGCAAGAAGTGTACTGAGTTTTGATGGAAA
TCAAGCTTGTACGAAGAAAGAACCTGTTCAAGAAGTTCGGTTGGCTGCTGCTATGGAAGCCGACGGCAAAGGCAAAGAAGCCTTTGCGCGAACCGAGGATTTGTTGAAGA
AAGAAGCTGATTCTGAGAGCATTCTGGAAATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAGAAGAAATTCATCTAGAAAAGGGAAATAAT
CCCCCTAGCTCAGGAGGGATCGTGGATAATTGCGGCAAAACGATGGTTACTGATGAGGAGAAGATTGCTGCTCAGCAAAATAATTCCGAGAACTTGAATGTTCTTGGGCG
AAGTCATTTGTCTCTTAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAACCGGTTCATGTTTCTAAAATTGGAAATGGAGAAGAGATTGGTATAG
TTTGTCCAACAAGGTCAATGCAGATCAAGGTGAACAAATCCCATGAACCCGAAAGAGGGGGGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAAGCAAAGATTCCT
GAGATGCATTGGAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAAAAAAATGCGGAAGGAAACAAGATAGTGTATTCGAGGAAGGATATGGAAGCACT
GAGATTTGTGAATGTTTCAGAACAGAAGAGATTGTGGAAAGCTATATGCAATGAACTTATGCCCGTTGTGGCAAGAGAATACAGTAGCTTAACTGGCTCGAATTACCCAA
TGAAGATAGGCTCCACCTCTGAGCCTAGGCAGCATTTGGAGAAGAGAGAAGAAGCCTCTTCAATTATAATCATGGTTTTTGGTTTTCTTTGTGTCATTTTGATGGTTTCA
ATTTCTCTTGTGCACAGGGAGGGATGTTCAGAAAGCTTGGATGGTGAGATAAAGGACGTGGAAGGTGATAATGAAAGTAAAAATTTTGTATTCTCAGAATCCTCTTGCAG
TCTTAGTGTCAACGAGGGTGGCAATACAATTTTAGAAGAGTGCAGTGAAGATAGTGATGATGATAAATTTTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAG
AACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCGTATTCCAAATGTGGCAGTGGCAAGAGTTGATAGAAGT
AATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAGTGCCCCGAGCATTTACTGCCTTCAAAAGAATGGGAGGATGCATTTCTTGATGATTT
TTCTAAGCTACGTCAGGCTCTGTCATGCTCCGATGGATCTATACAGTCTGATTTTGTCCTCCATGAAAAGACCGATTCTGTAATTCCAGACTTGATTGATCAGCCAAGTG
TTGTCTTGTTGCCTGCCAACGACATCGACTCACAGCAACGCGAGGAACCGAATGCCAGCACTTCTGCAAAGGAAAACAGTTGCAATGATTATCCATCTTTATCAGGAATC
TCAAAGATGAACTCGGTGTTCCGTGTGTCGTCGTTGAGGAAACGTATAAACTTGTTAGAAACACAGGCAACACTGTCAAGGACCGATTGTCTTTGGCTGTTTGCTTTAAG
TGCAGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCCGCTTTCAGGAGTCTGCTTAGGAAATGTGCCAGCTTGCGGGCCGAGAAGACCGAGCTTGACGACGAGGTGA
TAATGCTCAATATTCTTGCCACCATTTGTGGAAGGTACTTTGGACAGTCGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGATGAGATAAGTTCTGAATTCTCAGCATCTGAGGACCCCCAATCTCTTATTCAACCGTTTGGTTCTGCCTCTAAGATCTCTGCCGACGAGAAGCTCCCTTTGAT
CGTCTCGAATCAAAACCAGGAGTGTCAAGTAATGATAAACAGTGCGAGTTCCGCTTCTGCACAAGAGAATCCAGAATTTTCAGAAACTTCTGTCGCAAAGATGGTCGTGT
GCGATTCGGCTTGTGCGTCTTCTGAAAACGGAGGAAACGTGAGCAGTCTAGTGGGCAAGATTCAGAATCTTGATGTGGAGCTCGAAAAAGAATCGTTCAAGGTCGATGTT
GTCCATGGTTTTGAAACGATTGGTGCCGTGGAAGACGGTAATCAAGAAGTTGCGATGGATGAAGTAGAAGAGAGAGATTCTGCAAGAAGTGTACTGAGTTTTGATGGAAA
TCAAGCTTGTACGAAGAAAGAACCTGTTCAAGAAGTTCGGTTGGCTGCTGCTATGGAAGCCGACGGCAAAGGCAAAGAAGCCTTTGCGCGAACCGAGGATTTGTTGAAGA
AAGAAGCTGATTCTGAGAGCATTCTGGAAATGAAAAAGAAATTACTATTGGAAGAACTTGAAGCCATGTTGGTTCCTGGAGAAGAAATTCATCTAGAAAAGGGAAATAAT
CCCCCTAGCTCAGGAGGGATCGTGGATAATTGCGGCAAAACGATGGTTACTGATGAGGAGAAGATTGCTGCTCAGCAAAATAATTCCGAGAACTTGAATGTTCTTGGGCG
AAGTCATTTGTCTCTTAGAAATTCATTGAAGATTGAAGTAATAGACGAGACTGCATTAGTTGAACCGGTTCATGTTTCTAAAATTGGAAATGGAGAAGAGATTGGTATAG
TTTGTCCAACAAGGTCAATGCAGATCAAGGTGAACAAATCCCATGAACCCGAAAGAGGGGGGAAAAAGGCTAAAAGATCGAGGAGGAGGGCAAGGGAAGCAAAGATTCCT
GAGATGCATTGGAATCTGGGGAATGTGAATGAACTTGATAAAGTCAATGGACGTCAAAAAAATGCGGAAGGAAACAAGATAGTGTATTCGAGGAAGGATATGGAAGCACT
GAGATTTGTGAATGTTTCAGAACAGAAGAGATTGTGGAAAGCTATATGCAATGAACTTATGCCCGTTGTGGCAAGAGAATACAGTAGCTTAACTGGCTCGAATTACCCAA
TGAAGATAGGCTCCACCTCTGAGCCTAGGCAGCATTTGGAGAAGAGAGAAGAAGCCTCTTCAATTATAATCATGGTTTTTGGTTTTCTTTGTGTCATTTTGATGGTTTCA
ATTTCTCTTGTGCACAGGGAGGGATGTTCAGAAAGCTTGGATGGTGAGATAAAGGACGTGGAAGGTGATAATGAAAGTAAAAATTTTGTATTCTCAGAATCCTCTTGCAG
TCTTAGTGTCAACGAGGGTGGCAATACAATTTTAGAAGAGTGCAGTGAAGATAGTGATGATGATAAATTTTACAACAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAG
AACCCAATTTTGATTCAGGACCTCCAGAAGATGGACTAGAATATCTTAGACGTGTCAGGTGGGAAGCTTCCCGTATTCCAAATGTGGCAGTGGCAAGAGTTGATAGAAGT
AATTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAGTGCCCCGAGCATTTACTGCCTTCAAAAGAATGGGAGGATGCATTTCTTGATGATTT
TTCTAAGCTACGTCAGGCTCTGTCATGCTCCGATGGATCTATACAGTCTGATTTTGTCCTCCATGAAAAGACCGATTCTGTAATTCCAGACTTGATTGATCAGCCAAGTG
TTGTCTTGTTGCCTGCCAACGACATCGACTCACAGCAACGCGAGGAACCGAATGCCAGCACTTCTGCAAAGGAAAACAGTTGCAATGATTATCCATCTTTATCAGGAATC
TCAAAGATGAACTCGGTGTTCCGTGTGTCGTCGTTGAGGAAACGTATAAACTTGTTAGAAACACAGGCAACACTGTCAAGGACCGATTGTCTTTGGCTGTTTGCTTTAAG
TGCAGCAGTTGATACTCCTCTGGATGCAGATACTTGTGCCGCTTTCAGGAGTCTGCTTAGGAAATGTGCCAGCTTGCGGGCCGAGAAGACCGAGCTTGACGACGAGGTGA
TAATGCTCAATATTCTTGCCACCATTTGTGGAAGGTACTTTGGACAGTCGGAAAATTGA
Protein sequenceShow/hide protein sequence
MADEISSEFSASEDPQSLIQPFGSASKISADEKLPLIVSNQNQECQVMINSASSASAQENPEFSETSVAKMVVCDSACASSENGGNVSSLVGKIQNLDVELEKESFKVDV
VHGFETIGAVEDGNQEVAMDEVEERDSARSVLSFDGNQACTKKEPVQEVRLAAAMEADGKGKEAFARTEDLLKKEADSESILEMKKKLLLEELEAMLVPGEEIHLEKGNN
PPSSGGIVDNCGKTMVTDEEKIAAQQNNSENLNVLGRSHLSLRNSLKIEVIDETALVEPVHVSKIGNGEEIGIVCPTRSMQIKVNKSHEPERGGKKAKRSRRRAREAKIP
EMHWNLGNVNELDKVNGRQKNAEGNKIVYSRKDMEALRFVNVSEQKRLWKAICNELMPVVAREYSSLTGSNYPMKIGSTSEPRQHLEKREEASSIIIMVFGFLCVILMVS
ISLVHREGCSESLDGEIKDVEGDNESKNFVFSESSCSLSVNEGGNTILEECSEDSDDDKFYNSIQRPAFLVEGEPNFDSGPPEDGLEYLRRVRWEASRIPNVAVARVDRS
NFKKEQSVYMPVIPAIAKCPEHLLPSKEWEDAFLDDFSKLRQALSCSDGSIQSDFVLHEKTDSVIPDLIDQPSVVLLPANDIDSQQREEPNASTSAKENSCNDYPSLSGI
SKMNSVFRVSSLRKRINLLETQATLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCASLRAEKTELDDEVIMLNILATICGRYFGQSEN