; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012126 (gene) of Chayote v1 genome

Gene IDSed0012126
OrganismSechium edule (Chayote v1)
DescriptionMis18-binding protein 1-like isoform X1
Genome locationLG12:3890185..3895762
RNA-Seq ExpressionSed0012126
SyntenySed0012126
Gene Ontology termsGO:0000387 - spliceosomal snRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
GO:0032797 - SMN complex (cellular component)
InterPro domainsIPR035426 - Gemin2/Brr1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597639.1 Gem-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]4.7e-23863.86Show/hide
Query:  MADERSSDYGDDFN-------PNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGA
        MA+  SS  GD F+        +    F P DS + I+A+E     I SN + + E  +N +S +S EE  E SVEK+             VCD+ +  +
Subjt:  MADERSSDYGDDFN-------PNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGA

Query:  VEDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQD
           GN                               +   V++ + LDV++     KV +VHDFE I +V+DGNQEV   +              DGNQD
Subjt:  VEDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQD

Query:  FAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------E
         AK+E+VQEVQ S+A+EAD KE   RTE+L RKEAD+ESILEMKKKLLLE+LEAMLV GEEI LEK            +NC K+M +DE          E
Subjt:  FAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------E

Query:  NMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERK
        N NVLR+SHLSL NS+KIEVIDETALVEPVHVSKIGN EE+ I  PTRSMQI V++S EPER  KKA+RSRR+ AREAKI EVH NLGNVN L      K
Subjt:  NMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERK

Query:  KNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV-
        KNA+G+KIVYS+KDMEALRF NV EQ RLWEAI +ELMPVVAREYSSLTSSN+P KTGSTSG +QH EK EEASS IR+G SESLD EIEDMEGDNE   
Subjt:  KNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV-

Query:  --LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDW
            +PSC LSV ED++DDKYY+SIQRPAFLVEGEPNFDSGPPEDGLE+LRR RWEAS IPNV+VAKVDRS+FKKE+SVYMPVIPAIA CPQ+LLP K+W
Subjt:  --LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDW

Query:  EDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFE
        EDAFLADFS LRQ LSC EG +QSDFI HEK + V PD ID+PSVVL ANDIDSQQP+E NAS+S+KENSSN+YPS SAISKMNSVFRVSSLRK INS E
Subjt:  EDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFE

Query:  TQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        TQTTLSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCA+LRAEK+EL+DEVIMLNILATISGRYFGQSEN
Subjt:  TQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

KAG7029080.1 Gem-associated protein 2 [Cucurbita argyrosperma subsp. argyrosperma]2.1e-23863.99Show/hide
Query:  MADERSSDYGDDFN-------PNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGA
        MA+  SS  GD F+        +    F P DS + I+A+E     I SN + + E  +N +S +S EE  E SVEK+             VCD+ +  +
Subjt:  MADERSSDYGDDFN-------PNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGA

Query:  VEDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQD
           GN                               +   V++ + LDV++     KV +VHDFE I +V+DGNQEV   +              DGNQD
Subjt:  VEDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQD

Query:  FAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------E
         AK+E+VQEVQ S+A+EAD KE   RTE+L RKEAD+ESILEMKKKLLLE+LEAMLV GEEI LEK            +NC K+M +DE          E
Subjt:  FAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------E

Query:  NMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERK
        N NVLR+SHLSL NS+KIEVIDETALVEPVHVSKIGN EE+ I  PTRSMQI V++S EPER  KKA+RSRR+ AREAKI EVH NLGNVN L      K
Subjt:  NMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERK

Query:  KNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV-
        KNA+G+KIVYS+KDMEALRF NV EQ RLWEAI +ELMPVVAREYSSLTSSN+P KTGSTSG +QH EK EEASS IR+G SESLD EIEDMEGDNE   
Subjt:  KNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV-

Query:  --LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDW
            +PSC LSV ED++DDKYY+SIQRPAFLVEGEPNFDSGPPEDGLE+LRR RWEAS IPNV VAKVDRS+FKKE+SVYMPVIPAIA CPQ+LLP K+W
Subjt:  --LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDW

Query:  EDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFE
        EDAFLADFS LRQ LSC EG +QSDFI HEK + V PD ID+PSVVL ANDIDSQQP+E NAS+S+KENSSN+YPS SAISKMNSVFRVSSLRK INS E
Subjt:  EDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFE

Query:  TQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        TQTTLSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCA+LRAEK+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  TQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

XP_022932760.1 uncharacterized protein LOC111439213 [Cucurbita moschata]1.6e-23864.9Show/hide
Query:  MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQE
        MA+  SS  GD F+     S     + S    DE     I SN + + E  +N +S +S EE  E SVEK+             VCD+ +  +   GN  
Subjt:  MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQE

Query:  VAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQDFAKEELV
                                     +   V++ + LDVE+     KV +VHDFE I AV+DGNQEV   +              DGNQD AK+E+V
Subjt:  VAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQDFAKEELV

Query:  QEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSVK
        QEVQ S+A+EAD KE   RTE+L RKEAD+ESILEMKKKLLLE+LEAMLV GEEI LEK N   P L  + E        D EN +VLR+SHLSL NS+K
Subjt:  QEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSVK

Query:  IEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEA
        IEVIDETALVEPVHVSKIGN EE+ I+ PTRSMQI V++S EPER  KKA+RSRR+ AREAKI EVH NLGNVN L      KKNA+G+KIVYS+KDMEA
Subjt:  IEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEA

Query:  LRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---LSEPSCNLSVREDND
        LRF NV EQ RLWEAIC+ELMPVVAREYSSLTSSN+P KTGSTSG +QH EK EEASS IR+G SESLD EIEDMEGDNE       +PSC LSV ED++
Subjt:  LRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---LSEPSCNLSVREDND

Query:  DDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSC
        DD+YY+SIQRPAFLVEGEPNF+SGPPEDGLE+LRR RWEAS IPNV VAKVDRS+FKKE+SVYMPVIPAIA CPQ+LLP K+WEDAFLADFSKLRQ LSC
Subjt:  DDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSC

Query:  SEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALS
         EG +QSDFI HEK D V PD ID+PS+VL AN+IDSQQP+E NAS+S+KENSSN+YPS SAISKMNSVFRVSSLRK INS ETQTTLSRTDCLWLFALS
Subjt:  SEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALS

Query:  AAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        AAVDTPLDADTCA+FRSLLRKCA+LRAEK+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  AAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

XP_023540169.1 uncharacterized protein LOC111800622 [Cucurbita pepo subsp. pepo]6.7e-23764.31Show/hide
Query:  MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQE
        MA+  SS  GD F+    +S     + S    DE     I SN + + E  +N +S +S EE  E SVEK+             VCD+ +  +   GN  
Subjt:  MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQE

Query:  VAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQDFAKEELV
                                     +   V++ + LDVE+     KV +VHDFE I AV+DGNQEV   +              DGNQD AK+E+V
Subjt:  VAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQDFAKEELV

Query:  QEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------ENMNVLRR
        QEVQ S+A+EAD KE   RTE+L RKEAD+ESILEMKKKLLLE+LEAMLV GEEI LEK            +NC K+M +DE          EN NVLR+
Subjt:  QEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------ENMNVLRR

Query:  SHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNK
        SHLSL NS+KIEVIDETALVEPVHVSKIGN EE+ I  PTRSMQI V++S EPER  KKA+RSRR+  REAKI EVH NLGNVN L      KKNA+G+K
Subjt:  SHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNK

Query:  IVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---LSEPS
        IVYS+KDMEALRF NV EQ RLWEAIC+ELMPVVAREYSSLTSSN+P KTGSTSG +QH EK EEASS IR+G SESLD EIEDMEGDNE       +PS
Subjt:  IVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---LSEPS

Query:  CNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLAD
        C LSV ED++DDKYY+SIQRPAFLVEGEPNFDSGPPEDGLE+LRR RWEAS IPNV VAKVDRS+FKKE+SVYMPVIPAIA CPQ+LLP K+WEDAFLAD
Subjt:  CNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLAD

Query:  FSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSR
        FSKLRQ LSC EG +QSDFI HEK D V PD ID+PSV+L ANDIDSQ P+E NAS+S+KENS N+YPS SAISKMNSVFRVSSLRK INS ETQTTLSR
Subjt:  FSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSR

Query:  TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        TDCLWLFALSAAVDTPLDADTCA+FRSLLRKCA+LRAEK+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

XP_038901998.1 uncharacterized protein LOC120088652 [Benincasa hispida]4.1e-24265.67Show/hide
Query:  MADERSSDYGDDFNP------NPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAV
        MADE SSDYGD FNP      N  SS +P+DS   I+AD+K    I SN+NQ+ E +IN A+ +S +E  E SV K             S C     G+ 
Subjt:  MADERSSDYGDDFNP------NPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAV

Query:  EDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEIE---TKVGSVHDFETIVAVKDGNQEVG----KKD-------SDGNQDFAK
        E+G                     N  S+          V KIQNLDVE+     KV +VHDFET+ AV+DG Q+V     +KD        DGN D +K
Subjt:  EDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEIE---TKVGSVHDFETIVAVKDGNQEVG----KKD-------SDGNQDFAK

Query:  EELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------ENMN
        EELVQEVQL     A  KE   RTE+L +KE D ESILE+KKKLLLE+L+AMLV G++I LEKG NNPPS  G V++C+K++ IDE          E MN
Subjt:  EELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------ENMN

Query:  VLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNA
        VLRRSHLSL+NS+KIEVIDETALVEPVHVSKIGN E +GIV P RSMQ+ VN+S EP+RG KKAKRSRRK AREAK+ E++ NLGNVN L +VN R+K A
Subjt:  VLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNA

Query:  DGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET---VL
        +GNKIVYS+KDMEALRF NV EQ+RLW+AIC+EL+P VAREYSSLTSSN+P K GSTS  +Q L KREEASSIIREG SESLDGEIEDMEGDNE+   V+
Subjt:  DGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET---VL

Query:  SEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDA
         EPSC+ SV ED D+DKYY SIQRPAFLVEGEPNFDSGPPEDGLE+LRR RWEAS IPNVT+AKVDRS+FKKEQSVYMPVIP IAKCP HLLP K+WE+A
Subjt:  SEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDA

Query:  FLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQT
        FLADFS LR+ALS SE   QSDFILHEK D  IPDLI +P  VL A +IDS Q +E N S+SAKENS NDYPS SAISKMNSVFRVSSL+K INS ETQT
Subjt:  FLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQT

Query:  TLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        TLS+TDCLWLFALSAAVDTPLDADTCAAFRSLLRKCA+LRA+KTELDDEVIMLNIL+TISGRYFGQSEN
Subjt:  TLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

TrEMBL top hitse value%identityAlignment
A0A0A0KXG5 Uncharacterized protein5.6e-22963.08Show/hide
Query:  MADERSSDYGDDFNP------NPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAV
        MADE SSDY D FNP       P S  + VDS   I+AD  N   I SN+N + E +IN  + +S +E  E SV+K+             +CD    G+ 
Subjt:  MADERSSDYGDDFNP------NPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAV

Query:  EDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQ-----EVGKKD-------SDGNQDFA
        E+G                     N  S+          V KIQNLD+E+     KV +VHDF T+   +DG Q     EV  KD        DGNQD A
Subjt:  EDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQ-----EVGKKD-------SDGNQDFA

Query:  KEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSM-----------HIDEEN
        KEELV+E QL     A  KE   RTE L +KE DSESILEMKKKLLLEK++AMLV G+EI L++G +NPPS GG+V+ C K+M           + D E 
Subjt:  KEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSM-----------HIDEEN

Query:  MNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKK
        MNVLRRSHLSL+NS+KIEVIDETALVEPVHVS+IGN E +GIV PTRSMQ+ VN+S EP++G KKAK+SRRK ARE K+ E+H N+GN+N + +VN R++
Subjt:  MNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKK

Query:  NADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET---
        NA+GNKIVYS+KDMEALRF NV EQKRLW+AIC+EL+PVVAREYSSLT      KTGSTS  +Q L KREEASSIIREG SESLDGEIEDM GD+E    
Subjt:  NADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET---

Query:  VLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWE
        V+SEPSC+LS  +D+DDDKYY SIQRPAF VEGEPNFDSGPPEDGLE+LRR RWEAS IPNVTVAKVDRS+FKKEQSVYMPVIPAIA+CP+HLLP K+WE
Subjt:  VLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWE

Query:  DAFLADFSKLRQALSCSEGS-VQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFE
        +AFLADFSKLRQALS SE   ++SDFILHEK D ++P+LI +PS VL AND DS Q  E + S+SAKE S NDYPS SAISKMN +FRVSSLRK INSFE
Subjt:  DAFLADFSKLRQALSCSEGS-VQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFE

Query:  TQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        TQTTLSR DCLWLFALSAAVDTPLD DTCAAFRSLLRKCA+LRAEKTE+D+EVIMLNIL+TISGRYF QSEN
Subjt:  TQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

A0A1S3BZY0 LOW QUALITY PROTEIN: uncharacterized protein LOC1034948754.1e-23263.55Show/hide
Query:  MADERSSDYGDDFNP------NPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAV
        MADE +SDY D FNP      NP S  +PVDS   I+AD  N   I SNRN + E +IN  + +S +E  E SV+K+             +CD    G+ 
Subjt:  MADERSSDYGDDFNP------NPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAV

Query:  EDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD------------SDGNQDFA
        E+G                        S+ SL       V KIQNLDVE+     KV +VHDFET+   +D  QEV   +             DGNQD A
Subjt:  EDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD------------SDGNQDFA

Query:  KEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------ENM
        KEELVQE QL     A  KE   RTE L +KE DSESILEMKKKLLLEK++AMLV G+EI L++G +NPPS GG+V+ C K+M +DE          E M
Subjt:  KEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDE----------ENM

Query:  NVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKN
        NVLRRSHLSL+NS+KIEVIDETALVEPVHVS+IGN + +GIV PTRSMQ+ V +S EP++G KK  +SRRK ARE K+ E+H N+ NVN + +V+ R++N
Subjt:  NVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKN

Query:  ADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET---V
        A+GNKI+YS+KDMEALRF NV EQKRLW+AIC+EL+PVVAREYSSLT      KTGSTS  +Q L KREEASSIIREG SESLDGEIEDMEGDNE    V
Subjt:  ADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET---V

Query:  LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWED
        +SEPSC+LS  +D+DDDKYY SIQRPAFLVEGEPNFDSGPPEDGLE+LRR RWEAS IPNVTVAKVDRS+FKKEQSVYMPVIPAIA+CP+HLLP K+WE+
Subjt:  LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWED

Query:  AFLADFSKLRQALSCSEGS-VQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFET
        AFLADFSKLRQALS SE   ++SDFILHEK D ++P+LI +PS VL A+D D  QP+E N S+SAKE S NDYPS SAISKMN +FRVSSLRK INSFET
Subjt:  AFLADFSKLRQALSCSEGS-VQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFET

Query:  QTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        QTTLSR DCLWLFALSAAVDTPLD DTCAAFRSLLRKCA+LRAEKTE+DDEVIMLNIL+TISGRYFGQSEN
Subjt:  QTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

A0A6J1CUG9 uncharacterized protein LOC111014690 isoform X19.5e-23763.86Show/hide
Query:  MADERSS-DYGDDFN------PNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGA
        MADE SS   GD FN       NP    QP+DS S I+ DE    SIASN+ Q+FE MING + +S                       G   +     A
Subjt:  MADERSS-DYGDDFN------PNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGA

Query:  VEDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLD--VEIE---TKVGSVHDFETIVAVKDGNQEVG-----KKD-------SDGNQ
         E  N E+ +           E M+ C S S    ++   V++++NLD  +EIE   +KV  V  FET  AV+DGNQEV      +KD        D NQ
Subjt:  VEDGNQEVAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLD--VEIE---TKVGSVHDFETIVAVKDGNQEVG-----KKD-------SDGNQ

Query:  DFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCT-----KSMHIDE----E
        DFAK+ELVQEVQLS+AIEADGKE   R+E+L RKEA S S LEMKKKLLLE+LEAMLV      LE+   NPP+  G+V+NC+     K    D+    E
Subjt:  DFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCT-----KSMHIDE----E

Query:  NMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERK
        NMNVLRRS+LSL+NS+KIEVIDETALVEPVHVS+IGN EE+GIV P+RSMQI +N+ +EPE+G KKAKRSRR+ AREAKIPE+H  L NVN L + N R+
Subjt:  NMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERK

Query:  KNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET--
        KN +GNKIVYS+KDMEALRF NV EQ+RLW+AIC+ELMPVVAREYSSLTSSN   K GSTS +KQHLEKR+EASSIIREG SESLDGEIEDMEGD+E   
Subjt:  KNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNET--

Query:  -VLSEPSCNLSVRE-----------DNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAK
         V S+PSC+LSV E           D+DDDKYY SIQRPAFLVEGEP+FDSGPPEDGLE+LRR RWEASRIPNVTVAKVD S+ KKEQSVYMPVIPAIA 
Subjt:  -VLSEPSCNLSVRE-----------DNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAK

Query:  CPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRV
        CPQHLLP K WEDAFLADFS+LRQALS SEG +QSDFILHEK D + P+LID+PS VL AN+IDSQQP++  A++ AKE++  D+PS SAISKM+SVFRV
Subjt:  CPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRV

Query:  SSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        SSLRK IN  ET TTLSR DC WLFALSAA+DTPLDADTCAAFRSLLRKCA+LRAEKTELDDEVIMLNILA ISGRYFGQSEN
Subjt:  SSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

A0A6J1F307 uncharacterized protein LOC1114392137.8e-23964.9Show/hide
Query:  MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQE
        MA+  SS  GD F+     S     + S    DE     I SN + + E  +N +S +S EE  E SVEK+             VCD+ +  +   GN  
Subjt:  MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQE

Query:  VAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQDFAKEELV
                                     +   V++ + LDVE+     KV +VHDFE I AV+DGNQEV   +              DGNQD AK+E+V
Subjt:  VAKKDFDGNQNQEFEAMINCSSISSLEETLEDSVEKIQNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-------------SDGNQDFAKEELV

Query:  QEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSVK
        QEVQ S+A+EAD KE   RTE+L RKEAD+ESILEMKKKLLLE+LEAMLV GEEI LEK N   P L  + E        D EN +VLR+SHLSL NS+K
Subjt:  QEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSVK

Query:  IEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEA
        IEVIDETALVEPVHVSKIGN EE+ I+ PTRSMQI V++S EPER  KKA+RSRR+ AREAKI EVH NLGNVN L      KKNA+G+KIVYS+KDMEA
Subjt:  IEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEA

Query:  LRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---LSEPSCNLSVREDND
        LRF NV EQ RLWEAIC+ELMPVVAREYSSLTSSN+P KTGSTSG +QH EK EEASS IR+G SESLD EIEDMEGDNE       +PSC LSV ED++
Subjt:  LRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---LSEPSCNLSVREDND

Query:  DDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSC
        DD+YY+SIQRPAFLVEGEPNF+SGPPEDGLE+LRR RWEAS IPNV VAKVDRS+FKKE+SVYMPVIPAIA CPQ+LLP K+WEDAFLADFSKLRQ LSC
Subjt:  DDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSC

Query:  SEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALS
         EG +QSDFI HEK D V PD ID+PS+VL AN+IDSQQP+E NAS+S+KENSSN+YPS SAISKMNSVFRVSSLRK INS ETQTTLSRTDCLWLFALS
Subjt:  SEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALS

Query:  AAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        AAVDTPLDADTCA+FRSLLRKCA+LRAEK+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  AAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

A0A6J1IAH4 uncharacterized protein LOC1114707335.6e-23770.15Show/hide
Query:  NQNQEFEAMINCSSISSLEETLEDSVEKI-----------------------QNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-----------
        N + + E  +N SS +S EE  E SVEK+                       + LDVE+     KV SVHDFE I AV+DGNQEV   +           
Subjt:  NQNQEFEAMINCSSISSLEETLEDSVEKI-----------------------QNLDVEI---ETKVGSVHDFETIVAVKDGNQEVGKKD-----------

Query:  --SDGNQDFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENM
           DGNQD AK+E+VQEVQ S+A+EAD KE   RTE+L RKEAD+ESILEMKKKLLLE+LEA+LV GEEI LEK N   P L  + E        D EN 
Subjt:  --SDGNQDFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENM

Query:  NVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKN
        +VLR+SHLSL NS+KIEVIDETA+VEPVHVSKIGN EE+ I+ PTRSMQI V++S EPER  KKA+RSRR+ AREAKI EVH NLGN+N L      KKN
Subjt:  NVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKN

Query:  ADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---
        A+GNKIVYS+KDMEALRF NV EQ RLWEAIC+ELMPVVAREYSSLTSSN+P KTGS+SG +QH EK EEASS IR+G SESLD EIEDMEGDNE     
Subjt:  ADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETV---

Query:  LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWED
          +PSC LSV ED++DD+YY+SIQRPAFLVEGEPNF+SGPPEDGLE+LRR RWEAS IPNV VAKVDRS+FKKE+SVYMPVIPAIA CPQ+LLP K+WED
Subjt:  LSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWED

Query:  AFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQ
        AFLADFSKLRQ LSC EG +QSDFI HEK D V PD ID+PSVVL ANDIDSQQP+E NAS+S+KENS N+YPS SAISKMN VFRVSSLRK INS ETQ
Subjt:  AFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQ

Query:  TTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN
        TTLSRTDCLWLFALSAAVDTPLDADTCA+FRSLLRKCA+LRAEK+ELDDEVIMLNILATISGRYFGQSEN
Subjt:  TTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN

SwissProt top hitse value%identityAlignment
O14893 Gem-associated protein 26.3e-1227.38Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDF
        P  L EG   FD S PP    E+LRR + EA++ P+V VA++D    K++QSV +  +      P+   P   W+   +A FS +RQ ++      +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDF

Query:  ILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAV
        +     +  +P   DE         +  +   +     +  E+   DY      P  S +S+MN    V+S+ ++++++  +   +     WL+AL A +
Subjt:  ILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAV

Query:  DTPLDADTCAAFRSLLRKCANLRAEKTELDDE-VIMLNILATISGRYFGQSE
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  DTPLDADTCAAFRSLLRKCANLRAEKTELDDE-VIMLNILATISGRYFGQSE

O42260 Gem-associated protein 22.6e-1328.87Show/hide
Query:  SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDL
        S PP    E+LRR + EA+R P+V +A++D    +K+Q+V +  +      P    P   W+   +A FS +RQ+L    G  +S  +     +  +P  
Subjt:  SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDL

Query:  IDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFR
         DE S        +    D   A +S  ++   DY      P  S +S+M+     S L   +N FE +         WL+AL A ++ PL  +  +  R
Subjt:  IDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFR

Query:  SLLRKCANLRA-EKTELDDEVIMLNILATISGRYFGQSE
         L R+C+ +RA  + + DD V  LN+   + GRYF Q +
Subjt:  SLLRKCANLRA-EKTELDDEVIMLNILATISGRYFGQSE

Q54KN2 Gem-associated protein 21.5e-0527.5Show/hide
Query:  DSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDD-
        + ++ +E     S K+ +  + P+   + +++ V  V+ +  HI   E +   ++    WL+ L + ++ P+D DTC+  RS +R+ +  R++ T L+D 
Subjt:  DSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDD-

Query:  EVIMLNILATISGRYFGQSE
         +  +NIL TI  +YF Q E
Subjt:  EVIMLNILATISGRYFGQSE

Q9CQQ4 Gem-associated protein 24.8e-1227.38Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDF
        P  L EG   FD S PP    E+LRR + EA++ P+V VA++D    K++QSV +  +      P+   P   W+   +A FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDF

Query:  ILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAV
        +     +  +P   DE         +  +   E     S +E+   DY      P  S +S+MN    ++S+ ++++++  +   +     W +AL A +
Subjt:  ILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAV

Query:  DTPLDADTCAAFRSLLRKCANLRAEKTELDDE-VIMLNILATISGRYFGQSE
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  DTPLDADTCAAFRSLLRKCANLRAEKTELDDE-VIMLNILATISGRYFGQSE

Q9QZP1 Gem-associated protein 22.4e-1126.98Show/hide
Query:  PAFLVEGEPNFD-SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDF
        P  L EG   FD S PP    E+LRR + EA++ P+V VA++D    K++QSV +  +      P+   P   W+   +  FS +RQ++       +S  
Subjt:  PAFLVEGEPNFD-SGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDF

Query:  ILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAV
        +     +  +P   DE         +  +   E     S  E+   DY      P  S +S+MN    ++S+ ++++++  +   +     W +AL A +
Subjt:  ILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDY------PSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAV

Query:  DTPLDADTCAAFRSLLRKCANLRAEKTELDDE-VIMLNILATISGRYFGQSE
        + PL  +  +  R L R+C+ +R      DDE V  LN+L  +  RYF Q +
Subjt:  DTPLDADTCAAFRSLLRKCANLRAEKTELDDE-VIMLNILATISGRYFGQSE

Arabidopsis top hitse value%identityAlignment
AT1G54380.1 spliceosome protein-related2.1e-6332.84Show/hide
Query:  LEDSVEKIQNLDVEIETKVGSVHDFETIVAVKDGNQEVGKKDSDGNQDFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKL
        LE   +  + L+ EI+ +  S  +    + ++DG   V  +D    ++ +  +  ++ ++  A+E    E L   E    +E++++ +LE +K+ LL ++
Subjt:  LEDSVEKIQNLDVEIETKVGSVHDFETIVAVKDGNQEVGKKDSDGNQDFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEMKKKLLLEKL

Query:  EAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSV-KIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPE
        E                    LG +     KS+ +D      L +   ++ N V KIE++D TALV+ VH             +P R             
Subjt:  EAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSV-KIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPE

Query:  RGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTS
                    A  E   P     +G+  N+++ +  + N    + +Y++K +E++RF ++  QK LW  +   ++P V  EY SL    +  K+  ++
Subjt:  RGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTS

Query:  GTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETVLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVT
          +   E   E +    EG         ED E                 +DNDD   Y+SI RPAF V+GEP+F +GPPEDGLE+LRR RWEA  IPNV 
Subjt:  GTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETVLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVT

Query:  VAKVDRSSF-KKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNAS
        VAK+D S++ KKEQSVYMP+IP I KCP++LLP+K+WED+ L DF  LRQ L+ S  S + + I  +     + DL+ E                  N  
Subjt:  VAKVDRSSF-KKEQSVYMPVIPAIAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNAS

Query:  SSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKT-ELDDE--VIMLNILA
           +E+ S      + I  M+SV RVS L+K I   E ++ L  +DC W+ AL A+++TPLDADTCA  R LLRKCA++RAE + E+ DE  + M N+L 
Subjt:  SSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKT-ELDDE--VIMLNILA

Query:  TISGRYFGQ
        TI+GRYFGQ
Subjt:  TISGRYFGQ

AT2G42510.1 FUNCTIONS IN: molecular_function unknown8.2e-2357.73Show/hide
Query:  IEDMEG--DNETVLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFK-KEQSVYMPVIP
        + + EG  +++++L +        EDNDD   Y+SI RPAF V+GEP+FDSGPPEDG+E+LRR RWEA RIPNV VAKV  S ++ KEQSVYMP IP
Subjt:  IEDMEG--DNETVLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFK-KEQSVYMPVIP

AT2G42510.2 FUNCTIONS IN: molecular_function unknown5.6e-3227.99Show/hide
Query:  QNQEFEAMINCSSISSLE-ETLEDSVEKIQNLDVEIETKVGSVHDFETIVAVKDGNQEVGKKDSDGNQDFAKEELV--QEVQLSSAIEADGKEGLVRTED
        +N+  E +   S IS+ E + L D V KI+++ VE +  V S+ D +  +     N E   +DSDG +    E+++   E+   S    + KE L + ++
Subjt:  QNQEFEAMINCSSISSLE-ETLEDSVEKIQNLDVEIETKVGSVHDFETIVAVKDGNQEVGKKDSDGNQDFAKEELV--QEVQLSSAIEADGKEGLVRTED

Query:  LSRKEADSE---------------SILEMKKKLLLEKLEAMLVTGEEIDL------EKGNN-----NPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLK
        +S    +S+               SI +MK+ L  +K       G++ D+      EK N+         +  + +N   S    E+   V  + +  + 
Subjt:  LSRKEADSE---------------SILEMKKKLLLEKLEAMLVTGEEIDL------EKGNN-----NPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLK

Query:  NSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEP--ERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGN-----
         SV I+++D+TAL + V   K G                  + S  P      K A R  +K   E  I +     GN +++V+ N   K +D       
Subjt:  NSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEP--ERGVKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGN-----

Query:  -----KIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETVLS
             +I+YS+  ME++R+ ++  QK+LW  +   L+P +  EY    S+  PR+      T+ ++ K E+                             
Subjt:  -----KIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEASSIIREGWSESLDGEIEDMEGDNETVLS

Query:  EPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFK-KEQSVYMPVIPAIAKCPQHLLPLKDWEDA
                 EDNDD   Y+SI RPAF V+GEP+FDSGPPEDG+E+LRR RWEA RIPNV VAKV  S ++ KEQSVYMP IP      + +         
Subjt:  EPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFK-KEQSVYMPVIPAIAKCPQHLLPLKDWEDA

Query:  FLADFSKLRQA--LSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFET
         L  F  L  +  +   + +  S F+L +K                          D RN  S     +                       K     E 
Subjt:  FLADFSKLRQA--LSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHINSFET

Query:  QTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRA
        ++ L  +DC W+ AL A+VDTP DADT A  R+L+RKCA+LRA
Subjt:  QTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATGAGAGAAGCTCCGACTATGGCGACGACTTCAATCCAAACCCCCACTCTTCTTTTCAACCAGTTGATTCTACATCTATCATTGCTGCCGACGAGAAAAATCT
GTCTTCGATCGCTTCGAATCGGAATCAGGAGTTTGAAGCAATGATAAACGGTGCGAGTTTTTCCTCTCTTGAAGAAACCCTAGAAAAATCTGTCGAAAAGATTCAGAATC
TCGATGTGGAGATCGAAACGAAGGTGGGTTCTGTCTGTGATTTCGAAACGATCGGTGCTGTGGAAGATGGTAATCAAGAAGTTGCGAAGAAAGATTTTGATGGGAATCAG
AATCAGGAGTTCGAAGCAATGATCAACTGTTCGAGCATTTCTTCTCTTGAAGAAACCCTAGAAGATTCTGTCGAAAAGATTCAGAATCTTGATGTGGAGATCGAAACAAA
GGTGGGTTCTGTCCATGATTTCGAAACAATCGTTGCTGTGAAAGATGGTAATCAAGAAGTTGGGAAGAAAGATTCTGATGGGAATCAAGATTTTGCGAAGGAAGAACTTG
TTCAAGAAGTTCAGTTGTCTTCTGCTATTGAAGCCGACGGCAAAGAGGGCTTGGTTCGAACCGAGGATTTGTCGAGGAAAGAAGCTGATTCTGAGAGCATTCTGGAAATG
AAAAAGAAACTACTATTGGAAAAACTTGAAGCCATGTTGGTTACTGGAGAAGAAATTGATCTAGAGAAGGGAAATAATAATCCCCCTAGTTTAGGAGGGATGGTGGAAAA
CTGCACGAAATCGATGCATATTGATGAGGAGAACATGAATGTTCTTAGACGAAGCCATTTGTCTCTAAAGAATTCAGTGAAGATTGAAGTGATAGACGAGACTGCATTAG
TCGAGCCGGTTCATGTTTCCAAGATTGGAAATGTAGAAGAGGTTGGTATAGTTTATCCAACAAGGTCAATGCAGATAGGGGTTAACAGATCCCTAGAGCCCGAGAGAGGG
GTGAAAAAGGCAAAAAGATCGAGGAGGAAGGCGGCAAGGGAAGCGAAGATTCCTGAGGTGCATTTGAATCTGGGGAATGTGAATAATCTTGTTCAAGTCAATGAACGTAA
GAAAAATGCAGATGGAAATAAGATAGTGTATTCGAAGAAGGATATGGAAGCACTGAGGTTTGAAAATGTCGAAGAACAGAAGAGATTGTGGGAAGCTATATGCCAGGAAC
TTATGCCTGTTGTGGCAAGGGAATACAGTAGCTTAACGAGCTCGAATCACCCGAGGAAGACAGGCTCCACCTCTGGTACGAAGCAGCATTTGGAGAAGAGGGAAGAAGCT
TCTTCAATTATAAGGGAGGGATGGTCAGAGAGCTTGGATGGTGAGATAGAGGACATGGAAGGTGATAATGAAACTGTACTCTCTGAACCCTCTTGCAATCTTAGTGTCCG
CGAGGATAATGACGATGATAAATATTACAGCAGTATTCAGAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGATTAGAATTTC
TTAGACGTGCCAGGTGGGAAGCTTCCCGTATTCCAAATGTGACGGTGGCAAAAGTTGATAGAAGCAGTTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTGATTCCTGCA
ATTGCCAAGTGCCCTCAGCATTTACTGCCTTTAAAAGACTGGGAGGATGCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGCTTTGTCATGCTCCGAGGGCTCTGTGCA
GTCCGATTTCATCCTCCACGAAAAGTTCGATTTTGTAATTCCGGACTTGATTGATGAGCCTAGTGTTGTCTTGACTGCCAACGACATCGACTCACAGCAACCTGATGAAC
GGAATGCCAGCTCTTCTGCAAAGGAAAACAGTAGCAATGATTATCCATCTTTTTCAGCAATCTCAAAGATGAACTCGGTGTTTCGTGTTTCGTCGTTGAGGAAGCATATA
AACTCGTTTGAAACACAGACAACACTCTCAAGGACTGATTGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATACTCCTTTGGATGCAGATACTTGTGCCGCTTTCAG
GAGTCTGCTTCGGAAATGTGCCAACTTGCGAGCCGAGAAGACCGAGCTTGACGACGAGGTGATAATGCTCAACATTCTTGCCACCATTTCTGGAAGGTACTTCGGACAGT
CGGAAAATTGA
mRNA sequenceShow/hide mRNA sequence
ACCCCATTTTTCATTACACTTCTAGTCAGTGGCGCGTCTAAAAAGTAAAACCCTTCTTCTTTCTTCGAGCAATGGCGGATGAGAGAAGCTCCGACTATGGCGACGACTTC
AATCCAAACCCCCACTCTTCTTTTCAACCAGTTGATTCTACATCTATCATTGCTGCCGACGAGAAAAATCTGTCTTCGATCGCTTCGAATCGGAATCAGGAGTTTGAAGC
AATGATAAACGGTGCGAGTTTTTCCTCTCTTGAAGAAACCCTAGAAAAATCTGTCGAAAAGATTCAGAATCTCGATGTGGAGATCGAAACGAAGGTGGGTTCTGTCTGTG
ATTTCGAAACGATCGGTGCTGTGGAAGATGGTAATCAAGAAGTTGCGAAGAAAGATTTTGATGGGAATCAGAATCAGGAGTTCGAAGCAATGATCAACTGTTCGAGCATT
TCTTCTCTTGAAGAAACCCTAGAAGATTCTGTCGAAAAGATTCAGAATCTTGATGTGGAGATCGAAACAAAGGTGGGTTCTGTCCATGATTTCGAAACAATCGTTGCTGT
GAAAGATGGTAATCAAGAAGTTGGGAAGAAAGATTCTGATGGGAATCAAGATTTTGCGAAGGAAGAACTTGTTCAAGAAGTTCAGTTGTCTTCTGCTATTGAAGCCGACG
GCAAAGAGGGCTTGGTTCGAACCGAGGATTTGTCGAGGAAAGAAGCTGATTCTGAGAGCATTCTGGAAATGAAAAAGAAACTACTATTGGAAAAACTTGAAGCCATGTTG
GTTACTGGAGAAGAAATTGATCTAGAGAAGGGAAATAATAATCCCCCTAGTTTAGGAGGGATGGTGGAAAACTGCACGAAATCGATGCATATTGATGAGGAGAACATGAA
TGTTCTTAGACGAAGCCATTTGTCTCTAAAGAATTCAGTGAAGATTGAAGTGATAGACGAGACTGCATTAGTCGAGCCGGTTCATGTTTCCAAGATTGGAAATGTAGAAG
AGGTTGGTATAGTTTATCCAACAAGGTCAATGCAGATAGGGGTTAACAGATCCCTAGAGCCCGAGAGAGGGGTGAAAAAGGCAAAAAGATCGAGGAGGAAGGCGGCAAGG
GAAGCGAAGATTCCTGAGGTGCATTTGAATCTGGGGAATGTGAATAATCTTGTTCAAGTCAATGAACGTAAGAAAAATGCAGATGGAAATAAGATAGTGTATTCGAAGAA
GGATATGGAAGCACTGAGGTTTGAAAATGTCGAAGAACAGAAGAGATTGTGGGAAGCTATATGCCAGGAACTTATGCCTGTTGTGGCAAGGGAATACAGTAGCTTAACGA
GCTCGAATCACCCGAGGAAGACAGGCTCCACCTCTGGTACGAAGCAGCATTTGGAGAAGAGGGAAGAAGCTTCTTCAATTATAAGGGAGGGATGGTCAGAGAGCTTGGAT
GGTGAGATAGAGGACATGGAAGGTGATAATGAAACTGTACTCTCTGAACCCTCTTGCAATCTTAGTGTCCGCGAGGATAATGACGATGATAAATATTACAGCAGTATTCA
GAGACCTGCCTTTCTGGTGGAGGGAGAACCCAATTTTGATTCAGGACCTCCAGAAGATGGATTAGAATTTCTTAGACGTGCCAGGTGGGAAGCTTCCCGTATTCCAAATG
TGACGGTGGCAAAAGTTGATAGAAGCAGTTTTAAGAAAGAGCAAAGTGTTTATATGCCAGTGATTCCTGCAATTGCCAAGTGCCCTCAGCATTTACTGCCTTTAAAAGAC
TGGGAGGATGCATTTCTTGCTGATTTTTCTAAGCTGCGTCAGGCTTTGTCATGCTCCGAGGGCTCTGTGCAGTCCGATTTCATCCTCCACGAAAAGTTCGATTTTGTAAT
TCCGGACTTGATTGATGAGCCTAGTGTTGTCTTGACTGCCAACGACATCGACTCACAGCAACCTGATGAACGGAATGCCAGCTCTTCTGCAAAGGAAAACAGTAGCAATG
ATTATCCATCTTTTTCAGCAATCTCAAAGATGAACTCGGTGTTTCGTGTTTCGTCGTTGAGGAAGCATATAAACTCGTTTGAAACACAGACAACACTCTCAAGGACTGAT
TGTCTTTGGCTGTTTGCTTTAAGTGCAGCAGTTGATACTCCTTTGGATGCAGATACTTGTGCCGCTTTCAGGAGTCTGCTTCGGAAATGTGCCAACTTGCGAGCCGAGAA
GACCGAGCTTGACGACGAGGTGATAATGCTCAACATTCTTGCCACCATTTCTGGAAGGTACTTCGGACAGTCGGAAAATTGAAGCTATGTTTACTCCCAAGTTTAAATAG
CATCTGGTATTAGGATGGACTATGATTTTAGTTTTACTATTGTTTTAATGTGACAAGTTACTTGATACGAAAAATGAGCTTTGTAATGGACACGACATATTGTTTTAATG
ATATTACTTTCAAATCA
Protein sequenceShow/hide protein sequence
MADERSSDYGDDFNPNPHSSFQPVDSTSIIAADEKNLSSIASNRNQEFEAMINGASFSSLEETLEKSVEKIQNLDVEIETKVGSVCDFETIGAVEDGNQEVAKKDFDGNQ
NQEFEAMINCSSISSLEETLEDSVEKIQNLDVEIETKVGSVHDFETIVAVKDGNQEVGKKDSDGNQDFAKEELVQEVQLSSAIEADGKEGLVRTEDLSRKEADSESILEM
KKKLLLEKLEAMLVTGEEIDLEKGNNNPPSLGGMVENCTKSMHIDEENMNVLRRSHLSLKNSVKIEVIDETALVEPVHVSKIGNVEEVGIVYPTRSMQIGVNRSLEPERG
VKKAKRSRRKAAREAKIPEVHLNLGNVNNLVQVNERKKNADGNKIVYSKKDMEALRFENVEEQKRLWEAICQELMPVVAREYSSLTSSNHPRKTGSTSGTKQHLEKREEA
SSIIREGWSESLDGEIEDMEGDNETVLSEPSCNLSVREDNDDDKYYSSIQRPAFLVEGEPNFDSGPPEDGLEFLRRARWEASRIPNVTVAKVDRSSFKKEQSVYMPVIPA
IAKCPQHLLPLKDWEDAFLADFSKLRQALSCSEGSVQSDFILHEKFDFVIPDLIDEPSVVLTANDIDSQQPDERNASSSAKENSSNDYPSFSAISKMNSVFRVSSLRKHI
NSFETQTTLSRTDCLWLFALSAAVDTPLDADTCAAFRSLLRKCANLRAEKTELDDEVIMLNILATISGRYFGQSEN