; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg25126 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg25126
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationCarg_Chr13:5475317..5481017
RNA-Seq ExpressionCarg25126
SyntenyCarg25126
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583754.1 U4/U6.U5 tri-snRNP-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]0.0e+00100Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
        MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL

Query:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
        DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
Subjt:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK

Query:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
        KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
Subjt:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA

Query:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
        QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
Subjt:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ

Query:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
        VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
Subjt:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE

Query:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_022927367.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Cucurbita moschata]0.0e+00100Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
        MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL

Query:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
        DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
Subjt:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK

Query:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
        KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
Subjt:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA

Query:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
        QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
Subjt:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ

Query:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
        VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
Subjt:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE

Query:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_023001558.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Cucurbita maxima]0.0e+0099.45Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
        MGSKRQNDSVVDEEELGPDLKRHKSLGE SPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGD TDDEENDEDENHIMRSRDVEVRKDCPYL
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL

Query:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
        DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
Subjt:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK

Query:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
        KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
Subjt:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA

Query:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
        QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
Subjt:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ

Query:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
        VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
Subjt:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE

Query:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_023519495.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Cucurbita pepo subsp. pepo]0.0e+0099.82Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
        MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL

Query:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
        DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
Subjt:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK

Query:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
        KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
Subjt:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA

Query:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
        QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
Subjt:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ

Query:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
        VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
Subjt:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE

Query:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]2.2e-30194.72Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDE----DENHIMRSRDVEVRKD
        MGSKR+N+S++DEEELGPDLKRHK LGE SPSSPPASENPQLPGFNYGDD+EEE+YK KQNGS YDGDEGD  DDEE+DE    D NH+ RSRDVEVRKD
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDE----DENHIMRSRDVEVRKD

Query:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ
        CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ
Subjt:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ

Query:  LDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
        LDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
Subjt:  LDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF

Query:  RIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN
        RIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDAGTE SSV+METSRMPFLMLGLDLPPPPLFKDVMEKN
Subjt:  RIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN

Query:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG
        IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK++EKL SKYDLIANIVHDG
Subjt:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG

Query:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein1.2e-29793.82Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSP-SSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDED----ENHIMRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE SP SSPPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSP-SSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDED----ENHIMRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like2.7e-29793.45Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSP-SSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDE----NHIMRSRDVEVRK
        MGSKR+N+S++DEEELGPDLKRHK LGE SP SSPPASENPQLPGFNYGDDDEEED+K KQNGS YD DEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSP-SSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDE----NHIMRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAGT+GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1CND9 U4/U6.U5 tri-snRNP-associated protein 2-like2.1e-29493.44Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSS-PPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDE---DENHIMRSRDVEVRKD
        MGSKR++D+ VDEEEL P++KR K LGE SPSS PPASENP+LPGFNYGDDDEEEDYK KQNGS   GD GD  DDEE+DE   D NH+ RSRDVEVRKD
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSS-PPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDE---DENHIMRSRDVEVRKD

Query:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ
        CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF KEQVE 
Subjt:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ

Query:  LDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
        LDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
Subjt:  LDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF

Query:  RIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN
        RIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN
Subjt:  RIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN

Query:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG
        IIPQVPLFNILKKFDGE ITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE+EKLRSKYDLIANIVHDG
Subjt:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG

Query:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like0.0e+00100Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
        MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL

Query:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
        DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
Subjt:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK

Query:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
        KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
Subjt:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA

Query:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
        QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
Subjt:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ

Query:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
        VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
Subjt:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE

Query:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like0.0e+0099.45Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL
        MGSKRQNDSVVDEEELGPDLKRHKSLGE SPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGD TDDEENDEDENHIMRSRDVEVRKDCPYL
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYL

Query:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
        DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK
Subjt:  DTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKK

Query:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
        KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA
Subjt:  KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGA

Query:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
        QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ
Subjt:  QSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQ

Query:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
        VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE
Subjt:  VPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNE

Query:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD14.2e-4527.99Show/hide
Query:  DEENDEDENHIMRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGY
        D +    E+ + +    +++   P   YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP   
Subjt:  DEENDEDENHIMRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGY

Query:  EINDPS----LDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTR
        +I        L+ I++   P +  + +E   ++      L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R     +
Subjt:  EINDPS----LDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTR

Query:  KIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEG
        KIW  + FK  +S  +F+      S  + R G   +P++   F+ W  N + S    S    SI+    +G++++           K EN  +      G
Subjt:  KIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEG

Query:  SSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKN
          ++      PF +L LDLP    F+D    + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+          + PVKN
Subjt:  SSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKN

Query:  LELKDYIPLPTPKESEKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
           ++   +    E E L  KY L AN+VH        DG    G    ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  LELKDYIPLPTPKESEKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 23.7e-13452.99Show/hide
Query:  DDEENDEDENHIMRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        D++   E E      R D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DDEENDEDENHIMRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK+ + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RVSKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RVSKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA

Query:  GTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVN
          E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNPT+VN
Subjt:  GTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVN

Query:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        FP+ N++L++Y  L    ++    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 23.7e-13452.99Show/hide
Query:  DDEENDEDENHIMRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        D++   E E      R D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DDEENDEDENHIMRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK+ + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RVSKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RVSKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA

Query:  GTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVN
          E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNPT+VN
Subjt:  GTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVN

Query:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        FP+ N++L++Y  L    ++    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 23.1e-13352.78Show/hide
Query:  DDEENDEDENHIMRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        D++   E E      R D E R+   CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DDEENDEDENHIMRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK+ + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RVSKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RVSKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA

Query:  GTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVN
          E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNPT+VN
Subjt:  GTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVN

Query:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        FP+ N++L++Y+        E   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp102.3e-10442.8Show/hide
Query:  DEDENHIMRSRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYE
        D ++ H + S+++E  +  P      YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y LP+ Y+
Subjt:  DEDENHIMRSRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYE

Query:  INDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHA
        +   +L DI YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+FL+ +N+ +C   LV R   L RK+W+ 
Subjt:  INDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHA

Query:  RNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSK----KSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVI
        + FK  VSP E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ + +  E G  +     G  VI
Subjt:  RNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSK----KSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVI

Query:  METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELK
         +T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H++RF KNN+F E+N T+V FP+ + ++ 
Subjt:  METSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELK

Query:  DYIPLPTPKESEKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
         +I     + + K+ +KY+L+ANI+H+     +     +R+ ++  S   WY++QDL+V E    M+ L E+++Q++ER
Subjt:  DYIPLPTPKESEKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

Arabidopsis top hitse value%identityAlignment
AT1G32850.1 ubiquitin-specific protease 119.1e-1128.65Show/hide
Query:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------RIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP
        P + K+V+ K  + + + LF+ L+ F  E   E + P            R A  +  + +LP  L+ H++RFT + +F  K  TLVNF + +L+L  Y+ 
Subjt:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------RIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP

Query:  LPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
            K  +     Y+L A   H G    G+Y  + +   E  WY   D  VS      +  S AY+  Y+R
Subjt:  LPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein4.1e-22171.53Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGES-SPSSPPASENPQLPGFN-YGDDDEEED---YKSKQNGSGYDGDEGDGT-----------DDEENDEDENH
        M  +R+  + V EEE   ++KR + +  S SP  P    NP LP  N Y DDDEEE+    KS+  G+G    EG+G            DDE++D  +  
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGES-SPSSPPASENPQLPGFN-YGDDDEEED---YKSKQNGSGYDGDEGDGT-----------DDEENDEDENH

Query:  IMRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV
           SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+V
Subjt:  IMRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV

Query:  LNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF
        LNPRF++ QV +LDK +QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEF
Subjt:  LNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF

Query:  LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLP
        LQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G E      E SRM FLMLGLDLP
Subjt:  LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLP

Query:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKESEKLR
        PPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + 
Subjt:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKESEKLR

Query:  SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein3.0e-21670.86Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMR---SRDVEVRKDC
        M  +R+  + V EEE   ++KR + +  S    PP                     K + NG+   G+     DD+E+D+D+    R   SR VEVR+DC
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMR---SRDVEVRKDC

Query:  PYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQL
        PYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +L
Subjt:  PYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQL

Query:  DKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFR
        DK +QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFR
Subjt:  DKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFR

Query:  IGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNI
        IG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G E      E SRMPFLMLGLDLPPPPLFKDVMEKNI
Subjt:  IGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNI

Query:  IPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKESEKLRSKYDLIANIVHDG
        IPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + SKY+LIANIVHDG
Subjt:  IPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKESEKLRSKYDLIANIVHDG

Query:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein1.0e-21971.25Show/hide
Query:  MGSKRQNDSVVDEEELGPDLKRHKSLGES-SPSSPPASENPQLPGFNYGDDDEEEDYKSKQ----------NGSGYDGDEGDGTDDEENDEDENHIMR--
        M  +R+  + V EEE   ++KR + +  S SP  P    NP LP  N  DDD  +  KS+           NG+   G+     DD+E+D+D+    R  
Subjt:  MGSKRQNDSVVDEEELGPDLKRHKSLGES-SPSSPPASENPQLPGFNYGDDDEEEDYKSKQ----------NGSGYDGDEGDGTDDEENDEDENHIMR--

Query:  -SRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN
         SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLN
Subjt:  -SRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN

Query:  PRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ
        PRF++ QV +LDK +QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQ
Subjt:  PRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ

Query:  AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPP
        AVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G E      E SRMPFLMLGLDLPPP
Subjt:  AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPP

Query:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKESEKLRSK
        PLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + SK
Subjt:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKESEKLRSK

Query:  YDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        Y+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  YDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein1.1e-16279.15Show/hide
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDK +Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IPENYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRVSKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCAAAGAGGCAAAATGATAGTGTTGTAGATGAGGAAGAGTTGGGTCCAGACTTAAAAAGGCATAAATCACTTGGAGAATCGTCACCTTCTTCTCCACCTGCCTC
AGAGAACCCTCAGCTTCCCGGTTTTAACTATGGCGACGATGATGAAGAAGAAGATTACAAATCTAAACAAAATGGAAGTGGATATGATGGAGATGAAGGGGATGGTACTG
ATGATGAAGAAAATGATGAAGATGAAAATCATATAATGCGTAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACCGTAAACCGTCAGGTTTTGGATTTC
GATTTTGAAAAGTTTTGTTCTGTCTCTCTGTCAAATCTGAATGTTTATGCCTGCCTAGTATGTGGCAAGTATTATCAAGGGAGAGGGAAGAAGTCTCATGCTTACACTCA
TAGTCTTGAAGCAGGACACCATGTGTACATCAACCTACGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGACCCCTCGTTGGATGATATTCGATATG
TCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAGAGCAGCTTGACAAGAAGAAGCAATGGTCTAGAGCACTTGATGGTTCTGATTACCTTCCAGGAATGGTGGGGCTTAAC
AACATTAAAGAAACTGATTTTGTAAATGTGACAATTCAATCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTCATACCGGAGAACTATCAGCACTGCAGATCTCC
ACTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCACGAAACTTCAAGGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTATGAAGGCTAGTAAAA
AGCGTTTCCGAATAGGTGCACAGTCGGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTACATTCAGAACTGCGAGTTTCAAAGAAAAGTAGCAGTATAATCTAC
GAGTGTTTCCAGGGAGAACTGGAGGTTGTCAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAACTGAAGGTAGCAGTGT
CATAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATTTACCGCCGCCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTTCCACTCT
TCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGGATGCGCTACCGTGTTACTCGATTGCCTCAGTATTTAATTCTTCAT
ATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACTTTAGTGAACTTTCCCGTCAAGAATCTAGAATTGAAGGATTACATCCCCTTGCCAACTCCGAA
AGAGAGTGAAAAATTACGTTCGAAGTACGATTTAATTGCAAATATTGTTCATGATGGCAAACCGAATGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAAT
TATGGTACGAGATGCAGGATCTTCACGTCTCAGAAACACTTCCTCAAATGGTTGCTCTATCTGAGGCTTATATGCAGATATATGAACGGCAGCAATAG
mRNA sequenceShow/hide mRNA sequence
CTGGAATTTCGCGGGTTTTTCCAAACATTCACCTCTTATGTAAGCCCCTTTCATGCTCAGTTTCCTCCCAATTTAGCATCGTGGTTGTATTGTTCTCCCTCCTATGAACT
CAAGGCATGCTTCAACTTGCCAAAACATCCATGTAAATCTCCTTGAATCTCAGATTTTCAACTTGGTTTTGGGTGGGTTTGTTTCTGCACACGTTCCTGGTTTTTCCCTT
GCTCAAGCTCACTGTAGTCTACTTAAAATGCCCTTTAACACTCCTTAACCTTCTGCCAATTCACTTGTTTTCCACCACGGCCTAAGATTATTCTCCACCTACGTGTGTTC
CTTTTCTCCCGCCATAGTAGCAGCGTTTTTTTTAATAGGATGTTATATCATATATATTCGTATTATTTATAAATTTGCTCTGTTGCCAGAGACGAAAAAGGGACGATAGA
ATGGGATCAAAGAGGCAAAATGATAGTGTTGTAGATGAGGAAGAGTTGGGTCCAGACTTAAAAAGGCATAAATCACTTGGAGAATCGTCACCTTCTTCTCCACCTGCCTC
AGAGAACCCTCAGCTTCCCGGTTTTAACTATGGCGACGATGATGAAGAAGAAGATTACAAATCTAAACAAAATGGAAGTGGATATGATGGAGATGAAGGGGATGGTACTG
ATGATGAAGAAAATGATGAAGATGAAAATCATATAATGCGTAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACCGTAAACCGTCAGGTTTTGGATTTC
GATTTTGAAAAGTTTTGTTCTGTCTCTCTGTCAAATCTGAATGTTTATGCCTGCCTAGTATGTGGCAAGTATTATCAAGGGAGAGGGAAGAAGTCTCATGCTTACACTCA
TAGTCTTGAAGCAGGACACCATGTGTACATCAACCTACGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGACCCCTCGTTGGATGATATTCGATATG
TCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAGAGCAGCTTGACAAGAAGAAGCAATGGTCTAGAGCACTTGATGGTTCTGATTACCTTCCAGGAATGGTGGGGCTTAAC
AACATTAAAGAAACTGATTTTGTAAATGTGACAATTCAATCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTCATACCGGAGAACTATCAGCACTGCAGATCTCC
ACTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCACGAAACTTCAAGGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTATGAAGGCTAGTAAAA
AGCGTTTCCGAATAGGTGCACAGTCGGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTACATTCAGAACTGCGAGTTTCAAAGAAAAGTAGCAGTATAATCTAC
GAGTGTTTCCAGGGAGAACTGGAGGTTGTCAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAACTGAAGGTAGCAGTGT
CATAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATTTACCGCCGCCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTTCCACTCT
TCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGGATGCGCTACCGTGTTACTCGATTGCCTCAGTATTTAATTCTTCAT
ATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACTTTAGTGAACTTTCCCGTCAAGAATCTAGAATTGAAGGATTACATCCCCTTGCCAACTCCGAA
AGAGAGTGAAAAATTACGTTCGAAGTACGATTTAATTGCAAATATTGTTCATGATGGCAAACCGAATGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAAT
TATGGTACGAGATGCAGGATCTTCACGTCTCAGAAACACTTCCTCAAATGGTTGCTCTATCTGAGGCTTATATGCAGATATATGAACGGCAGCAATAGATAGGAAGTTCA
ATAGCGCAATTAGTTGGTTTATTTCCCCTCTACAATATATGCATCTGTACTTAAACAGGAGATGATAGTAGATGAAGGAAGCTTCTTGCTGCTAATCTCTTCGTTTAATT
GTAGAATCAATTGAACATACTTTGGGAAATTGGACTGGAAAGGTCTGGACAATGCTGCTGCTGATGCTGCTGCACGCCAATTTCTGTAAAGCAGGTGAAATTTTGTTGCC
TAAATTACTTGAGTTTTTCTGTAACAAACATAACATTGATGATTATGATATAGATAGTAAGGCAACGGGATGAGTTTTATGAGTTTGGAGGATATGTATTTGCAGAGATG
TGGTATGAGAAGGCATAACTGTGAGAGTTTGTACTTTTGAGTATCTACTGAATTTTTGTTACTTGTTGGTAGGCTGTGTAGAACATATATTCTTGAGTTTGCAGCCCAAT
AGACAGACTCCTCTGATTAATAACACCCTTTTGTACGCCCCTCTTCCCCTCACCTTCCTCCCAAATTCTCATCTTC
Protein sequenceShow/hide protein sequence
MGSKRQNDSVVDEEELGPDLKRHKSLGESSPSSPPASENPQLPGFNYGDDDEEEDYKSKQNGSGYDGDEGDGTDDEENDEDENHIMRSRDVEVRKDCPYLDTVNRQVLDF
DFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKKKQWSRALDGSDYLPGMVGLN
NIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRVSKKSSSIIY
ECFQGELEVVKEIHSKALTEKKENGDDQDAGTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILH
MRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ