; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G15940 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G15940
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationChr1:11444424..11451041
RNA-Seq ExpressionCSPI01G15940
SyntenyCSPI01G15940
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo]0.0e+0097.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYD DEGDYNDDEEDDEE+D++GNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGT+GSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus]0.0e+00100Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_023001558.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Cucurbita maxima]2.3e-29893.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE+SP SSPPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida]2.7e-30796.36Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSP SSPPASENPQLPGFNYGDD+EEE++KFKQNGS+YDGDEGD NDDEEDDEE+D++ N VKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTE SSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDN+KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]2.7e-30796.36Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSP SSPPASENPQLPGFNYGDD+EEE++KFKQNGS+YDGDEGD NDDEEDDEE+D++ N VKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTE SSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDN+KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein0.0e+00100Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like0.0e+0097.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYD DEGDYNDDEEDDEE+D++GNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGT+GSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A5D3E5N4 U4/U6.U5 tri-snRNP-associated protein 2-like4.7e-29793.45Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLL EVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNG                          +KRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGT+GSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like4.2e-29893.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE SP SSPPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like1.1e-29893.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE+SP SSPPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD19.4e-4528.77Show/hide
Query:  EDDEEYDNNGNQVKRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD
        E D +  ++ +++K+    +++   P   YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP 
Subjt:  EDDEEYDNNGNQVKRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD

Query:  GYEINDPS----LDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGEL
          +I        L+ I++   P +  + +E   +       L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R    
Subjt:  GYEINDPS----LDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGEL

Query:  TRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGT
         +KIW  + FK  +S  +F+      S  + R G   +P++   F+ W  N + S     K   SI+    +G++++ K       +E K    E     
Subjt:  TRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGT

Query:  EGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEK--NPTLVNF
           SV  +    PF +L LDLP    F+D    + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+    K  N TLV F
Subjt:  EGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEK--NPTLVNF

Query:  PVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
            LE+              L  KY L AN+VH        DG    G    ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  PVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 22.7e-13252.11Show/hide
Query:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE
        E + ++D E + E      +V    D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T 
Subjt:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE

Query:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----
        K YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL   NY++ + P     
Subjt:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----

Query:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK
          LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H     E+K
Subjt:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK

Query:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN
        E    N E Q+   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKN
Subjt:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN

Query:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NFFVEKNPT+VNFP+ N++L++Y+       +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 22.1e-13252.11Show/hide
Query:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE
        E + ++D E + E      +V    D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T 
Subjt:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE

Query:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----
        K YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL   NY++ + P     
Subjt:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----

Query:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK
          LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H     E+K
Subjt:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK

Query:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN
        E    N E Q+   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKN
Subjt:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN

Query:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NFFVEKNPT+VNFP+ N++L++Y+       +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 27.8e-13251.91Show/hide
Query:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE
        E + ++D E + E      +V    D E R+   CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T 
Subjt:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE

Query:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----
        K YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL   NY++ + P     
Subjt:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----

Query:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK
          LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H     E+K
Subjt:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK

Query:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN
        E    N E Q+   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKN
Subjt:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN

Query:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NFFVEKNPT+VNFP+ N++L++Y+       ++   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp106.9e-10443.4Show/hide
Query:  SRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI
        S+++E  +  P      YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y LP+ Y++   +L DI
Subjt:  SRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI

Query:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP
         YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+FL+  N+ +C   LV R   L RK+W+ + FK  VSP
Subjt:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP

Query:  HEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFL
         E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ + +  E GE+      G  V ++T+ +PFL
Subjt:  HEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFL

Query:  MLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK
         L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H++RF KNN+F E+N T+V FP+ + ++  +I     +
Subjt:  MLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK

Query:  DNDKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
         N K+ +KY+L+ANI+H+     +     +R+ ++  S   WY++QDL+V E    M+ L E+++Q++ER
Subjt:  DNDKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

Arabidopsis top hitse value%identityAlignment
AT1G32850.1 ubiquitin-specific protease 111.8e-1129.24Show/hide
Query:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------RIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP
        P + K+V+ K  + + + LF+ L+ F  E   E + P            R A  +  + +LP  L+ H++RFT + +F  K  TLVNF + +L+L  Y+ 
Subjt:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------RIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP

Query:  LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
            K+ D     Y+L A   H G    G+Y  + +   E  WY   D  VS      +  S AY+  Y+R
Subjt:  LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein2.1e-22070.62Show/hide
Query:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDFKFKQ-------------NGSKYDGDEGDYNDDEEDDEEYDNN
        G +   N + +EE    ++KR +++ E S S  PP    NP LP  N Y DDDEEE+ + K+             NG+K  G+  +  DD+EDD+     
Subjt:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDFKFKQ-------------NGSKYDGDEGDYNDDEEDDEEYDNN

Query:  GNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI
        G   K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDI
Subjt:  GNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI

Query:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP
        R+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP NYQHC+SPLVHRFGELTRKIWHARNFKGQVSP
Subjt:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP

Query:  HEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGL
        HEFLQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE                  G E      E SRM FLMLGL
Subjt:  HEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGL

Query:  DLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDND
        DLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   + +
Subjt:  DLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDND

Query:  KLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
         + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  KLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein8.1e-21776.32Show/hide
Query:  KFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAG
        K + NG+K  G+     DD+EDD++ D +  + K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAG
Subjt:  KFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAG

Query:  HHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQ
        HHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP NYQ
Subjt:  HHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQ

Query:  HCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKK
        HC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE          
Subjt:  HCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKK

Query:  ENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVE
                G E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF E
Subjt:  ENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVE

Query:  KNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KNPTLVNFPVK++EL+DYIP LP   + + + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  KNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein3.0e-21970.59Show/hide
Query:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFNYGDDDEEEDFKFKQ----------NGSKYDGDEGDYNDDEEDDEEYDNNGNQV
        G +   N + +EE    ++KR +++ E S S  PP    NP LP  N  DDD  +  K +           NG+K  G+     DD+EDD++ D +  + 
Subjt:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFNYGDDDEEEDFKFKQ----------NGSKYDGDEGDYNDDEEDDEEYDNNGNQV

Query:  KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVL
        K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VL
Subjt:  KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVL

Query:  NPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFL
        NPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP NYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFL
Subjt:  NPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFL

Query:  QAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPP
        QAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE                  G E      E SRMPFLMLGLDLPP
Subjt:  QAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPP

Query:  PPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRS
        PPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   + + + S
Subjt:  PPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRS

Query:  KYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  KYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein3.3e-16278.87Show/hide
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IP NYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE                  G E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGAAGAGGCGGAGCAATAGTCTTCTAGATGAGGAAGAATTGGGTCCAGATTTAAAGAGGCATAAATTGCTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGC
CTCAGAGAATCCTCAGCTTCCGGGATTTAACTATGGCGATGATGATGAAGAAGAAGATTTCAAATTTAAACAAAATGGAAGTAAATATGATGGAGATGAAGGGGATTACA
ATGATGATGAGGAAGATGATGAAGAATACGATAATAATGGAAATCAAGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGT
CAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTTAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTC
TCATGCTTACACTCATAGTCTTGAAGCTGGACATCATGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGACCCCTCATTAG
ATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAGGAGCAGGTGGAGCAGCTTGACAAGAACAAGCAGTGGTCTAGGGCACTCGATGGTTCTGATTACCTTCCTGGA
ATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTAACAATTCAGTCCTTAATGAGAGTTACACCACTCAGAAACTTCTTCCTAATACCTGCGAACTATCA
GCACTGCAGATCTCCACTTGTTCACCGGTTTGGCGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCTGCAAGCCGTTA
TGAAGGCTAGTAAAAAACGTTTTCGAATAGGTGCACAGTCAGATCCCGTTGAGTTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTACAAAGAAAAGT
AGCAGTATAATCTACGAGTGTTTTCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCCAAAGCTCTCATTGAGAAGAAAGAAAATGGTGAGGAGCAGGATGCTGGAAC
TGAAGGTAGCAGTGTGGCAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATCTACCGCCGCCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATAC
CACAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGGATGCGTTACCGGGTTACTCGATTGCCTCAG
TACTTAATTCTTCATATGCGACGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAATTTTCCTGTCAAGAATCTAGAATTGAAGGATTACATCCC
CTTGCCAACACCTAAAGATAACGATAAATTGCGCTCAAAGTACGATTTGATTGCGAATATTGTTCATGATGGCAAACCCAATGAAGGGTACTACAGGGTATTTGTACAGA
GGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTCCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGACAACAA
TAG
mRNA sequenceShow/hide mRNA sequence
ATTTAGGTTAGCCGTGCGTCTTCCTCTTCTCAGTTTCTCTTCCTCGCGCGCCGTCCGTCTCCCTCTTCTCAGGCGCCGTCCATTTCCCTCTTCTTAGCAGCCGTCCGTCT
ACCTCCCGGCTTTCTCCCACCGATACTCCTTCCGAAGGCCATTCTAACAGCTTGGTTAGCTGACATTTTGCACACCCATAAAATCTATTGGCTTGTGAGTCTGTGACCTT
CGGGTGTAAAAGCCCTGAATCTTCTTCTACCTTTGAAAGATATCCAACAATAACAGCACAACATCCAAGTGTATTCAAGAGAACATGGAAGAAAATGATATATGAAATCT
TGCCTTCATCTTTAGTTAGATGAAATTGATGTACTTGGATAATTTGTGCCTCTGTAAATAAGAGTATTTAGTGAGAATACAAGATTACTCTAGGCTGCAGCGGCACAGGG
CACAAATAATCTAAAATATCATAAAAGAGTTGATAAAATGGGATCGAAGAGGCGGAGCAATAGTCTTCTAGATGAGGAAGAATTGGGTCCAGATTTAAAGAGGCATAAAT
TGCTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGCCTCAGAGAATCCTCAGCTTCCGGGATTTAACTATGGCGATGATGATGAAGAAGAAGATTTCAAATTTAAACAA
AATGGAAGTAAATATGATGGAGATGAAGGGGATTACAATGATGATGAGGAAGATGATGAAGAATACGATAATAATGGAAATCAAGTTAAGCGAAGTCGTGATGTTGAAGT
TCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGTCAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTTAATGTTTATGCCTGCCTGG
TATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCATAGTCTTGAAGCTGGACATCATGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGC
CTTCCCGATGGATATGAGATTAATGACCCCTCATTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAGGAGCAGGTGGAGCAGCTTGACAAGAACAAGCAGTG
GTCTAGGGCACTCGATGGTTCTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTAACAATTCAGTCCTTAATGAGAGTTACAC
CACTCAGAAACTTCTTCCTAATACCTGCGAACTATCAGCACTGCAGATCTCCACTTGTTCACCGGTTTGGCGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAA
GGACAGGTAAGCCCGCATGAATTCCTGCAAGCCGTTATGAAGGCTAGTAAAAAACGTTTTCGAATAGGTGCACAGTCAGATCCCGTTGAGTTTATGTCATGGTTTCTTAA
CACACTTCATTCAGAACTGCGAATTACAAAGAAAAGTAGCAGTATAATCTACGAGTGTTTTCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCCAAAGCTCTCATTG
AGAAGAAAGAAAATGGTGAGGAGCAGGATGCTGGAACTGAAGGTAGCAGTGTGGCAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATCTACCGCCGCCA
CCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTAT
AGCAAGGATGCGTTACCGGGTTACTCGATTGCCTCAGTACTTAATTCTTCATATGCGACGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAATT
TTCCTGTCAAGAATCTAGAATTGAAGGATTACATCCCCTTGCCAACACCTAAAGATAACGATAAATTGCGCTCAAAGTACGATTTGATTGCGAATATTGTTCATGATGGC
AAACCCAATGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTCCCTCAGATGGTTGCTCT
CTCTGAGGCTTATATGCAAATATATGAACGACAACAATAGATAGGGCATGTAACTACCAAATTAGTTTTGTTAATTTCTCCTCTAGACAATATGTACCTGGACTTGAACA
GGAAATGAGCATGAGAAGCTTTGATATCCGCGTTATAAAGGAAGCTTCCTACGACTAATCTCTTTGTTGATTGTAGAATCAATTGTCACATTCAACGTAGTTTGGAAACT
TGGACTTAGACTATGCTGCTGTACACTGCTTTCTGGTGAAATTTTGTTGCCTAAACTACTTTTCGGTTTCTGTAACGTAATGATAATATTTAGGGCTACAATTTTTCATC
ACTCTAAATTTCTCTGCAATCGGATAACATTTCTTTGTTTAGAGGATTTGTATTTGCAGAGATTGTAATGTGAAGCCATAACTTGGACTCAGTAGTTTTGAGTTCATACC
GTTATGAAGGCTTTTTTACTATACAGATGGGGAATTTGAAACTTATTGAGTTGATGGTCCGCATAGTTTGCCAC
Protein sequenceShow/hide protein sequence
MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNR
QVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPG
MVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKS
SSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQ
YLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ