; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G6532 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G6532
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationctg1450:144064..150684
RNA-Seq ExpressionCucsat.G6532
SyntenyCucsat.G6532
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo]0.097.64Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYD DEGDYNDDEEDDEE+D++GNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGT+GSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus]0.099.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_023001558.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Cucurbita maxima]0.093.64Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE+SPSS PPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida]0.096.18Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSS PPASENPQLPGFNYGDD+EEE++KFKQNGS+YDGDEGD NDDEEDDEE+D++ N VKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTE SSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDN+KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]0.096.18Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSS PPASENPQLPGFNYGDD+EEE++KFKQNGS+YDGDEGD NDDEEDDEE+D++ N VKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTE SSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDN+KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein0.099.82Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like0.097.64Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYD DEGDYNDDEEDDEE+D++GNQVKRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGT+GSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A5D3E5N4 U4/U6.U5 tri-snRNP-associated protein 2-like0.093.27Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLL EVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNG                          +KRSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGT+GSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like0.093.64Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE SPSS PPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like0.093.64Show/hide
Query:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK
        MGSKR+++S++DEEELGPDLKRHK LGE+SPSS PPASENPQLPGFNYGDDDEEED+K KQNGS YDGDEGD  DDEE+DE+     N + RSRDVEVRK
Subjt:  MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD11.6e-4428.77Show/hide
Query:  EDDEEYDNNGNQVKRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD
        E D +  ++ +++K+    +++   P   YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP 
Subjt:  EDDEEYDNNGNQVKRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD

Query:  GYEINDPS----LDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGEL
          +I        L+ I++   P +  + +E   +       L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R    
Subjt:  GYEINDPS----LDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGEL

Query:  TRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGT
         +KIW  + FK  +S  +F+      S  + R G   +P++   F+ W  N + S     K   SI+    +G++++ K       +E K    E     
Subjt:  TRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGT

Query:  EGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEK--NPTLVNF
           SV  +    PF +L LDLP    F+D    + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+    K  N TLV F
Subjt:  EGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEK--NPTLVNF

Query:  PVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
            LE+              L  KY L AN+VH        DG    G    ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  PVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 26.0e-13251.91Show/hide
Query:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE
        E + ++D E + E      +V    D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T 
Subjt:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE

Query:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----
        K YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL   NY++ + P     
Subjt:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----

Query:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK
          LV RFGEL RK+W+ RNFK  VSPHE LQA++  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H     E+K
Subjt:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK

Query:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN
        E    N E Q+   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKN
Subjt:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN

Query:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NFFVEKNPT+VNFP+ N++L++Y+       +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 24.6e-13251.91Show/hide
Query:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE
        E + ++D E + E      +V    D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T 
Subjt:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE

Query:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----
        K YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL   NY++ + P     
Subjt:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----

Query:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK
          LV RFGEL RK+W+ RNFK  VSPHE LQA++  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H     E+K
Subjt:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK

Query:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN
        E    N E Q+   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKN
Subjt:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN

Query:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NFFVEKNPT+VNFP+ N++L++Y+       +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 21.7e-13151.71Show/hide
Query:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE
        E + ++D E + E      +V    D E R+   CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T 
Subjt:  EGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE

Query:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----
        K YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL   NY++ + P     
Subjt:  KVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSP-----

Query:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK
          LV RFGEL RK+W+ RNFK  VSPHE LQA++  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H     E+K
Subjt:  --LVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALIEKK

Query:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN
        E    N E Q+   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKN
Subjt:  E----NGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKN

Query:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NFFVEKNPT+VNFP+ N++L++Y+       ++   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp101.2e-10343.19Show/hide
Query:  SRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI
        S+++E  +  P      YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y LP+ Y++   +L DI
Subjt:  SRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI

Query:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP
         YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+FL+  N+ +C   LV R   L RK+W+ + FK  VSP
Subjt:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP

Query:  HEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFL
         E +Q +   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ + +  E GE+      G  V ++T+ +PFL
Subjt:  HEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFL

Query:  MLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK
         L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H++RF KNN+F E+N T+V FP+ + ++  +I     +
Subjt:  MLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK

Query:  DNDKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
         N K+ +KY+L+ANI+H+     +     +R+ ++  S   WY++QDL+V E    M+ L E+++Q++ER
Subjt:  DNDKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

Arabidopsis top hitse value%identityAlignment
AT1G32850.1 ubiquitin-specific protease 113.1e-1129.24Show/hide
Query:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------RIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP
        P + K+V+ K  + + + LF+ L+ F  E   E + P            R A  +  + +LP  L+ H++RFT + +F  K  TLVNF + +L+L  Y+ 
Subjt:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------RIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP

Query:  LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
            K+ D     Y+L A   H G    G+Y  + +   E  WY   D  VS      +  S AY+  Y+R
Subjt:  LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein4.6e-22070.44Show/hide
Query:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDFKFKQ-------------NGSKYDGDEGDYNDDEEDDEEYDNN
        G +   N + +EE    ++KR +++ E S S  PP    NP LP  N Y DDDEEE+ + K+             NG+K  G+  +  DD+EDD+     
Subjt:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDFKFKQ-------------NGSKYDGDEGDYNDDEEDDEEYDNN

Query:  GNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI
        G   K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDI
Subjt:  GNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDI

Query:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP
        R+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP NYQHC+SPLVHRFGELTRKIWHARNFKGQVSP
Subjt:  RYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSP

Query:  HEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGL
        HEFLQA+MKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE                  G E      E SRM FLMLGL
Subjt:  HEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGL

Query:  DLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDND
        DLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   + +
Subjt:  DLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDND

Query:  KLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
         + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  KLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein1.8e-21676.11Show/hide
Query:  KFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAG
        K + NG+K  G+     DD+EDD++ D +  + K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAG
Subjt:  KFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAG

Query:  HHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQ
        HHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP NYQ
Subjt:  HHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQ

Query:  HCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKK
        HC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQA+MKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE          
Subjt:  HCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKK

Query:  ENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVE
                G E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF E
Subjt:  ENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVE

Query:  KNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KNPTLVNFPVK++EL+DYIP LP   + + + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  KNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein6.6e-21970.41Show/hide
Query:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFNYGDDDEEEDFKFKQ----------NGSKYDGDEGDYNDDEEDDEEYDNNGNQV
        G +   N + +EE    ++KR +++ E S S  PP    NP LP  N  DDD  +  K +           NG+K  G+     DD+EDD++ D +  + 
Subjt:  GSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFNYGDDDEEEDFKFKQ----------NGSKYDGDEGDYNDDEEDDEEYDNNGNQV

Query:  KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVL
        K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VL
Subjt:  KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVL

Query:  NPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFL
        NPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP NYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFL
Subjt:  NPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFL

Query:  QAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPP
        QA+MKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE                  G E      E SRMPFLMLGLDLPP
Subjt:  QAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPP

Query:  PPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRS
        PPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   + + + S
Subjt:  PPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDNDKLRS

Query:  KYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  KYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein4.3e-16278.59Show/hide
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IP NYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQA+MKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE                  G E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGAAGAGGCGGAGCAATAGTCTTCTAGATGAGGAAGAATTGGGTCCAGATTTAAAGAGGCATAAATTGCTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGC
CTCAGAGAATCCTCAGCTTCCGGGATTTAACTATGGCGATGATGATGAAGAAGAAGATTTCAAATTTAAACAAAATGGAAGTAAATATGATGGAGATGAAGGGGATTACA
ATGATGATGAGGAAGATGATGAAGAATACGATAATAATGGAAATCAAGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGT
CAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTTAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTC
TCATGCTTACACTCATAGTCTTGAAGCTGGACATCATGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGACCCCTCATTAG
ATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAGGAGCAGGTGGAGCAGCTTGACAAGAACAAGCAGTGGTCTAGGGCACTCGATGGTTCTGATTACCTTCCTGGA
ATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTAACAATTCAGTCCTTAATGAGAGTTACACCACTCAGAAACTTCTTCCTAATACCTGCGAACTATCA
GCACTGCAGATCTCCACTTGTTCACCGGTTTGGCGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAGGGACAGGTAAGCCCGCATGAATTCCTGCAAGCCATTA
TGAAGGCTAGTAAAAAACGTTTTCGAATAGGTGCACAGTCAGATCCCGTTGAGTTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTACAAAGAAAAGT
AGCAGTATAATCTACGAGTGTTTTCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCCAAAGCTCTCATTGAGAAGAAAGAAAATGGTGAGGAGCAGGATGCTGGAAC
TGAAGGTAGCAGTGTGGCAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATCTACCGCCGCCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATAC
CACAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGGATGCGTTACCGGGTTACTCGATTGCCTCAG
TACTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAATTTTCCTGTCAAGAATCTAGAATTGAAGGATTACATCCC
CTTGCCAACACCTAAAGATAACGATAAATTGCGCTCAAAGTACGATTTGATTGCGAATATTGTTCATGATGGCAAACCCAATGAAGGGTACTACAGGGTATTTGTACAGA
GGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTCCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGACAACAA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATCGAAGAGGCGGAGCAATAGTCTTCTAGATGAGGAAGAATTGGGTCCAGATTTAAAGAGGCATAAATTGCTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGC
CTCAGAGAATCCTCAGCTTCCGGGATTTAACTATGGCGATGATGATGAAGAAGAAGATTTCAAATTTAAACAAAATGGAAGTAAATATGATGGAGATGAAGGGGATTACA
ATGATGATGAGGAAGATGATGAAGAATACGATAATAATGGAAATCAAGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGT
CAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTTAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTC
TCATGCTTACACTCATAGTCTTGAAGCTGGACATCATGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGACCCCTCATTAG
ATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAGGAGCAGGTGGAGCAGCTTGACAAGAACAAGCAGTGGTCTAGGGCACTCGATGGTTCTGATTACCTTCCTGGA
ATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTAACAATTCAGTCCTTAATGAGAGTTACACCACTCAGAAACTTCTTCCTAATACCTGCGAACTATCA
GCACTGCAGATCTCCACTTGTTCACCGGTTTGGCGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAGGGACAGGTAAGCCCGCATGAATTCCTGCAAGCCATTA
TGAAGGCTAGTAAAAAACGTTTTCGAATAGGTGCACAGTCAGATCCCGTTGAGTTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTACAAAGAAAAGT
AGCAGTATAATCTACGAGTGTTTTCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCCAAAGCTCTCATTGAGAAGAAAGAAAATGGTGAGGAGCAGGATGCTGGAAC
TGAAGGTAGCAGTGTGGCAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATCTACCGCCGCCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATAC
CACAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGGATGCGTTACCGGGTTACTCGATTGCCTCAG
TACTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAATTTTCCTGTCAAGAATCTAGAATTGAAGGATTACATCCC
CTTGCCAACACCTAAAGATAACGATAAATTGCGCTCAAAGTACGATTTGATTGCGAATATTGTTCATGATGGCAAACCCAATGAAGGGTACTACAGGGTATTTGTACAGA
GGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTCCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGACAACAA
TAG
Protein sequenceShow/hide protein sequence
MGSKRRSNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDFKFKQNGSKYDGDEGDYNDDEEDDEEYDNNGNQVKRSRDVEVRKDCPYLDTVNR
QVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPG
MVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPANYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAIMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKS
SSIIYECFQGELEVVKEIHSKALIEKKENGEEQDAGTEGSSVAMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQ
YLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDNDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ