CuGenDBv2

Sed0023864 (gene) of Chayote v1 genome

Gene ID: Sed0023864
Organism: Sechium edule (Chayote v1)
Description: U4/U6.U5 tri-snRNP-associated protein 2-like
Genome location: LG08:26267903..26278227
RNA-Seq Expression: Sed0023864
Synteny: Sed0023864
Gene Ontology terms:
GO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo] | 6.0e-283 | 88.07
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGD----NNDEEDVEEHDDDNDANHVKRSRDVE
        MGSK+R+ + +DEE++GPDLKR KL+ ++ SPS+SPPASENPQLPGFNYGDDD+EED+++K NGS+YD D    N+DEED EEHDD    N VKRSRDVE
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGD----NNDEEDVEEHDDDNDANHVKRSRDVE

Query:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE
        VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KE
Subjt:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE

Query:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
        QVEQLDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
Subjt:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS

Query:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV
        KKRFRIGAQSDPVEF+SWFLNTLHS+LRI+KKSSSIIYECFQGELEVVKEIHSKAL        + DAG++GS+ ++ETSRMPFLMLGLDLPPPPLFKDV
Subjt:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV

Query:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI
        MEKNIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPK+++KLRSKYDLIAN+
Subjt:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI

Query:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        VHDGKP+EGCYRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus] | 1.9e-281 | 88.2
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD--DNDANHVKRSRDVEVR
        MGSK+R  + +DEE++GPDLKR KL+ ++ SPS+SPPASENPQLPGFNYGDDD+EED+++K NGS+YDGD  D  D EE D+  DN+ N VKRSRDVEVR
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD--DNDANHVKRSRDVEVR

Query:  KDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQV
        KDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KEQV
Subjt:  KDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQV

Query:  EQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK
        EQLDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK
Subjt:  EQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK

Query:  RFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVME
        RFRIGAQSDPVEF+SWFLNTLHS+LRI+KKSSSIIYECFQGELEVVKEIHSKAL        + DAG+EGS+  +ETSRMPFLMLGLDLPPPPLFKDVME
Subjt:  RFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVME

Query:  KNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVH
        KNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVH
Subjt:  KNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVH

Query:  DGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        DGKP+EG YRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  DGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

XP_022142503.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Momordica charantia] | 1.3e-285 | 89.45
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD-DNDANHVKRSRDVEVRK
        MGSK+RD N VDEE++ P++KR+KL+ +  SPS+ PPASENP+LPGFNYGDDD+EEDY++K NGSR  GD  D+ D EE D+ D+DANHVKRSRDVEVRK
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD-DNDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
         LDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEF+SWFLNTLHS+LR+SKKSSSIIYECFQGELEVVKEIHSKALT       D DAGSEGS+ I+ETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGE ITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPKE+EKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD

Query:  GKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        GKPDEGCYRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  GKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida] | 2.7e-283 | 88.79
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDG----DNNDEEDVEEHDDDNDANHVKRSRDVE
        MGSK+R+ + +DEE++GPDLKR KL+ ++S   +SPPASENPQLPGFNYGDD++EE+Y++K NGSRYDG    DN+DEED EEHDD  DANHVKRSRDVE
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDG----DNNDEEDVEEHDDDNDANHVKRSRDVE

Query:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE
        VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KE
Subjt:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE

Query:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
        QVEQLDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
Subjt:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS

Query:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV
        KKRFRIGAQSDPVEF+SWFLNTLHS+LRI+KKSSSIIYECFQGELEVVKEIHSKAL        D DAG+E S+ ++ETSRMPFLMLGLDLPPPPLFKDV
Subjt:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV

Query:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI
        MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPK++EKL SKYDLIANI
Subjt:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI

Query:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        VHDGKP+EG YRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida] | 2.7e-283 | 88.79
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDG----DNNDEEDVEEHDDDNDANHVKRSRDVE
        MGSK+R+ + +DEE++GPDLKR KL+ ++S   +SPPASENPQLPGFNYGDD++EE+Y++K NGSRYDG    DN+DEED EEHDD  DANHVKRSRDVE
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDG----DNNDEEDVEEHDDDNDANHVKRSRDVE

Query:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE
        VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KE
Subjt:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE

Query:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
        QVEQLDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
Subjt:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS

Query:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV
        KKRFRIGAQSDPVEF+SWFLNTLHS+LRI+KKSSSIIYECFQGELEVVKEIHSKAL        D DAG+E S+ ++ETSRMPFLMLGLDLPPPPLFKDV
Subjt:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV

Query:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI
        MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPK++EKL SKYDLIANI
Subjt:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI

Query:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        VHDGKP+EG YRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LTF9 Uncharacterized protein | 9.3e-282 | 88.2
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD--DNDANHVKRSRDVEVR
        MGSK+R  + +DEE++GPDLKR KL+ ++ SPS+SPPASENPQLPGFNYGDDD+EED+++K NGS+YDGD  D  D EE D+  DN+ N VKRSRDVEVR
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD--DNDANHVKRSRDVEVR

Query:  KDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQV
        KDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KEQV
Subjt:  KDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQV

Query:  EQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK
        EQLDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK
Subjt:  EQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK

Query:  RFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVME
        RFRIGAQSDPVEF+SWFLNTLHS+LRI+KKSSSIIYECFQGELEVVKEIHSKAL        + DAG+EGS+  +ETSRMPFLMLGLDLPPPPLFKDVME
Subjt:  RFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVME

Query:  KNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVH
        KNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPK+++KLRSKYDLIANIVH
Subjt:  KNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVH

Query:  DGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        DGKP+EG YRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  DGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like | 2.9e-283 | 88.07
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGD----NNDEEDVEEHDDDNDANHVKRSRDVE
        MGSK+R+ + +DEE++GPDLKR KL+ ++ SPS+SPPASENPQLPGFNYGDDD+EED+++K NGS+YD D    N+DEED EEHDD    N VKRSRDVE
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGD----NNDEEDVEEHDDDNDANHVKRSRDVE

Query:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE
        VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KE
Subjt:  VRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKE

Query:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
        QVEQLDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS
Subjt:  QVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKAS

Query:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV
        KKRFRIGAQSDPVEF+SWFLNTLHS+LRI+KKSSSIIYECFQGELEVVKEIHSKAL        + DAG++GS+ ++ETSRMPFLMLGLDLPPPPLFKDV
Subjt:  KKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKAL-------TDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV

Query:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI
        MEKNIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPK+++KLRSKYDLIAN+
Subjt:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANI

Query:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        VHDGKP+EGCYRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  VHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

A0A6J1CND9 U4/U6.U5 tri-snRNP-associated protein 2-like | 6.2e-286 | 89.45
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD-DNDANHVKRSRDVEVRK
        MGSK+RD N VDEE++ P++KR+KL+ +  SPS+ PPASENP+LPGFNYGDDD+EEDY++K NGSR  GD  D+ D EE D+ D+DANHVKRSRDVEVRK
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDD-DNDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
         LDKNKQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEF+SWFLNTLHS+LR+SKKSSSIIYECFQGELEVVKEIHSKALT       D DAGSEGS+ I+ETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGE ITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPKE+EKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD

Query:  GKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        GKPDEGCYRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  GKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like | 9.3e-282 | 89.44
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDDDNDANHVKRSRDVEVRKD
        MGSK+++ + VDEE++GPDLKR K + + SSPS SPPASENPQLPGFNYGDDD+EEDY+ K NGS YDGD  D  D EE+D+D   NH+ RSRDVEVRKD
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDDDNDANHVKRSRDVEVRKD

Query:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQ
        CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KEQVEQ
Subjt:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQ

Query:  LDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
        LDK KQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
Subjt:  LDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF

Query:  RIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKN
        RIGAQSDPVEF+SWFLNTLHS+LR+SKKSSSIIYECFQGELEVVKEIHSKALT       D DAG+EGS+ I+ETSRMPFLMLGLDLPPPPLFKDVMEKN
Subjt:  RIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKN

Query:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG
        IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG
Subjt:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG

Query:  KPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        KP+EG YRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  KPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like | 1.6e-281 | 88.89
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDDDNDANHVKRSRDVEVRKD
        MGSK+++ + VDEE++GPDLKR K + +LS   +SPPASENPQLPGFNYGDDD+EEDY+ K NGS YDGD  D  D EE+D+D   NH+ RSRDVEVRKD
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDDDNDANHVKRSRDVEVRKD

Query:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQ
        CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTE+VYCLPDGYEINDPSLD IRYVLNPRF+KEQVEQ
Subjt:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQ

Query:  LDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
        LDK KQWSRALDGSDYLPGMVGLNNI+ TDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF
Subjt:  LDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRF

Query:  RIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKN
        RIGAQSDPVEF+SWFLNTLHS+LR+SKKSSSIIYECFQGELEVVKEIHSKALT       D DAG+EGS+ I+ETSRMPFLMLGLDLPPPPLFKDVMEKN
Subjt:  RIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALT-------DHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKN

Query:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG
        IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPT+VNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG
Subjt:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDG

Query:  KPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        KP+EG YRVFVQRKSEELWYEMQDLHV+ETLPQMVALSE YMQIYERQQ
Subjt:  KPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

SwissProt top hits | e value | % identity | Alignment
P43589 Pre-mRNA-splicing factor SAD1 | 3.8e-46 | 29.25
Query:  DDDNDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDP
        +D+     VK+ +  E   +  YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + + Y LP   +I   
Subjt:  DDDNDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDP

Query:  S----LDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHA
             L+ I++   P +  + +E   +       L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R     +KIW  
Subjt:  S----LDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHA

Query:  RNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPF
        + FK  +S  +F+      S  + R G   +P++   F+ W  N + S    S    SI+    +G++++ K  +    ++   G        +    PF
Subjt:  RNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPF

Query:  LMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTP
         +L LDLP    F+D    + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+          + PVKN   ++   +   
Subjt:  LMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTP

Query:  KESEKLRSKYDLIANIVH--------DGKP---DEGCYRV--FVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
         E E L  KY L AN+VH        DG     DE  + +      KSE+ W E+  ++ TE   +++ L ET++Q++E+Q+
Subjt:  KESEKLRSKYDLIANIVH--------DGKP---DEGCYRV--FVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 2 | 9.7e-135 | 51.96
Query:  EHDDDNDANHVKRSRDVEV------RKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLP
        E D+D++     R+++  V       + CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T + YCLP
Subjt:  EHDDDNDANHVKRSRDVEV------RKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLP

Query:  DGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHR
        D YEI D SL+ I YVL P F+K+Q+  LDK  + SRA DG+ YLPG+VGLNNI+A D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV R
Subjt:  DGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHR

Query:  FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDL-RISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNA
        FGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG +     I +K L   D  +E    
Subjt:  FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDL-RISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNA

Query:  ILETS-------RMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVN
        +L             F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNPTIVN
Subjt:  ILETS-------RMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVN

Query:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQ
        FP+ N++L++Y  L    ++    + YDLIANIVHDGKP EG YR+ V       WYE+QDL VT+ LPQM+ LSE Y+QI++R+
Subjt:  FPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 2 | 2.2e-134 | 52.36
Query:  EEDVEEHDDDNDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYC
        E +V+E  +       K  R D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T + YC
Subjt:  EEDVEEHDDDNDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYC

Query:  LPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LV
        LPD YEI D SL+ I YVL P F+K+Q+  LDK  + SRA DG+ YLPG+VGLNNI+A D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV
Subjt:  LPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LV

Query:  HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDL-RISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGS
         RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG +     I +K L   D  +E  
Subjt:  HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDL-RISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGS

Query:  NAILETS-------RMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTI
          +L             F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNPTI
Subjt:  NAILETS-------RMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTI

Query:  VNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQ
        VNFP+ N++L++Y  L    ++    + YDLIANIVHDGKP EG YR+ V       WYE+QDL VT+ LPQM+ LSE Y+QI++R+
Subjt:  VNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 2 | 1.8e-133 | 52.16
Query:  EEDVEEHDDDNDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYC
        E +V+E  +       K  R D E R+   CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T + YC
Subjt:  EEDVEEHDDDNDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYC

Query:  LPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LV
        LPD YEI D SL+ I YVL P F+K+Q+  LDK  + SRA DG+ YLPG+VGLNNI+A D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV
Subjt:  LPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LV

Query:  HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDL-RISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGS
         RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG +     I +K L   D  +E  
Subjt:  HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDL-RISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGS

Query:  NAILETS-------RMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTI
          +L             F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNPTI
Subjt:  NAILETS-------RMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNPTI

Query:  VNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQ
        VNFP+ N++L++Y+        E   + YDLIANIVHDGKP EG YR+ V       WYE+QDL VT+ LPQM+ LSE Y+QI++R+
Subjt:  VNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp10 | 6.3e-102 | 41.51
Query:  DNNDEEDVEEHDDDNDANHVKRSRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINL
        DN   + +E   D  D + +  S+++E  +  P      YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N 
Subjt:  DNNDEEDVEEHDDDNDANHVKRSRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINL

Query:  RTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLV
         T + Y LP+ Y++   +L  I YV+ P F+K +V++LD   Q S  L    Y+PG VG+NNI+  D+ NV I  L  V P RN+FL+ +N+ +C   LV
Subjt:  RTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLV

Query:  HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISK----KSSSIIYECFQGELEVVKEIHSKALTDHDAGS
         R   L RK+W+ + FK  VSP E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ +  H    
Subjt:  HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISK----KSSSIIYECFQGELEVVKEIHSKALTDHDAGS

Query:  E----GSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIV
        E      + +++T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H++RF KNN+F E+N TIV
Subjt:  E----GSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIV

Query:  NFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD----GKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYER
         FP+ + ++  +I     + + K+ +KY+L+ANI+H+     + +   +R+ ++  S   WY++QDL+V E    M+ L E+++Q++ER
Subjt:  NFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHD----GKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYER

Arabidopsis top hits | e value | % identity | Alignment
AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein | 4.8e-222 | 71.45
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFN-YGDDDDEEDYRYKL-------------NGSRYDGDNNDEEDVEEHDDDNDA
        M  ++   NGV EE+   ++KRK++++   SP   P    NP LP  N Y DDD+EE+   K              NG++  G+  +E D ++ DDD   
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFN-YGDDDDEEDYRYKL-------------NGSRYDGDNNDEEDVEEHDDDNDA

Query:  NHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIR
           K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TE+VYCLPD YEINDPSLD IR
Subjt:  NHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIR

Query:  YVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPH
        +VLNPRFS+ QV +LDKN+QWSRALDGSDYLPGMVGLNNIQ T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPH
Subjt:  YVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPH

Query:  EFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLF
        EFLQAVMKASKKRFRIG QSDPVEF+SW LNTLH DLR SK +SSII++CFQGELEVVKE           G+E      E SRM FLMLGLDLPPPPLF
Subjt:  EFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLF

Query:  KDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIP-LPTPKESEKLRSKYDL
        KDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPT+VNFPVK++EL+DYIP LP   E E + SKY+L
Subjt:  KDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIP-LPTPKESEKLRSKYDL

Query:  IANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        IANIVHDGKP++G +RVFVQRKS+ELWYEMQDLHV ETLPQMV LSE YMQIYE+++
Subjt:  IANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein | 3.2e-218 | 72.06
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDE-EDVEEHDDDNDANHVKRSRDVEVRK
        M  ++   NGV EE+   ++KRK++++     S SPP    P L     G            NG++  G+   E +D E+ DDD      K SR VEVR+
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDE-EDVEEHDDDNDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVE
        DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TE+VYCLPD YEINDPSLD IR+VLNPRFS+ QV 
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        +LDKN+QWSRALDGSDYLPGMVGLNNIQ T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        FRIG QSDPVEF+SW LNTLH DLR SK +SSII++CFQGELEVVKE           G+E      E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  FRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIP-LPTPKESEKLRSKYDLIANIVHDGKPDEG
        LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPT+VNFPVK++EL+DYIP LP   E E + SKY+LIANIVHDGKP++G
Subjt:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIP-LPTPKESEKLRSKYDLIANIVHDGKPDEG

Query:  CYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
         +RVFVQRKS+ELWYEMQDLHV ETLPQMV LSE YMQIYE+Q+
Subjt:  CYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein    e value: 1.4e-221    %identity: 71.48
Query:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKL----------NGSRYDGDNNDE-EDVEEHDDDNDANHV
        M  ++   NGV EE+   ++KRK++++   SP   P    NP LP  N  DDD+ +  + +           NG++  G+   E +D E+ DDD      
Subjt:  MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKL----------NGSRYDGDNNDE-EDVEEHDDDNDANHV

Query:  KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVL
        K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TE+VYCLPD YEINDPSLD IR+VL
Subjt:  KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVL

Query:  NPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFL
        NPRFS+ QV +LDKN+QWSRALDGSDYLPGMVGLNNIQ T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFL
Subjt:  NPRFSKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFL

Query:  QAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV
        QAVMKASKKRFRIG QSDPVEF+SW LNTLH DLR SK +SSII++CFQGELEVVKE           G+E      E SRMPFLMLGLDLPPPPLFKDV
Subjt:  QAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDV

Query:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIP-LPTPKESEKLRSKYDLIAN
        MEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPT+VNFPVK++EL+DYIP LP   E E + SKY+LIAN
Subjt:  MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTIVNFPVKNLELKDYIP-LPTPKESEKLRSKYDLIAN

Query:  IVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ
        IVHDGKP++G +RVFVQRKS+ELWYEMQDLHV ETLPQMV LSE YMQIYE+Q+
Subjt:  IVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein    e value: 7.8e-164    %identity: 80.75
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TE+VYCLPD YEINDPSLD IR+VLNPRFS+ QV +LDKN+Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQ

Query:  WSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNIQ T+FVNVTIQSLMRVTPLRNFF IPENYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS

Query:  DPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKK
        DPVEF+SW LNTLH DLR SK +SSII++CFQGELEVVKE           G+E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKK
Subjt:  DPVEFISWFLNTLHSDLRISKKSSSIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKK

Query:  FDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTI
        FDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPT+
Subjt:  FDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTI

AT4G22420.1 Ubiquitin-specific protease family C19-related protein    e value: 2.1e-07    %identity: 40.18
Query:  DLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEED--------------YRYKLNGSRYDGDNNDEEDVEEHDDDN-DANHVKRSRDVEVRKDCPY
        D   K++++   SP   P    N  LP  N  DDDDEE+               + + NG++ +G+  +E D EE DDD+      K SR VEVR+DCPY
Subjt:  DLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEED--------------YRYKLNGSRYDGDNNDEEDVEEHDDDN-DANHVKRSRDVEVRKDCPY

Query:  LDTVNRQVLDFD
        LDTVNRQV+  D
Subjt:  LDTVNRQVLDFD


Sequences
CDS sequence
ATGGGATCAAAGAAACGAGACTATAATGGTGTAGATGAGGAAGATATAGGTCCAGACTTGAAGAGGAAGAAATTAGTTCAGGACTTATCATCACCTTCTGCTTCTCCACC
TGCCTCAGAGAACCCCCAACTTCCTGGTTTTAACTATGGCGACGACGACGACGAAGAGGATTACAGATATAAACTAAATGGAAGCAGGTATGATGGAGACAACAACGACG
AGGAAGATGTTGAAGAACACGATGATGATAATGATGCGAATCATGTGAAGCGAAGCCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTTAACCGTCAG
GTTTTGGATTTCGATTTCGAGAAGTTCTGCTCTGTCTCGCTCTCAAATCTGAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTCTCA
TGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAGAGAGAGTTTACTGCCTTCCTGATGGATATGAGATAAATGACCCCTCGTTAGATG
GAATTCGATATGTCCTGAATCCAAGGTTTTCTAAAGAGCAGGTAGAGCAGCTTGACAAGAACAAGCAATGGTCCAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATG
GTGGGGCTTAACAACATTCAGGCAACCGATTTTGTAAATGTTACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAATTTTTTCCTAATACCTGAGAACTATCAGCA
TTGCAGATCTCCTCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTTCTGCAAGCAGTTATGA
AGGCTAGTAAAAAGCGTTTTCGGATAGGTGCACAGTCAGATCCTGTTGAATTTATTTCTTGGTTTCTTAACACACTTCATTCAGACCTACGAATTTCAAAGAAAAGTAGC
AGTATAATCTATGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACCGATCATGATGCTGGAAGTGAAGGAAGCAATGCCATATTGGA
AACATCAAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCACCACCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAACATTT
TGAAGAAATTTGATGGTGAAACGATCACAGAAGTTGTCCGTCCACGTATAGCAAGGATGCGCTACCGAGTCACTCGATTGCCTCAATATTTGATTCTTCACATGCGACGA
TTTACAAAGAACAATTTTTTTGTGGAGAAAAATCCCACAATAGTGAACTTTCCTGTCAAGAATCTAGAATTGAAGGATTACATTCCCTTGCCAACACCTAAAGAGAGTGA
AAAATTGCGTTCAAAGTACGATTTGATCGCAAATATTGTTCATGACGGCAAGCCCGACGAAGGGTGCTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACG
AGATGCAGGATCTTCATGTCACAGAAACACTTCCTCAAATGGTTGCTCTCTCCGAAACTTATATGCAGATATATGAACGGCAGCAATAG
mRNA sequence
ATTAAGCCCAAATGACCCAATGACCCAAAATAAGATCCATCATCGCCTCGCCCAATTCCCAAACCTTACTCTGAAGACGAAGAGAACAGCAAGCCGCGCTAAACCGGCGA
GGGCAAATCGGAATCCTCTTCTGAAGCATATGCCATTGATTTAGTTCTTGCAAGAATCGAGCTCCGTTCCAGTCATCAAGCATCTCCAAGAACAGCCCAACCACTGCCCA
GGTATCAATTTTTCTGAAATGGGATCAAAGAAACGAGACTATAATGGTGTAGATGAGGAAGATATAGGTCCAGACTTGAAGAGGAAGAAATTAGTTCAGGACTTATCATC
ACCTTCTGCTTCTCCACCTGCCTCAGAGAACCCCCAACTTCCTGGTTTTAACTATGGCGACGACGACGACGAAGAGGATTACAGATATAAACTAAATGGAAGCAGGTATG
ATGGAGACAACAACGACGAGGAAGATGTTGAAGAACACGATGATGATAATGATGCGAATCATGTGAAGCGAAGCCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTT
GATACTGTTAACCGTCAGGTTTTGGATTTCGATTTCGAGAAGTTCTGCTCTGTCTCGCTCTCAAATCTGAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGG
GAGGGGGAAGAAGTCTCATGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAGAGAGAGTTTACTGCCTTCCTGATGGATATGAGATAA
ATGACCCCTCGTTAGATGGAATTCGATATGTCCTGAATCCAAGGTTTTCTAAAGAGCAGGTAGAGCAGCTTGACAAGAACAAGCAATGGTCCAGGGCACTTGATGGTTCT
GATTACCTTCCTGGAATGGTGGGGCTTAACAACATTCAGGCAACCGATTTTGTAAATGTTACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAATTTTTTCCTAAT
ACCTGAGAACTATCAGCATTGCAGATCTCCTCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAAT
TTCTGCAAGCAGTTATGAAGGCTAGTAAAAAGCGTTTTCGGATAGGTGCACAGTCAGATCCTGTTGAATTTATTTCTTGGTTTCTTAACACACTTCATTCAGACCTACGA
ATTTCAAAGAAAAGTAGCAGTATAATCTATGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACCGATCATGATGCTGGAAGTGAAGG
AAGCAATGCCATATTGGAAACATCAAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCACCACCACCTCTTTTCAAAGATGTTATGGAGAAAAATATAATACCACAGG
TTCCACTCTTCAACATTTTGAAGAAATTTGATGGTGAAACGATCACAGAAGTTGTCCGTCCACGTATAGCAAGGATGCGCTACCGAGTCACTCGATTGCCTCAATATTTG
ATTCTTCACATGCGACGATTTACAAAGAACAATTTTTTTGTGGAGAAAAATCCCACAATAGTGAACTTTCCTGTCAAGAATCTAGAATTGAAGGATTACATTCCCTTGCC
AACACCTAAAGAGAGTGAAAAATTGCGTTCAAAGTACGATTTGATCGCAAATATTGTTCATGACGGCAAGCCCGACGAAGGGTGCTACAGGGTATTTGTACAGAGGAAGT
CGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCACAGAAACACTTCCTCAAATGGTTGCTCTCTCCGAAACTTATATGCAGATATATGAACGGCAGCAATAGATA
CGAAACTCGATATCCACATCTAACTGCGTAGTTGGTTGATTATTTCCCCTCTAGAACATGTGTATGTACTTAAACAGGAGATTAGAAAAGATGAAATTTTGAAATGGTGG
ATATAAAAAGGAAGATTATTTGTGTGGAAAAGGTGAATATTCAACATGTTTGAAGCTGCGATGGGGAGGATAATATTGTTTATCAGCTAATATCTTCGTTTAATTGTAGA
AACAATTGTCATTCTAAACCCTACTTTGAAACGTTGGACAGAATCTATGGACAAATGCTGCCATTTCAGAAAGGCCGGTGAGATTTTGTTGCCTAATTTACTTTTCAGTT
TTCTTTTAACAAAATAGTGATATATTAGGGCTGTATAGACTTCAAAATCTTTCTTTGCGACAGGCAAC
Protein sequence
MGSKKRDYNGVDEEDIGPDLKRKKLVQDLSSPSASPPASENPQLPGFNYGDDDDEEDYRYKLNGSRYDGDNNDEEDVEEHDDDNDANHVKRSRDVEVRKDCPYLDTVNRQ
VLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTERVYCLPDGYEINDPSLDGIRYVLNPRFSKEQVEQLDKNKQWSRALDGSDYLPGM
VGLNNIQATDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFISWFLNTLHSDLRISKKSS
SIIYECFQGELEVVKEIHSKALTDHDAGSEGSNAILETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRR
FTKNNFFVEKNPTIVNFPVKNLELKDYIPLPTPKESEKLRSKYDLIANIVHDGKPDEGCYRVFVQRKSEELWYEMQDLHVTETLPQMVALSETYMQIYERQQ