; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009824 (gene) of Snake gourd v1 genome

Gene IDTan0009824
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationLG09:46860841..46870368
RNA-Seq ExpressionTan0009824
SyntenyTan0009824
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo]3.3e-29792.91Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRR+++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K KQNGS+YD D+GD NDDEEDDE+HDD  N V+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAG++GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KL SKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus]3.3e-29793.09Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRR ++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K KQNGS+YDGD+GD NDDEEDDE++D++ N V+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAG+EGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_022142503.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Momordica charantia]3.4e-30295.45Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRRDDN VDEEE A ++KRQKLLGEFS SS PPASENPRLPGFNYGDDDEEEDYK KQNGSR  GD GD NDDEEDDE +DDDANHV+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF KEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
         LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGE ITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPDEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida]5.8e-30294.55Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRR+++ +DEEE   DLKR KLLGE  S SSPPASENP+LPGFNYGDD+EEE+YK KQNGSRYDGD+GD NDDEEDDE+HDDDANHV+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDAG+E SSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+NEKLCSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]5.8e-30294.55Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRR+++ +DEEE   DLKR KLLGE  S SSPPASENP+LPGFNYGDD+EEE+YK KQNGSRYDGD+GD NDDEEDDE+HDDDANHV+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDAG+E SSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+NEKLCSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein1.6e-29793.09Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRR ++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K KQNGS+YDGD+GD NDDEEDDE++D++ N V+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDAG+EGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like1.6e-29792.91Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRR+++ +DEEE   DLKR KLLGE S SSSPPASENP+LPGFNYGDDDEEED+K KQNGS+YD D+GD NDDEEDDE+HDD  N V+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRI+KKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAG++GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KL SKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1CND9 U4/U6.U5 tri-snRNP-associated protein 2-like1.7e-30295.45Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKRRDDN VDEEE A ++KRQKLLGEFS SS PPASENPRLPGFNYGDDDEEEDYK KQNGSR  GD GD NDDEEDDE +DDDANHV+RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF KEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
         LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGE ITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPDEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like4.7e-29794Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKR++D+ VDEEE   DLKR K LGE SS SSPPASENP+LPGFNYGDDDEEEDYK KQNGS YDGD+GDG DDEE+DED     NH+ RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE+EKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like4.4e-29593.45Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK
        MGSKR++D+ VDEEE   DLKR K LGE  S SSPPASENP+LPGFNYGDDDEEEDYK KQNGS YDGD+GD  DDEE+DED     NH+ RSRDVEVRK
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK
        DCPYLDTVNRQVLDFDFEKFCSV LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQV+
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVK

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR+SKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRV RLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE+EKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD

Query:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD12.7e-4429.18Show/hide
Query:  YLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPS----LDDIRYVLNPRFAKEQV
        YL+TV R+ LDFD EK C + LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP   +I        L+ I++   P +     
Subjt:  YLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPS----LDDIRYVLNPRFAKEQV

Query:  KQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK
        K L+   +    L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R     +KIW  + FK  +S  +F+      S  
Subjt:  KQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK

Query:  RFRIGAQSDPVE---FMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKD
        + R G   +P++   F+ W  N + S    S    SI+    +G++++ K        E K    +   G     VI++    PF +L LDLP    F+D
Subjt:  RFRIGAQSDPVE---FMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKD

Query:  VMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIAN
            + +PQ+ +  +L KF         R       + + RLPQ+LI H  RF +N+          + PVKN   ++   +    E E L  KY L AN
Subjt:  VMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIAN

Query:  IVH--------DGK----PDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        +VH        DG      ++ ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  IVH--------DGK----PDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 22.3e-13151.32Show/hide
Query:  KGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKV
        K +   DE+ + + +  A + R   +    + CPYLDT+NR VLDFDFEK CS+ LS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K 
Subjt:  KGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKV

Query:  YCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------
        YCLPD YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       
Subjt:  YCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------

Query:  LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKEN
        LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE 
Subjt:  LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKEN

Query:  GDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEK
            D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++ +LP YLI  ++RFTKNNFFVEK
Subjt:  GDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEK

Query:  NPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        NPT+VNFP+ N++L++Y+       ++   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  NPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 23.5e-13251.75Show/hide
Query:  DEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + + +  A + R   +    + CPYLDT+NR VLDFDFEK CS+ LS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA

Query:  GSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVN
          E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++ +LP YLI  ++RFTKNNFFVEKNPT+VN
Subjt:  GSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVN

Query:  FPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        FP+ N++L++Y+       ++   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  FPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 21.0e-13151.75Show/hide
Query:  DEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + + +  A + R   +    + CPYLDT+NR VLDFDFEK CS+  S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L    KK  +I+ + FQG + +   K  H     E+KE     D 
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL-RISKKSSSIIYECFQGELEVV--KEIHSKALTEKKENGDDQDA

Query:  GSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVN
          E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++ +LP YLI  ++RFTKNNFFVEKNPT+VN
Subjt:  GSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVN

Query:  FPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        FP+ N++L++Y+       +E   + YDLIANIVHDGKP EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  FPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp108.4e-10242.06Show/hide
Query:  EEDYKLKQNGSRYDGDKGDGNDDEEDDED-HDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTH
        EED  +  NG R   + G      +D ED HD  +  +       +     YLDT+NR++LDFDFEK CSV L+NL+VYACLVCG+Y+QGRG  SHAY H
Subjt:  EEDYKLKQNGSRYDGDKGDGNDDEEDDED-HDDDANHVRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTH

Query:  SLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLI
        +L   HHV++N  T K Y LP+ Y++   +L DI YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+FL+
Subjt:  SLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLI

Query:  PENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISK----KSSSIIYECFQGELEVVKEI
         +N+ +C   LV R   L RK+W+ + FK  VSP E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I
Subjt:  PENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISK----KSSSIIYECFQGELEVVKEI

Query:  HSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRR
         S+ + +  E G  +     G  VI +T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ ++  P Y I H++R
Subjt:  HSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRR

Query:  FTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD----GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQ
        F KNN+F E+N T+V FP+ + ++  +I     + N K+ +KY+L+ANI+H+     + +   +R+ ++  S   WY++QDL+V E    M+ L E+++Q
Subjt:  FTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHD----GKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQ

Query:  IYER
        ++ER
Subjt:  IYER

Arabidopsis top hitse value%identityAlignment
AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein1.9e-22672.42Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFN-YGDDDEEEDYKLKQNGSRYDG-DKGDGNDD-------EEDDEDHDDDAN--H
        M  +R   NGV EEE   ++KR++++    S   P    NP LP  N Y DDDEEE+ + K++ +R +G  KG+GN +       EE D+D DDD +   
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFN-YGDDDEEEDYKLKQNGSRYDG-DKGDGNDD-------EEDDEDHDDDAN--H

Query:  VRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV
         + SR VEVR+DCPYLDTVNRQVLDFDFE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+V
Subjt:  VRRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV

Query:  LNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF
        LNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEF
Subjt:  LNPRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF

Query:  LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLP
        LQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G+E      E SRM FLMLGLDLP
Subjt:  LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLP

Query:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLC
        PPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E +C
Subjt:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLC

Query:  SKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        SKY+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  SKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein8.3e-22272.15Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVR--RSRDVEV
        M  +R   NGV EEE   ++KR++++ E S S  PP                     K + NG++    KG+   + +DDED DDDA+  R   SR VEV
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVR--RSRDVEV

Query:  RKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQ
        R+DCPYLDTVNRQVLDFDFE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ Q
Subjt:  RKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQ

Query:  VKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK
        V +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK
Subjt:  VKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASK

Query:  KRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVM
        KRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G+E      E SRMPFLMLGLDLPPPPLFKDVM
Subjt:  KRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVM

Query:  EKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLCSKYDLIANI
        EKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E +CSKY+LIANI
Subjt:  EKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLCSKYDLIANI

Query:  VHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        VHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  VHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein8.0e-22571.96Show/hide
Query:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGND-------DEEDDEDHDDDANHVR--
        M  +R   NGV EEE   ++KR++++    S   P    NP LP  N  DDD  +  K +   +     +G+GN        + +DDED DDDA+  R  
Subjt:  MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGND-------DEEDDEDHDDDANHVR--

Query:  RSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN
         SR VEVR+DCPYLDTVNRQVLDFDFE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLN
Subjt:  RSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN

Query:  PRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ
        PRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQ
Subjt:  PRFAKEQVKQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ

Query:  AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPP
        AVMKASKKRFRIG QSDPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G+E      E SRMPFLMLGLDLPPP
Subjt:  AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPP

Query:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLCSK
        PLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E +CSK
Subjt:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLCSK

Query:  YDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        Y+LIANIVHDGKP++GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  YDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein3.0e-16379.44Show/hide
Query:  VNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQ
        V  QVLDF FE+FCSV LSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+Q
Subjt:  VNRQVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IPENYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR SK +SSII++CFQGELEVVKE                  G+E      E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRISKKSSSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP++ARMRYRVI+ P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPRIARMRYRVIRLPQYLILHMRRFTKNNFFVEKNPTL

AT4G22420.1 Ubiquitin-specific protease family C19-related protein2.9e-0948.15Show/hide
Query:  QKLLGEFSSSSSPPAS-ENPRLPGFN-YGDDDEEEDYKLKQNGSRYDG-DKGDGN---------DDEEDDEDHDDDANHVR--RSRDVEVRKDCPYLDTV
        +K + E S S  PP    N  LP  N Y DDDEEE  +LK++ +R +G  KG+GN         ++ +D+ED DDDA+  R   SR VEVR+DCPYLDTV
Subjt:  QKLLGEFSSSSSPPAS-ENPRLPGFN-YGDDDEEEDYKLKQNGSRYDG-DKGDGN---------DDEEDDEDHDDDANHVR--RSRDVEVRKDCPYLDTV

Query:  NRQVLDFD
        NRQV+  D
Subjt:  NRQVLDFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCAAAGAGACGAGACGATAATGGGGTAGATGAGGAAGAGTTTGCTTCAGACTTAAAGAGGCAGAAATTACTAGGGGAGTTCTCATCATCTTCTTCTCCACCTGC
CTCAGAGAACCCTCGGCTTCCCGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTATAAATTGAAACAAAATGGAAGTAGATATGATGGAGATAAAGGGGATGGCA
ATGATGATGAGGAAGATGATGAAGACCATGATGATGATGCGAATCATGTAAGGCGAAGCCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTTAACCGT
CAGGTTTTGGATTTCGATTTCGAGAAGTTTTGCTCTGTCTGTCTCTCGAATCTGAATGTTTATGCCTGCCTGGTATGTGGTAAGTACTATCAAGGGAGGGGGAAGAAGTC
TCATGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAGAGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCATCCTTAG
ATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAAAGCAGCTTGACAAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGA
ATGGTAGGGCTTAACAACATTAAGGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCTGAGAACTATCA
GCACTGCAGATCTCCCCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTA
TGAAGGCTAGTAAAAAACGTTTCCGAATAGGTGCGCAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAATACACTTCATTCAGAATTGCGAATTTCAAAGAAAAGT
AGCAGTATAATCTACGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCGAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAAG
TGAAGGCAGCAGTGTTATAATGGAAACATCCAGAATGCCATTCTTAATGCTTGGATTGGATTTGCCACCACCACCTCTTTTCAAAGACGTTATGGAGAAAAATATAATAC
CACAGGTTCCACTCTTCAATATTTTGAAGAAATTTGATGGTGAAACTATCACAGAAGTTGTCCGTCCACGTATAGCAAGAATGCGCTACCGTGTCATTCGATTGCCTCAA
TATTTAATTCTTCATATGCGACGATTTACAAAGAACAACTTTTTTGTGGAGAAAAATCCCACATTAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATCCC
CCTGCCAACACCTAAAGAGAATGAAAAATTGTGTTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGCAAACCGGACGAAGGGTACTACAGGGTATTTGTACAGA
GGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGGCAGCAA
TAG
mRNA sequenceShow/hide mRNA sequence
CAAACATAGACTCAACTCAATTCAATTATATACTCTAAACATAAACTTCCTGCCTCAACTCAATTTTGCTCTCGAAACGAGACTTTTGAATTAACTCATAAACTAACTTT
GTTCAACTCAAACTTGCCTCATCTCAAATTCTAAAGGGGAAAACCCCTAAGATTGCTTTGCAAAATAAAATAAAATTGCATAAAATAAAGAGGAAAACCCTAACTGCTCC
ACGCCGCTCCTTCTTCTTCACTCCCTCTCCTCCTCCCGCCGTGAGTCATCTGGCCGACGATTCTTCTTTCCTCGCCTAAATACAAAGAAACTACAGTTCAGTCGAAGTAA
ACAACTCCAGGGTTTGCAACTGGAAGAGGATTTCAGTTGTCCACGTTTTTTCTTCGTTTGAATCGATTGCTCTGCGTGAGCCGGAACGTTTGAGCTACATCAAAACATGC
TGGAATTTGATAGACGACCAGCCAGATGAGAGAGAATAAAGAGTCAATAGAATGGGATCAAAGAGACGAGACGATAATGGGGTAGATGAGGAAGAGTTTGCTTCAGACTT
AAAGAGGCAGAAATTACTAGGGGAGTTCTCATCATCTTCTTCTCCACCTGCCTCAGAGAACCCTCGGCTTCCCGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATT
ATAAATTGAAACAAAATGGAAGTAGATATGATGGAGATAAAGGGGATGGCAATGATGATGAGGAAGATGATGAAGACCATGATGATGATGCGAATCATGTAAGGCGAAGC
CGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTTAACCGTCAGGTTTTGGATTTCGATTTCGAGAAGTTTTGCTCTGTCTGTCTCTCGAATCTGAATGT
TTATGCCTGCCTGGTATGTGGTAAGTACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCACAGTCTTGAAGCAGGACACCACGTGTATATCAACCTTCGGACAG
AGAAAGTTTACTGCCTTCCTGATGGATATGAGATTAATGACCCATCCTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAAAGCAGCTTGAC
AAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGAATGGTAGGGCTTAACAACATTAAGGAAACTGATTTTGTAAATGTGACAATTCAGTCCTT
AATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCTGAGAACTATCAGCACTGCAGATCTCCCCTTGTCCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATG
CAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTATGAAGGCTAGTAAAAAACGTTTCCGAATAGGTGCGCAGTCAGATCCTGTTGAATTTATG
TCATGGTTTCTTAATACACTTCATTCAGAATTGCGAATTTCAAAGAAAAGTAGCAGTATAATCTACGAATGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTC
GAAAGCTCTCACTGAGAAGAAAGAAAATGGTGACGATCAGGATGCTGGAAGTGAAGGCAGCAGTGTTATAATGGAAACATCCAGAATGCCATTCTTAATGCTTGGATTGG
ATTTGCCACCACCACCTCTTTTCAAAGACGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAATATTTTGAAGAAATTTGATGGTGAAACTATCACAGAAGTT
GTCCGTCCACGTATAGCAAGAATGCGCTACCGTGTCATTCGATTGCCTCAATATTTAATTCTTCATATGCGACGATTTACAAAGAACAACTTTTTTGTGGAGAAAAATCC
CACATTAGTGAACTTTCCTGTCAAGAATCTGGAATTGAAGGATTACATCCCCCTGCCAACACCTAAAGAGAATGAAAAATTGTGTTCAAAGTACGATTTGATTGCAAATA
TTGTTCATGATGGCAAACCGGACGAAGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCT
CAGATGGTTGCTCTCTCTGAGGCTTATATGCAAATATATGAACGGCAGCAATAGATAGGAAGTTCGATCTCTGCATCTAACTGCGTAGTTGGTTTATTTCCCCTCTAGAA
CATATGTATCTGTACCTAAACAGGAGATGGGAGAAGATGACGTTTTGAAATGGTGGATATAAAGGAAACTTATTCATTCAGCATATTTGAAGCTGAAGCTGTGATTGAGG
ATAATATTGCTTATTGCTGCTAATATCTTTGGTAAATTGTAGAATCAATTGTCATTTTAAACCTACTTTGAAAAGTTAAACTGAAAAAGGCAGGTGAAATTTTGTTGCCT
AAATTACTTCAAAATCTCTGTGTTGGGGCAACTTAATCGTTTGGAAGACTATTTGCAGAGTTGCCAGTGCGTCTTAGAGCTTTGATTTCATGGTAGCATTCAACTAAAGC
TTAATTTTGAATTTC
Protein sequenceShow/hide protein sequence
MGSKRRDDNGVDEEEFASDLKRQKLLGEFSSSSSPPASENPRLPGFNYGDDDEEEDYKLKQNGSRYDGDKGDGNDDEEDDEDHDDDANHVRRSRDVEVRKDCPYLDTVNR
QVLDFDFEKFCSVCLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVKQLDKNKQWSRALDGSDYLPG
MVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRISKKS
SSIIYECFQGELEVVKEIHSKALTEKKENGDDQDAGSEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVIRLPQ
YLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLCSKYDLIANIVHDGKPDEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ