; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0006607 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0006607
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationchr12:12796220..12800140
RNA-Seq ExpressionPI0006607
SyntenyPI0006607
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035622.1 U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo var. makuwa]4.4e-28393.92Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVS
        MGSKRRNNSLLDEEELGPDLKRHKLL EVSPSSSPPASENPQLP  ++Y +D+E +E+     N +KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVS
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVS

Query:  LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AADKNKQWSRALDGSDYLPGMVGL
        LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +      DKNKQWSRALDGSDYLPGMVGL
Subjt:  LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AADKNKQWSRALDGSDYLPGMVGL

Query:  NNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL
        NNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL
Subjt:  NNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL

Query:  RITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVV
        RITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGT+GSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVV
Subjt:  RITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVV

Query:  RPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQ
        RPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD+DKLRSKYDLIAN+VHDGKPNEG YRVFVQRKSEELWYEMQ
Subjt:  RPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQ

Query:  DLHVSETLPQMVALSEAYMQIYERQQ
        DLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  DLHVSETLPQMVALSEAYMQIYERQQ

XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo]2.3e-29292.73Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP                        DE DYN+DEEDDEEHDDHGNQVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +    
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----

Query:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
          DKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGT+GSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD+DKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus]4.1e-28992Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP                        DE DYN+DEEDDEE+D++GNQVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +    
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----

Query:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
          DKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD+DKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida]1.8e-28491.27Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSP SSPPASENPQLP                        DE D N+DEEDDEEHDD  N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +    
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----

Query:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
          DKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAGTE SSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD++KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]1.8e-28491.27Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSP SSPPASENPQLP                        DE D N+DEEDDEEHDD  N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +    
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----

Query:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
          DKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAGTE SSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD++KL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein2.0e-28992Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP                        DE DYN+DEEDDEE+D++GNQVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +    
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----

Query:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
          DKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENG+EQDAGTEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD+DKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like1.1e-29292.73Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP                        DE DYN+DEEDDEEHDDHGNQVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLP------------------------DEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +    
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----

Query:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
          DKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGT+GSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD+DKLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A5D3E5N4 U4/U6.U5 tri-snRNP-associated protein 2-like2.1e-28393.92Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVS
        MGSKRRNNSLLDEEELGPDLKRHKLL EVSPSSSPPASENPQLP  ++Y +D+E +E+     N +KRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVS
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVS

Query:  LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AADKNKQWSRALDGSDYLPGMVGL
        LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +      DKNKQWSRALDGSDYLPGMVGL
Subjt:  LSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AADKNKQWSRALDGSDYLPGMVGL

Query:  NNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL
        NNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL
Subjt:  NNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSEL

Query:  RITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVV
        RITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGT+GSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVV
Subjt:  RITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVV

Query:  RPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQ
        RPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKD+DKLRSKYDLIAN+VHDGKPNEG YRVFVQRKSEELWYEMQ
Subjt:  RPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQ

Query:  DLHVSETLPQMVALSEAYMQIYERQQ
        DLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  DLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like3.9e-27788.85Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEE---------------------DDEEHDDHGNQVKRSRDVEVRKDCP
        MGSKR+N+S++DEEELGPDLKRHK LGE SP SSPPASENPQLP  ++Y +D+E                     DDEE+D+  N + RSRDVEVRKDCP
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEE---------------------DDEEHDDHGNQVKRSRDVEVRKDCP

Query:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AAD
        YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +      D
Subjt:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AAD

Query:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRI
        K KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRI
Subjt:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRI

Query:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNII
        GAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAGTEGSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNII
Subjt:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNII

Query:  PQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKP
        PQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+S+KLRSKYDLIANIVHDGKP
Subjt:  PQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKP

Query:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like6.0e-27889.03Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEE---------------------DDEEHDDHGNQVKRSRDVEVRKDCP
        MGSKR+N+S++DEEELGPDLKRHK LGE+SP SSPPASENPQLP  ++Y +D+E                     DDEE+D+  N + RSRDVEVRKDCP
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEE---------------------DDEEHDDHGNQVKRSRDVEVRKDCP

Query:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AAD
        YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPR  +      D
Subjt:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGR----AAD

Query:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRI
        K KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRI
Subjt:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRI

Query:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNII
        GAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDAGTEGSSV+METSRMPFLMLGLDLPPPPLFKDVMEKNII
Subjt:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNII

Query:  PQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKP
        PQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+S+KLRSKYDLIANIVHDGKP
Subjt:  PQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKP

Query:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD18.4e-4328.46Show/hide
Query:  EDDEEHDDHGNQVKRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD
        E D +     +++K+    +++   P   YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP 
Subjt:  EDDEEHDDHGNQVKRSRDVEVRKDCP---YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD

Query:  GYEINDPS----LDDIRYVLNPR-AGRAADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRK
          +I        L+ I++   P    +  +   +    L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R     +K
Subjt:  GYEINDPS----LDDIRYVLNPR-AGRAADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRK

Query:  IWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGS
        IW  + FK  +S  +F+      S  + R G   +P++   F+ W  N + S     K   SI+    +G          K  I K EN  E      G 
Subjt:  IWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGS

Query:  SVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNL
         +V      PF +L LDLP    F+D    + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+          + PVKN 
Subjt:  SVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNL

Query:  ELKDYIPLPTPKDSDKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
          ++   +    + + L  KY L AN+VH        DG    G    ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  ELKDYIPLPTPKDSDKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 21.0e-12850.97Show/hide
Query:  VSPSSSPPASE--NPQLPDEWDYNEDEEDDEEHDDHGNQVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKS
        V   + P A E   P LP      E E D++   +   + K  R D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KS
Subjt:  VSPSSSPPASE--NPQLPDEWDYNEDEEDDEEHDDHGNQVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKS

Query:  HAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLR
        HAY HS++  HHV++NL T K YCLPD YEI D SL+DI YVL P   +      DK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLR
Subjt:  HAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLR

Query:  NFFLIPKNYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQ
        N+FL   NY++ + P       LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQ
Subjt:  NFFLIPKNYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQ

Query:  GELEVV--KEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRV
        G + +   K  H     E+KE     D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    + +  + R+++
Subjt:  GELEVV--KEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRV

Query:  TRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQM
        T+LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N++L++Y  L     +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM
Subjt:  TRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQM

Query:  VALSEAYMQIYERQ
        + LSEAY+QI++R+
Subjt:  VALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 21.3e-12850.2Show/hide
Query:  EVSPSSS--PPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHA
        E  P+S+   PAS  P +  + +   DE+ + E +      +   +    + CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHA
Subjt:  EVSPSSS--PPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHA

Query:  YTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNF
        Y HS++  HHV++NL T K YCLPD YEI D SL+DI YVL P   +      DK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+
Subjt:  YTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNF

Query:  FLIPKNYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGE
        FL   NY++ + P       LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG 
Subjt:  FLIPKNYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGE

Query:  LEVV--KEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTR
        + +   K  H     E+KE     D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    + +  + R+++T+
Subjt:  LEVV--KEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTR

Query:  LPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVA
        LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N++L++Y  L     +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ 
Subjt:  LPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVA

Query:  LSEAYMQIYERQ
        LSEAY+QI++R+
Subjt:  LSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 21.4e-12750Show/hide
Query:  EVSPSSS--PPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHA
        E  P+S+   PAS  P +  + +   DE+ + E +      +   +    + CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHA
Subjt:  EVSPSSS--PPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHA

Query:  YTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNF
        Y HS++  HHV++NL T K YCLPD YEI D SL+DI YVL P   +      DK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+
Subjt:  YTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNF

Query:  FLIPKNYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGE
        FL   NY++ + P       LV RFGEL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG 
Subjt:  FLIPKNYQHCRSP-------LVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGE

Query:  LEVV--KEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTR
        + +   K  H     E+KE     D   E    ++E++   F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    + +  + R+++T+
Subjt:  LEVV--KEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTR

Query:  LPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVA
        LP YLI  ++RFTKNNFFVEKNPT+VNFP+ N++L++Y  L     +    + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ 
Subjt:  LPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVA

Query:  LSEAYMQIYERQ
        LSEAY+QI++R+
Subjt:  LSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp103.7e-9941.93Show/hide
Query:  DEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIN
        ++ HD    +++      +     YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y LP+ Y++ 
Subjt:  DEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEIN

Query:  DPSLDDIRYVLNPRAGR----AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARN
          +L DI YV+ P   +      D   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+FL+ KN+ +C   LV R   L RK+W+ + 
Subjt:  DPSLDDIRYVLNPRAGR----AADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARN

Query:  FKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVME
        FK  VSP E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ + +  E G EQ   T     V++
Subjt:  FKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVME

Query:  TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY
        T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H++RF KNN+F E+N T+V FP+ + ++  +
Subjt:  TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY

Query:  IPLPTPKDSDKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
        I     + + K+ +KY+L+ANI+H+     +     +R+ ++  S   WY++QDL+V E    M+ L E+++Q++ER
Subjt:  IPLPTPKDSDKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

Arabidopsis top hitse value%identityAlignment
AT1G32850.1 ubiquitin-specific protease 116.6e-1128.65Show/hide
Query:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------HIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP
        P + K+V+ K  + + + LF+ L+ F  E   E + P              A  +  + +LP  L+ H++RFT + +F  K  TLVNF + +L+L  Y+ 
Subjt:  PPLFKDVMEKNIIPQ-VPLFNILKKFDGETITEVVRP------------HIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP

Query:  LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
            K+ D     Y+L A   H G    G+Y  + +   E  WY   D  VS      +  S AY+  Y+R
Subjt:  LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein1.5e-21277.15Show/hide
Query:  EDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD
        E+ +DDE+ D    + K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD
Subjt:  EDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPD

Query:  GYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKI
         YEINDPSLDDIR+VLNPR  RA     DKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP+NYQHC+SPLVHRFGELTRKI
Subjt:  GYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKI

Query:  WHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVM
        WHARNFKGQVSPHEFLQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE            G+E            
Subjt:  WHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVM

Query:  ETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKD
        E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+D
Subjt:  ETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKD

Query:  YIP-LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        YIP LP   + + + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  YIP-LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein1.1e-21271.37Show/hide
Query:  GSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPP-----ASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF
        G +   N + +EE    ++KR +++ E S S  PP          ++  E     D+++D++ D    + K SR VEVR+DCPYLDTVNRQVLDFDFE+F
Subjt:  GSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPP-----ASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKF

Query:  CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPG
        CSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPR  RA     DKN+QWSRALDGSDYLPG
Subjt:  CSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPG

Query:  MVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTL
        MVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP+NYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QSDPVEFMSW LNTL
Subjt:  MVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTL

Query:  HSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI
        H +LR +K +SSII++CFQGELEVVKE            G+E            E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+
Subjt:  HSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETI

Query:  TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEEL
        TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   + + + SKY+LIANIVHDGKP +GY+RVFVQRKS+EL
Subjt:  TEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEEL

Query:  WYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        WYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  WYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein1.1e-21277.1Show/hide
Query:  DEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        D+++D++ D    + K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD 
Subjt:  DEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIW
        YEINDPSLDDIR+VLNPR  RA     DKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIP+NYQHC+SPL HRFGELTRKIW
Subjt:  YEINDPSLDDIRYVLNPRAGRA----ADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIW

Query:  HARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVME
        HARNFKGQVSPHEFLQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE            G+E            E
Subjt:  HARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVME

Query:  TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY
         SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DY
Subjt:  TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY

Query:  IP-LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        IP LP   + + + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  IP-LPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein2.1e-15878.31Show/hide
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPR  RA     DKN+Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRA----ADKNKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IP+NYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE            G+E            E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKENGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGAAGAGGCGGAACAATAGTCTTCTAGATGAGGAGGAGTTGGGTCCGGACTTAAAGAGGCATAAATTACTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGC
CTCAGAGAATCCTCAGCTTCCGGATGAATGGGATTACAATGAGGATGAGGAAGATGATGAAGAACATGACGATCATGGAAATCAAGTTAAGCGAAGTCGTGATGTTGAAG
TTCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGTCAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTTAATGTTTATGCCTGCCTG
GTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCATAGTCTTGAAGCTGGCCATCATGTGTATATCAACCTTCGGACAGAGAAAGTTTACTG
CCTTCCCGATGGATATGAGATTAATGACCCCTCGTTAGATGATATACGATATGTCCTGAATCCAAGAGCAGGTAGAGCAGCTGACAAGAACAAGCAATGGTCTAGGGCAC
TCGATGGTTCTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGAGTTACACCACTCAGGAAC
TTCTTCCTAATACCCAAGAACTATCAGCACTGCAGATCTCCACTTGTTCACCGGTTTGGCGAACTTACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAG
CCCGCATGAATTCCTGCAAGCCGTTATGAAGGCTAGTAAAAAACGTTTTCGAATAGGTGCGCAGTCAGATCCCGTTGAGTTTATGTCATGGTTTCTTAACACACTTCATT
CAGAATTGCGAATTACAAAGAAAAGTAGCAGTATAATCTACGAGTGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCCAAAGCTCTCATTGAGAAGAAAGAA
AATGGCGACGAGCAGGATGCTGGAACTGAAGGTAGCAGTGTGGTAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATCTACCGCCGCCACCTCTTTTCAA
AGATGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACATATAGCAAGGATGC
GTTACCGGGTTACTCGATTGCCTCAGTACTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAACTTTCCTGTCAAG
AATCTAGAATTGAAGGATTACATCCCCCTGCCAACACCTAAAGATAGCGATAAATTGCGGTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGCAAACCCAATGA
AGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTT
ATATGCAAATATATGAACGGCAACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATCGAAGAGGCGGAACAATAGTCTTCTAGATGAGGAGGAGTTGGGTCCGGACTTAAAGAGGCATAAATTACTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGC
CTCAGAGAATCCTCAGCTTCCGGATGAATGGGATTACAATGAGGATGAGGAAGATGATGAAGAACATGACGATCATGGAAATCAAGTTAAGCGAAGTCGTGATGTTGAAG
TTCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGTCAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTTAATGTTTATGCCTGCCTG
GTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTCTCATGCTTACACTCATAGTCTTGAAGCTGGCCATCATGTGTATATCAACCTTCGGACAGAGAAAGTTTACTG
CCTTCCCGATGGATATGAGATTAATGACCCCTCGTTAGATGATATACGATATGTCCTGAATCCAAGAGCAGGTAGAGCAGCTGACAAGAACAAGCAATGGTCTAGGGCAC
TCGATGGTTCTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGAGTTACACCACTCAGGAAC
TTCTTCCTAATACCCAAGAACTATCAGCACTGCAGATCTCCACTTGTTCACCGGTTTGGCGAACTTACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAG
CCCGCATGAATTCCTGCAAGCCGTTATGAAGGCTAGTAAAAAACGTTTTCGAATAGGTGCGCAGTCAGATCCCGTTGAGTTTATGTCATGGTTTCTTAACACACTTCATT
CAGAATTGCGAATTACAAAGAAAAGTAGCAGTATAATCTACGAGTGTTTCCAGGGGGAATTGGAGGTTGTGAAAGAGATTCACTCCAAAGCTCTCATTGAGAAGAAAGAA
AATGGCGACGAGCAGGATGCTGGAACTGAAGGTAGCAGTGTGGTAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATCTACCGCCGCCACCTCTTTTCAA
AGATGTTATGGAGAAAAATATAATACCACAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACATATAGCAAGGATGC
GTTACCGGGTTACTCGATTGCCTCAGTACTTAATTCTTCATATGCGGCGATTTACGAAGAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAACTTTCCTGTCAAG
AATCTAGAATTGAAGGATTACATCCCCCTGCCAACACCTAAAGATAGCGATAAATTGCGGTCAAAGTACGATTTGATTGCAAATATTGTTCATGATGGCAAACCCAATGA
AGGGTACTACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTT
ATATGCAAATATATGAACGGCAACAATAG
Protein sequenceShow/hide protein sequence
MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPDEWDYNEDEEDDEEHDDHGNQVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACL
VCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRAGRAADKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRN
FFLIPKNYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALIEKKE
NGDEQDAGTEGSSVVMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVK
NLELKDYIPLPTPKDSDKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ