; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G10580 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G10580
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationClcChr01:14600606..14618689
RNA-Seq ExpressionClc01G10580
SyntenyClc01G10580
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583754.1 U4/U6.U5 tri-snRNP-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]4.9e-30194.73Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKR+N+S++DE+ELGPDLKRHK LGE SP SSPPASENP+LPGFNYGDDDEEEDYK KQNGS YDGDEGDG DDEE+DED     NH+ RSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQ+AG EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE+EKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo]6.3e-30996.18Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRRNNSLLDE+ELGPDLKRHKLLGEVSPSSSPPASENP+LPGFNYGDDDEEED+KFKQNGS+YD DEGD NDDEEDDE+HDD  N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+Q+AG +GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus]6.0e-30795.82Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRR+NSLLDE+ELGPDLKRHKLLGEVSPSSSPPASENP+LPGFNYGDDDEEED+KFKQNGS+YDGDEGD NDDEEDDE++D++ N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++Q+AG EGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida]4.2e-31097.27Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRRNNSLLDE+ELGPDLKRHKLLGEVSP SSPPASENP+LPGFNYGDD+EEE+YKFKQNGSRYDGDEGD NDDEEDDE+HDDDANHVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQ+AG E SSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+NEKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]4.2e-31097.27Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRRNNSLLDE+ELGPDLKRHKLLGEVSP SSPPASENP+LPGFNYGDD+EEE+YKFKQNGSRYDGDEGD NDDEEDDE+HDDDANHVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQ+AG E SSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+NEKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein2.9e-30795.82Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRR+NSLLDE+ELGPDLKRHKLLGEVSPSSSPPASENP+LPGFNYGDDDEEED+KFKQNGS+YDGDEGD NDDEEDDE++D++ N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++Q+AG EGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like3.1e-30996.18Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRRNNSLLDE+ELGPDLKRHKLLGEVSPSSSPPASENP+LPGFNYGDDDEEED+KFKQNGS+YD DEGD NDDEEDDE+HDD  N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+Q+AG +GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+N+KLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1CND9 U4/U6.U5 tri-snRNP-associated protein 2-like1.6e-30094.36Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKRR+++ +DE+EL P++KR KLLGE SPSS PPASENPRLPGFNYGDDDEEEDYKFKQNGSR  GD GD NDDEEDDE +DDDANHVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF K+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
         LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQ+AG+EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGE ITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like2.4e-30194.73Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKR+N+S++DE+ELGPDLKRHK LGE SP SSPPASENP+LPGFNYGDDDEEEDYK KQNGS YDGDEGDG DDEE+DED     NH+ RSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQ+AG EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE+EKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like3.5e-30094.36Show/hide
Query:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK
        MGSKR+N+S++DE+ELGPDLKRHK LGE+SP SSPPASENP+LPGFNYGDDDEEEDYK KQNGS YDGDEGD  DDEE+DED     NH+ RSRDVEVRK
Subjt:  MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAK+QVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
        QLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQ+AG EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE+EKLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD11.5e-4528.97Show/hide
Query:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPS----LDDIRYVLNPRFAKDQV
        YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y LP   +I        L+ I++   P +    +
Subjt:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPS----LDDIRYVLNPRFAKDQV

Query:  EQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK
        E   +       L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R     +KIW  + FK  +S  +F+      S  
Subjt:  EQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKK

Query:  RFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKD
        + R G   +P++   F+ W  N + S     K   SI+    +G++++           K EN  +      G  ++      PF +L LDLP    F+D
Subjt:  RFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKD

Query:  VMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIAN
            + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+          + PVKN   ++   +    E E L  KY L AN
Subjt:  VMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIAN

Query:  IVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        +VH        DG    G    ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  IVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 21.4e-13352.04Show/hide
Query:  DEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + + +  A + +   +    + CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H    AE+KE    N +
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD

Query:  DQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKN
         QE   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    + +  + R+++T+LP YLI  ++RFTKNNFFVEKN
Subjt:  DQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKN

Query:  PTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        PT+VNFP+ N++L++Y+       ++   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  PTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 21.4e-13352.04Show/hide
Query:  DEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + + +  A + +   +    + CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H    AE+KE    N +
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD

Query:  DQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKN
         QE   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    + +  + R+++T+LP YLI  ++RFTKNNFFVEKN
Subjt:  DQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKN

Query:  PTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        PT+VNFP+ N++L++Y+       ++   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  PTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 24.1e-13352.04Show/hide
Query:  DEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + + +  A + +   +    + CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H    AE+KE    N +
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD

Query:  DQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKN
         QE   E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    + +  + R+++T+LP YLI  ++RFTKNNFFVEKN
Subjt:  DQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITE--VVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKN

Query:  PTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        PT+VNFP+ N++L++Y+       +E   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  PTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp109.9e-10342.92Show/hide
Query:  EDDED-HDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGY
        +D ED HD  +  ++      +     YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY H+L   HHV++N  T K Y LP+ Y
Subjt:  EDDED-HDDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGY

Query:  EINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWH
        ++   +L DI YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+FL+ +N+ +C   LV R   L RK+W+
Subjt:  EINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWH

Query:  ARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSV
         + FK  VSP E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +     I S+ + +  E G  ++    G  V
Subjt:  ARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSV

Query:  IMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLEL
        I +T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H++RF KNN+F E+N T+V FP+ + ++
Subjt:  IMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLEL

Query:  KDYIPLPTPKENEKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER
          +I     + N K+ +KY+L+ANI+H+     +     +R+ ++  S   WY++QDL+V E    M+ L E+++Q++ER
Subjt:  KDYIPLPTPKENEKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYER

Arabidopsis top hitse value%identityAlignment
AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein9.2e-22171.53Show/hide
Query:  GSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPAS-ENPRLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN-------DDEEDDEDHDDDANH
        G +   N + +E+    ++KR +++ E S S  PP    NP LP  N Y DDDEEE+ + K++ +R +G    EG+GN       ++ +DDED D     
Subjt:  GSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPAS-ENPRLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN-------DDEEDDEDHDDDANH

Query:  VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV
         K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+V
Subjt:  VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV

Query:  LNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF
        LNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEF
Subjt:  LNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF

Query:  LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLP
        LQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  EN               E SRM FLMLGLDLP
Subjt:  LQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLP

Query:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLR
        PPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + 
Subjt:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLR

Query:  SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein5.2e-21674.62Show/hide
Query:  GDDDEEEDYKFKQNGSRYDG--------DEGDGND-------DEEDDEDHDDDAN--HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVY
        G  +EE + K K+   R D          EG+GN        + +DDED DDDA+    K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVY
Subjt:  GDDDEEEDYKFKQNGSRYDG--------DEGDGND-------DEEDDEDHDDDAN--HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVY

Query:  ACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETD
        ACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+
Subjt:  ACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETD

Query:  FVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSS
        FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +S
Subjt:  FVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSS

Query:  SIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARM
        SII++CFQGELEVVKE       +  EN               E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP +ARM
Subjt:  SIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARM

Query:  RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSE
        RYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+E
Subjt:  RYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSE

Query:  TLPQMVALSEAYMQIYERQQ
        TLPQMV LSEAYMQIYE+Q+
Subjt:  TLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein1.3e-21971.43Show/hide
Query:  GSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPAS-ENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGND-------DEEDDEDHDDDAN--HVK
        G +   N + +E+    ++KR +++ E S S  PP    NP LP  N  DDD  +  K +   +     EG+GN        + +DDED DDDA+    K
Subjt:  GSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPAS-ENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGND-------DEEDDEDHDDDAN--HVK

Query:  RSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN
         SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLN
Subjt:  RSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN

Query:  PRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ
        PRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQ
Subjt:  PRFAKDQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ

Query:  AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPP
        AVMKASKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  EN               E SRMPFLMLGLDLPPP
Subjt:  AVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPP

Query:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLRSK
        PLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + SK
Subjt:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKENEKLRSK

Query:  YDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        Y+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  YDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein2.5e-16279.15Show/hide
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IPENYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  EN               E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP +ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPHIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL

AT4G22420.1 Ubiquitin-specific protease family C19-related protein4.2e-0845.08Show/hide
Query:  SLLDEKELGPDLKRHKLLGEVSPSSSPPAS-ENPRLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN-------DDEEDDEDHDDDAN--HVKRSR
        SL+D  E        K + E S S  PP    N  LP  N Y DDDEEE  + K++ +R +G    EG+GN       ++ +D+ED DDDA+    K SR
Subjt:  SLLDEKELGPDLKRHKLLGEVSPSSSPPAS-ENPRLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN-------DDEEDDEDHDDDAN--HVKRSR

Query:  DVEVRKDCPYLDTVNRQVLDFD
         VEVR+DCPYLDTVNRQV+  D
Subjt:  DVEVRKDCPYLDTVNRQVLDFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGAAGAGGCGGAACAATAGTCTTCTAGATGAGAAAGAGTTGGGTCCAGACTTAAAGAGGCATAAATTACTTGGGGAGGTGTCACCTTCTTCTTCTCCACCTGC
CTCAGAGAACCCTCGGCTTCCTGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTACAAATTTAAACAAAATGGAAGTAGATATGATGGAGATGAAGGGGATGGCA
ATGATGACGAAGAAGATGATGAAGACCATGACGATGATGCAAATCATGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATACTGTAAACCGT
CAGGTTTTAGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTAAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGGGAGGGGGAAGAAGTC
TCATGCTTACACTCACAGTCTTGAAGCCGGACACCATGTGTATATCAACCTCCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGACCCCTCGTTAG
ATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGACCAGGTAGAGCAGCTTGACAAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATTACCTTCCTGGA
ATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCTGAGAACTATCA
GCACTGCAGATCTCCACTTGTTCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCTGCAAGCAGTTA
TGAAGGCTAGTAAAAAGCGTTTTCGAATAGGTGCACAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTACAAAGAAAAGT
AGCAGTATAATCTACGAGTGTTTCCAGGGGGAATTGGAGGTTGTCAAAGAGATTCACTCAAAAGCTCTCGCCGAGAAGAAAGAAAATGGTGATGATCAGGAAGCTGGAGC
TGAAGGTAGCAGTGTGATAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATTTACCGCCGCCACCCCTTTTCAAAGATGTCATGGAGAAAAATATAATAC
CGCAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTCGTACGGCCACATATTGCAAGGATGCGCTACCGGGTTACTCGATTGCCTCAG
TACTTAATTCTTCATATGCGGCGATTTACAAAGAACAACTTTTTTGTGGAAAAGAATCCGACATTAGTGAACTTTCCTGTGAAGAATCTTGAATTGAAGGATTACATCCC
CTTGCCAACACCTAAAGAGAACGAAAAATTGCGTTCAAAATACGATCTGATTGCAAATATTGTTCATGATGGCAAACCCAATGAAGGGTACTACAGGGTATTCGTGCAGA
GGAAGTCTGAAGAATTATGGTATGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCAGATATATGAACGGCAGCAA
TAG
mRNA sequenceShow/hide mRNA sequence
GTTGTTTTTTGTGAAAGCATCGACATTTTGGGTCAAATGTACATCTAGCTGACGTCAACAAAACAAAAGACTATTTTTCATTATTTTTTAACTTTGGCCCAAATGAAAAA
AAGGGCCATCGTTACAAATTTCCTCCTCATCTTCTTCTTTCTTCTTCGCGTGAAAAAACCCGCCCGCTCCGTCCCTCCATTTCTCTCTCGCACGTTCATCGCACAGCCGC
TGCACGCCTCCGTCGGCGTCACCAGCCACGCCGCGCCGCCGCTCGTCACCGCAAGTCGCCGGTAGCGCCTCCGCTTGCTAGTCATCAATACACCACTGCCACGGATGTAC
GGTTGCCGGAATCGTAGTCCAGCAATCAGTTGGGTCGGAAGTGTTTGGGGTCGCTGCCCAACAAGGATATTGGGCTGAAGTCATATATATTCACCTCATTTATAAAAAAA
AAGTTGATAGAATGGGATCGAAGAGGCGGAACAATAGTCTTCTAGATGAGAAAGAGTTGGGTCCAGACTTAAAGAGGCATAAATTACTTGGGGAGGTGTCACCTTCTTCT
TCTCCACCTGCCTCAGAGAACCCTCGGCTTCCTGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTACAAATTTAAACAAAATGGAAGTAGATATGATGGAGATGA
AGGGGATGGCAATGATGACGAAGAAGATGATGAAGACCATGACGATGATGCAAATCATGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATCTTGATA
CTGTAAACCGTCAGGTTTTAGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTGTCAAATCTAAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAAGGGAGG
GGGAAGAAGTCTCATGCTTACACTCACAGTCTTGAAGCCGGACACCATGTGTATATCAACCTCCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGATTAATGA
CCCCTCGTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGACCAGGTAGAGCAGCTTGACAAGAACAAGCAATGGTCTAGGGCACTTGATGGTTCTGATT
ACCTTCCTGGAATGGTGGGGCTTAACAACATTAAAGAAACTGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTCCTAATACCT
GAGAACTATCAGCACTGCAGATCTCCACTTGTTCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGAATTCCT
GCAAGCAGTTATGAAGGCTAGTAAAAAGCGTTTTCGAATAGGTGCACAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGCGAATTA
CAAAGAAAAGTAGCAGTATAATCTACGAGTGTTTCCAGGGGGAATTGGAGGTTGTCAAAGAGATTCACTCAAAAGCTCTCGCCGAGAAGAAAGAAAATGGTGATGATCAG
GAAGCTGGAGCTGAAGGTAGCAGTGTGATAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATTTACCGCCGCCACCCCTTTTCAAAGATGTCATGGAGAA
AAATATAATACCGCAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTCGTACGGCCACATATTGCAAGGATGCGCTACCGGGTTACTC
GATTGCCTCAGTACTTAATTCTTCATATGCGGCGATTTACAAAGAACAACTTTTTTGTGGAAAAGAATCCGACATTAGTGAACTTTCCTGTGAAGAATCTTGAATTGAAG
GATTACATCCCCTTGCCAACACCTAAAGAGAACGAAAAATTGCGTTCAAAATACGATCTGATTGCAAATATTGTTCATGATGGCAAACCCAATGAAGGGTACTACAGGGT
ATTCGTGCAGAGGAAGTCTGAAGAATTATGGTATGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCAGATATATG
AACGGCAGCAATAGAGATAGGAAGTTCAAATCGTCTAACTGCGTAAATTAGTTTTGGTTATTTCCTCTCTAGAAGATATGTTCCTGTACTTAAACAGGAGATGAGAGTAG
GCAAGTCTTTGAAATCCGAGATATAAAGGAAGGTTCCTGCTACTGATCTCTTTGTTTAATTGTAGAATCAATTGTCAAATTGAACATACTTTGGAAACTTGGACTGAAAA
GGTCTGGACTAAGCTGCTACACACTCCTTTTAGTAAAGCAGGTGAAATTTTGTTGCCTAAATTGCTTCAGTTTTTTTGTAACATAACATTGATCATGATATGTTAGGGCT
GTAATTTTCATCATTCTAAATCTTCCTATGCAACGGCATAACTTCTGTGTGTTTTAGAGGATTTGTATTTGCAGAGATGTGATGTGAAGGCGTATCCTCGCCTTAGTAAT
TTTAAAATATCTACTGAATTTTTTATTGTTTGATACCATTCTGAATGTTGTTGACTTTACTAATGGGCAATTCGAAACTCAACATCTGCCAGTTTACGACTTCATTATCA
TAGAAATATAAATGTGTAAATGGGAG
Protein sequenceShow/hide protein sequence
MGSKRRNNSLLDEKELGPDLKRHKLLGEVSPSSSPPASENPRLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDHDDDANHVKRSRDVEVRKDCPYLDTVNR
QVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKDQVEQLDKNKQWSRALDGSDYLPG
MVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKASKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKS
SSIIYECFQGELEVVKEIHSKALAEKKENGDDQEAGAEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPHIARMRYRVTRLPQ
YLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKENEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ