; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G011050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G011050
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionU4/U6.U5 tri-snRNP-associated protein 2-like
Genome locationchr02:12657220..12672619
RNA-Seq ExpressionLsi02G011050
SyntenyLsi02G011050
Gene Ontology termsGO:0000245 - spliceosomal complex assembly (biological process)
GO:0016579 - protein deubiquitination (biological process)
GO:0005681 - spliceosomal complex (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001394 - Peptidase C19, ubiquitin carboxyl-terminal hydrolase
IPR001607 - Zinc finger, UBP-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR028889 - Ubiquitin specific protease domain
IPR033809 - USP39
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583754.1 U4/U6.U5 tri-snRNP-associated protein 2, partial [Cucurbita argyrosperma subsp. sororia]1.9e-30695.5Show/hide
Query:  YQREKRVDRMGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSR
        +QR KR DRMGSKR+N+S++DEEELGPDLKRHK LGE SP SSPPASENPQLPGFNYGDDDEEEDYK KQNGS YDGDEGDG DDEE+DED+ NH+ RSR
Subjt:  YQREKRVDRMGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSR

Query:  DVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF
        DVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF
Subjt:  DVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF

Query:  AKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVM
        AKEQVEQLDK KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVM
Subjt:  AKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVM

Query:  KTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLF
        K SKKRFRIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDA TEGSSVIMETSRMPFLMLGLDLPPPPLF
Subjt:  KTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLF

Query:  KDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLI
        KDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE EKLRSKYDLI
Subjt:  KDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLI

Query:  ANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        ANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  ANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_008463627.1 PREDICTED: U4/U6.U5 tri-snRNP-associated protein 2-like [Cucumis melo]1.2e-30596Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE---DDANHVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEED+KFKQNGS+YD DEGD NDDEEDDE   D  N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE---DDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDA T+GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+ +KLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_011655023.1 U4/U6.U5 tri-snRNP-associated protein 2 [Cucumis sativus]5.5e-30696.18Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDED---DANHVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEED+KFKQNGS+YDGDEGD NDDEEDDE+   + N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDED---DANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDA TEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+ +KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894254.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X1 [Benincasa hispida]0.0e+0095.48Show/hide
Query:  IGVKLYIFRSFINLLYYQREKRVDRMGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGND
        I VK  IF  F+  +  +REKRVDRMGSKRRNNSLLDEEELGPDLKRHKLLGEVSP SSPPASENPQLPGFNYGDD+EEE+YKFKQNGSRYDGDEGD ND
Subjt:  IGVKLYIFRSFINLLYYQREKRVDRMGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGND

Query:  DEEDDE---DDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DEEDDE   DDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
Subjt:  DEEDDE---DDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIW
        YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIW
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIW

Query:  HARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIME
        HARNFKGQVSPHEFLQAVMK SKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDA TE SSV+ME
Subjt:  HARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIME

Query:  TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY
        TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY
Subjt:  TSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDY

Query:  IPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        IPLPTPK+ EKL SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  IPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

XP_038894256.1 U4/U6.U5 tri-snRNP-associated protein 2-like isoform X2 [Benincasa hispida]1.5e-30897.45Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE---DDANHVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSP SSPPASENPQLPGFNYGDD+EEE+YKFKQNGSRYDGDEGD NDDEEDDE   DDANHVKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE---DDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDA TE SSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+ EKL SKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

TrEMBL top hitse value%identityAlignment
A0A0A0LTF9 Uncharacterized protein2.7e-30696.18Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDED---DANHVKRSRDVEVRK
        MGSKRR+NSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEED+KFKQNGS+YDGDEGD NDDEEDDE+   + N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDED---DANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENG++QDA TEGSSV METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+ +KLRSKYDLIANIVHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A1S3CJP7 U4/U6.U5 tri-snRNP-associated protein 2-like5.9e-30696Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE---DDANHVKRSRDVEVRK
        MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEED+KFKQNGS+YD DEGD NDDEEDDE   D  N VKRSRDVEVRK
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE---DDANHVKRSRDVEVRK

Query:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
        DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE
Subjt:  DCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVE

Query:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR
        QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKR
Subjt:  QLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKR

Query:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK
        FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKAL EKKENGD+QDA T+GSSV+METSRMPFLMLGLDLPPPPLFKDVMEK
Subjt:  FRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEK

Query:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD
        NIIPQVPLFNILKKFDGETITEVVRP IARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPK+ +KLRSKYDLIAN+VHD
Subjt:  NIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD

Query:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        GKPNEG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1CND9 U4/U6.U5 tri-snRNP-associated protein 2-like2.4e-29994.35Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE--DDANHVKRSRDVEVRKD
        MGSKRR+++ +DEEEL P++KR KLLGE SPSS PPASENP+LPGFNYGDDDEEEDYKFKQNGSR  GD GD NDDEEDDE  DDANHVKRSRDVEVRKD
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDE--DDANHVKRSRDVEVRKD

Query:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ
        CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRF KEQVE 
Subjt:  CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQ

Query:  LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRF
        LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKRF
Subjt:  LDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRF

Query:  RIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN
        RIGAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDA +EGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN
Subjt:  RIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKN

Query:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDG
        IIPQVPLFNILKKFDGE ITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE EKLRSKYDLIANIVHDG
Subjt:  IIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDG

Query:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        KP+EG YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  KPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1EKT5 U4/U6.U5 tri-snRNP-associated protein 2-like2.8e-30395.98Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCP
        MGSKR+N+S++DEEELGPDLKRHK LGE SP SSPPASENPQLPGFNYGDDDEEEDYK KQNGS YDGDEGDG DDEE+DED+ NH+ RSRDVEVRKDCP
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCP

Query:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLD
        YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLD
Subjt:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLD

Query:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRI
        K KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKRFRI
Subjt:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRI

Query:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNII
        GAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDA TEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNII
Subjt:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNII

Query:  PQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKP
        PQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE EKLRSKYDLIANIVHDGKP
Subjt:  PQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKP

Query:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

A0A6J1KIZ3 U4/U6.U5 tri-snRNP-associated protein 2-like4.0e-30295.61Show/hide
Query:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCP
        MGSKR+N+S++DEEELGPDLKRHK LGE+SP SSPPASENPQLPGFNYGDDDEEEDYK KQNGS YDGDEGD  DDEE+DED+ NH+ RSRDVEVRKDCP
Subjt:  MGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCP

Query:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLD
        YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLD
Subjt:  YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLD

Query:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRI
        K KQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIP+NYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKRFRI
Subjt:  KNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRI

Query:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNII
        GAQSDPVEFMSWFLNTLHSELR++KKSSSIIYECFQGELEVVKEIHSKAL EKKENGDDQDA TEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNII
Subjt:  GAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNII

Query:  PQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKP
        PQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKE EKLRSKYDLIANIVHDGKP
Subjt:  PQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKP

Query:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
Subjt:  NEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

SwissProt top hitse value%identityAlignment
P43589 Pre-mRNA-splicing factor SAD11.1e-4328.97Show/hide
Query:  EGDGNDDEEDDEDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCL
        E D      +DE     VK+ +  E   +  YL+TV R+ LDFD EK C ++LS LNVY CLVCG YYQGR +KS A+ HS++  HHV++NL + K Y L
Subjt:  EGDGNDDEEDDEDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCL

Query:  PDGYEINDPS----LDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFG
        P   +I        L+ I++   P +  + +E   +       L    YL G +G  N    D+ +  +  +  + P+R+ FL+  N+   +   + R  
Subjt:  PDGYEINDPS----LDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDA
           +KIW  + FK  +S  +F+      S  + R G   +P++   F+ W  N + S     K   SI+    +G++++ K + +K  A +   G     
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVE---FMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDA

Query:  VTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNF
              VI++    PF +L LDLP    F+D    + +PQ+ +  +L KF         R       + +TRLPQ+LI H  RF +N+          + 
Subjt:  VTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNF

Query:  PVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        PVKN   ++   +    E E L  KY L AN+VH        DG    G    ++   +     E W E+  ++ +E   +++ L E ++Q++E+Q+
Subjt:  PVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVH--------DGKPNEG----YYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

Q3TIX9 U4/U6.U5 tri-snRNP-associated protein 21.4e-13452.97Show/hide
Query:  DEEDDEDDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + +     K  R D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H    AE+KE    N +
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD

Query:  DQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNP
         Q+ + E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNP
Subjt:  DQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNP

Query:  TLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        T+VNFP+ N++L++Y  L    +     + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  TLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q53GS9 U4/U6.U5 tri-snRNP-associated protein 21.4e-13452.97Show/hide
Query:  DEEDDEDDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + +     K  R D E R+   CPYLDT+NR VLDFDFEK CS+SLS++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H    AE+KE    N +
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD

Query:  DQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNP
         Q+ + E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNP
Subjt:  DQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNP

Query:  TLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        T+VNFP+ N++L++Y  L    +     + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  TLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q5R761 U4/U6.U5 tri-snRNP-associated protein 26.9e-13452.76Show/hide
Query:  DEEDDEDDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG
        DE+ + +     K  R D E R+   CPYLDT+NR VLDFDFEK CS+S S++N YACLVCGKY+QGRG KSHAY HS++  HHV++NL T K YCLPD 
Subjt:  DEEDDEDDANHVKRSR-DVEVRKD--CPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDG

Query:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG
        YEI D SL+DI YVL P F K+Q+  LDK  + SRA DG+ YLPG+VGLNNIK  D+ N  +Q+L  V PLRN+FL  +NY++ + P       LV RFG
Subjt:  YEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSP-------LVHRFG

Query:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD
        EL RK+W+ RNFK  VSPHE LQAV+  SKK F+I  Q D V+F+SWFLN LHS L  T KK  +I+ + FQG + +   K  H    AE+KE    N +
Subjt:  ELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRIT-KKSSSIIYECFQGELEVV--KEIHSKALAEKKE----NGD

Query:  DQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNP
         Q+ + E +          F+ L LDLP  PL+KD  E+ IIPQVPLFNIL KF+G T  E    +   + R+++T+LP YLI  ++RFTKNNFFVEKNP
Subjt:  DQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARM-RYRVTRLPQYLILHMRRFTKNNFFVEKNP

Query:  TLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ
        T+VNFP+ N++L++Y+        E   + YDLIANIVHDGKP+EG YR+ V       WYE+QDL V++ LPQM+ LSEAY+QI++R+
Subjt:  TLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQ

Q9USR2 Probable mRNA-splicing protein ubp102.0e-10142.69Show/hide
Query:  EEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAY
        EED     NG R   + G    D ED  D A     S+++E  +  P      YLDT+NR++LDFDFEK CSVSL+NL+VYACLVCG+Y+QGRG  SHAY
Subjt:  EEDYKFKQNGSRYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCP------YLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAY

Query:  THSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFF
         H+L   HHV++N  T K Y LP+ Y++   +L DI YV+ P F K +V++LD   Q S  L    Y+PG VG+NNIK  D+ NV I  L  V P RN+F
Subjt:  THSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFF

Query:  LIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVK
        L+ +N+ +C   LV R   L RK+W+ + FK  VSP E +Q V   S K++ I  Q DPVEF+SWFLNTLH+ L   K    K +SI++  FQG +    
Subjt:  LIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITK----KSSSIIYECFQGELEVVK

Query:  EIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHM
         I S+ + +  E G  +  V  G  VI +T+ +PFL L LDLPP P+F+D  E NIIPQV L  IL K++G    E+      R R+ +   P Y I H+
Subjt:  EIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHM

Query:  RRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAY
        +RF KNN+F E+N T+V FP+ + ++  +I     +   K+ +KY+L+ANI+H+     +     +R+ ++  S   WY++QDL+V E    M+ L E++
Subjt:  RRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHD----GKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAY

Query:  MQIYER
        +Q++ER
Subjt:  MQIYER

Arabidopsis top hitse value%identityAlignment
AT4G22285.1 Ubiquitin C-terminal hydrolases superfamily protein1.5e-22171.53Show/hide
Query:  GSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN----------DDEEDDEDDANH
        G +   N + +EE    ++KR +++ E S S  PP    NP LP  N Y DDDEEE+ + K++ +R +G    EG+GN          DD+EDD+     
Subjt:  GSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN----------DDEEDDEDDANH

Query:  VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV
         K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+V
Subjt:  VKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYV

Query:  LNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF
        LNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPLVHRFGELTRKIWHARNFKGQVSPHEF
Subjt:  LNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEF

Query:  LQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLP
        LQAVMK SKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  EN               E SRM FLMLGLDLP
Subjt:  LQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLP

Query:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKEKEKLR
        PPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + 
Subjt:  PPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKEKEKLR

Query:  SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+++
Subjt:  SKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.1 Ubiquitin C-terminal hydrolases superfamily protein7.9e-21877.08Show/hide
Query:  KFKQNGSRYDGDEGDGNDDEEDDEDDAN--HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGH
        K + NG++  G+     DD+EDD+DDA+    K SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGH
Subjt:  KFKQNGSRYDGDEGDGNDDEEDDEDDAN--HVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGH

Query:  HVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQH
        HVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQH
Subjt:  HVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQH

Query:  CRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKE
        C+SPL HRFGELTRKIWHARNFKGQVSPHEFLQAVMK SKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  E
Subjt:  CRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKE

Query:  NGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEK
        N               E SRMPFLMLGLDLPPPPLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EK
Subjt:  NGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEK

Query:  NPTLVNFPVKNLELKDYIP-LPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        NPTLVNFPVK++EL+DYIP LP   E E + SKY+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  NPTLVNFPVKNLELKDYIP-LPTPKEKEKLRSKYDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22350.2 Ubiquitin C-terminal hydrolases superfamily protein9.9e-22171.43Show/hide
Query:  GSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGN----------DDEEDDEDDAN--HVK
        G +   N + +EE    ++KR +++ E S S  PP    NP LP  N  DDD  +  K +   +     EG+GN          DD+EDD+DDA+    K
Subjt:  GSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFNYGDDDEEEDYKFKQNGSRYDGDEGDGN----------DDEEDDEDDAN--HVK

Query:  RSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN
         SR VEVR+DCPYLDTVNRQVLDFDFE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLN
Subjt:  RSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLN

Query:  PRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ
        PRF++ QV +LDKN+QWSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFFLIPENYQHC+SPL HRFGELTRKIWHARNFKGQVSPHEFLQ
Subjt:  PRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQ

Query:  AVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPP
        AVMK SKKRFRIG QSDPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  EN               E SRMPFLMLGLDLPPP
Subjt:  AVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPP

Query:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKEKEKLRSK
        PLFKDVMEKNIIPQV LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTLVNFPVK++EL+DYIP LP   E E + SK
Subjt:  PLFKDVMEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIP-LPTPKEKEKLRSK

Query:  YDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ
        Y+LIANIVHDGKP +GY+RVFVQRKS+ELWYEMQDLHV+ETLPQMV LSEAYMQIYE+Q+
Subjt:  YDLIANIVHDGKPNEGYYRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ

AT4G22410.1 Ubiquitin C-terminal hydrolases superfamily protein3.6e-16278.87Show/hide
Query:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQ
        V  QVLDF FE+FCSVSLSNLNVYACLVCGKY+QGR +KSHAYTHSLEAGHHVYINL TEKVYCLPD YEINDPSLDDIR+VLNPRF++ QV +LDKN+Q
Subjt:  VNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYEINDPSLDDIRYVLNPRFAKEQVEQLDKNKQ

Query:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQS
        WSRALDGSDYLPGMVGLNNI++T+FVNVTIQSLMRVTPLRNFF IPENYQHC+SPLVH FGELTRKIWHARNFKGQVSPHEFLQAVMK SKKRFRIG QS
Subjt:  WSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPHEFLQAVMKTSKKRFRIGAQS

Query:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP
        DPVEFMSW LNTLH +LR +K +SSII++CFQGELEVVKE       +  EN               E SRM FLMLGLDLPPPPLFKDVMEKNIIPQV 
Subjt:  DPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDVMEKNIIPQVP

Query:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL
        LF++LKKFDGET+TEVVRP++ARMRYRV + P+YL+ HM RF KNNFF EKNPTL
Subjt:  LFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTL

AT4G22420.1 Ubiquitin-specific protease family C19-related protein1.4e-0946.72Show/hide
Query:  SLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN----------DDEEDDEDDAN--HVKRSR
        SL+D  E        K + E S S  PP    N  LP  N Y DDDEEE  + K++ +R +G    EG+GN          DDEEDD+DDA+    K SR
Subjt:  SLLDEEELGPDLKRHKLLGEVSPSSSPPAS-ENPQLPGFN-YGDDDEEEDYKFKQNGSRYDG---DEGDGN----------DDEEDDEDDAN--HVKRSR

Query:  DVEVRKDCPYLDTVNRQVLDFD
         VEVR+DCPYLDTVNRQV+  D
Subjt:  DVEVRKDCPYLDTVNRQVLDFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGGTTGCCGGAATTGTAGTCCAGCAAGCAGTTGTGTTTGGTTGCTGCCCAGCAAGGATATTGGGGTGAAGTTATATATATTCAGGTCATTTATAAATTTGCTCTA
TTACCAGAGGGAAAAAAGAGTTGATAGAATGGGATCGAAGAGGCGGAACAATAGTCTTCTAGATGAGGAAGAGTTGGGTCCAGACTTAAAGAGGCATAAATTACTTGGGG
AGGTGTCACCTTCTTCTTCTCCACCTGCCTCAGAGAATCCTCAGCTTCCTGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTACAAATTTAAACAAAATGGAAGT
AGATATGATGGAGATGAAGGGGATGGCAATGATGACGAAGAAGATGATGAAGATGATGCAAATCATGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTA
TCTTGATACTGTAAATCGTCAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTATCAAATCTAAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATC
AAGGGAGGGGGAAGAAGTCCCATGCCTACACTCATAGTCTTGAAGCGGGACACCATGTGTATATCAATCTTCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAG
ATTAATGACCCCTCGTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAGAGCAACTTGACAAGAACAAGCAATGGTCTAGGGCACTCGATGG
TTCTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAAGAAACCGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTTC
TAATACCTGAGAACTATCAGCACTGCAGATCTCCACTTGTTCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCAT
GAGTTCCTGCAAGCAGTCATGAAGACTAGTAAAAAGCGTTTTCGAATAGGTGCACAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACT
GCGAATTACAAAGAAAAGTAGTAGTATAATCTACGAGTGTTTCCAGGGGGAATTGGAGGTTGTCAAAGAGATTCACTCGAAAGCTCTCGCTGAGAAGAAAGAAAATGGTG
ATGATCAGGATGCTGTAACTGAAGGTAGCAGCGTGATAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATTTACCGCCGCCACCTCTTTTCAAAGATGTT
ATGGAGAAAAATATAATACCGCAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGAATGCGCTACCG
GGTTACTCGATTGCCTCAGTACTTAATTCTTCACATGCGGCGATTTACAAAAAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAACTTTCCCGTGAAGAATCTTG
AATTGAAGGATTACATCCCCTTGCCAACACCTAAAGAGAAGGAAAAATTGCGTTCAAAGTACGATCTGATTGCAAATATTGTTCATGATGGCAAACCCAACGAAGGGTAC
TACAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCA
GATATATGAACGGCAGCAATAG
mRNA sequenceShow/hide mRNA sequence
CGAATTTCCCCTCATCTTCTTCTTTCTTCTTCTCGTGAAAAGACCCACCCGCTTCATCCCTCCATCTCTCTCTTGCACGTTCATCGCACAGCCGGCCGACCATCGGCTTG
CCGCTTTACGCCTCCGTCGGCATCACCAGCCACGCCGCGCCGTCGCTCGTCACCGCCAGTCGCCGGTAGCGCCGCCGCTTGCTAGTCATCAATACACCACTGCCACGGAT
GTACGGTTGCCGGAATTGTAGTCCAGCAAGCAGTTGTGTTTGGTTGCTGCCCAGCAAGGATATTGGGGTGAAGTTATATATATTCAGGTCATTTATAAATTTGCTCTATT
ACCAGAGGGAAAAAAGAGTTGATAGAATGGGATCGAAGAGGCGGAACAATAGTCTTCTAGATGAGGAAGAGTTGGGTCCAGACTTAAAGAGGCATAAATTACTTGGGGAG
GTGTCACCTTCTTCTTCTCCACCTGCCTCAGAGAATCCTCAGCTTCCTGGTTTTAACTATGGCGATGATGATGAAGAAGAAGATTACAAATTTAAACAAAATGGAAGTAG
ATATGATGGAGATGAAGGGGATGGCAATGATGACGAAGAAGATGATGAAGATGATGCAAATCATGTTAAGCGAAGTCGTGATGTTGAAGTTCGAAAAGATTGTCCTTATC
TTGATACTGTAAATCGTCAGGTTTTGGATTTTGATTTTGAGAAGTTTTGCTCTGTCTCTCTATCAAATCTAAATGTTTATGCCTGCCTGGTATGTGGTAAATACTATCAA
GGGAGGGGGAAGAAGTCCCATGCCTACACTCATAGTCTTGAAGCGGGACACCATGTGTATATCAATCTTCGGACAGAGAAAGTTTACTGCCTTCCCGATGGATATGAGAT
TAATGACCCCTCGTTAGATGATATTCGATATGTCCTGAATCCAAGGTTTGCCAAAGAGCAGGTAGAGCAACTTGACAAGAACAAGCAATGGTCTAGGGCACTCGATGGTT
CTGATTACCTTCCTGGAATGGTGGGGCTTAACAACATTAAAGAAACCGATTTTGTAAATGTGACAATTCAGTCCTTAATGAGGGTTACACCACTCAGGAACTTCTTTCTA
ATACCTGAGAACTATCAGCACTGCAGATCTCCACTTGTTCACCGGTTTGGTGAACTCACACGTAAGATTTGGCATGCAAGAAACTTCAAAGGACAGGTAAGCCCGCATGA
GTTCCTGCAAGCAGTCATGAAGACTAGTAAAAAGCGTTTTCGAATAGGTGCACAGTCAGATCCTGTTGAATTTATGTCATGGTTTCTTAACACACTTCATTCAGAACTGC
GAATTACAAAGAAAAGTAGTAGTATAATCTACGAGTGTTTCCAGGGGGAATTGGAGGTTGTCAAAGAGATTCACTCGAAAGCTCTCGCTGAGAAGAAAGAAAATGGTGAT
GATCAGGATGCTGTAACTGAAGGTAGCAGCGTGATAATGGAAACTTCAAGAATGCCATTCTTAATGCTTGGATTGGATTTACCGCCGCCACCTCTTTTCAAAGATGTTAT
GGAGAAAAATATAATACCGCAGGTTCCACTCTTCAACATTTTGAAAAAATTTGATGGTGAAACTATCACAGAAGTTGTACGGCCACGTATAGCAAGAATGCGCTACCGGG
TTACTCGATTGCCTCAGTACTTAATTCTTCACATGCGGCGATTTACAAAAAACAACTTTTTTGTGGAAAAGAATCCCACATTAGTGAACTTTCCCGTGAAGAATCTTGAA
TTGAAGGATTACATCCCCTTGCCAACACCTAAAGAGAAGGAAAAATTGCGTTCAAAGTACGATCTGATTGCAAATATTGTTCATGATGGCAAACCCAACGAAGGGTACTA
CAGGGTATTTGTACAGAGGAAGTCGGAAGAATTATGGTACGAGATGCAGGATCTTCATGTCTCAGAAACACTTCCTCAAATGGTTGCTCTCTCTGAGGCTTATATGCAGA
TATATGAACGGCAGCAATAGATAGGAAGTTCAATATCTACATCTAACTGCGTAATTAGTTTTGGTTATTTTCTCTCTAGAAGATATGTACCTGTACTTAAACAGGAGATG
AGAGTAGATAAAGCTTTGAAATCCGTGATATAAAGGAAGCTTGCTGCGACTAATCTCTTTGTTTAATTGTAGAATCAATGATCACATTGAACATACTTTGGAAACTTGGA
GTGAAAAGGGATGGACTATGCTGCTACGCACTCCTTTCAGTAAAGCAGGTGAAATTTTGTTGCCAATATTAATATTCAGTTTTCTGTAACATAACATTGATTATCATATG
TTAGGGCTGTAATTTTCATCATTCTAAATATTTCTATGCAACGGGATAATTTTTGTGTGTTTAGAGGATTTGTATTTGCAGAGATTTGATGTGAAGGCATAACTTCGTCT
TAGTAATTTTGAATATCTACTGAATTCTAGTGTTTGAACCATTATAATGGTTGTTTCCTTTACTAATGGGCAATTCGAAACTTGTTAAGTTGATGATCCGCATAGTTTAC
CACTTCATTATCATAGTAAATACATGTATAATATTTGAGTGTCAAAGAAACTAGGAGGATATTAAATCCTAAATTAGG
Protein sequenceShow/hide protein sequence
MYGCRNCSPASSCVWLLPSKDIGVKLYIFRSFINLLYYQREKRVDRMGSKRRNNSLLDEEELGPDLKRHKLLGEVSPSSSPPASENPQLPGFNYGDDDEEEDYKFKQNGS
RYDGDEGDGNDDEEDDEDDANHVKRSRDVEVRKDCPYLDTVNRQVLDFDFEKFCSVSLSNLNVYACLVCGKYYQGRGKKSHAYTHSLEAGHHVYINLRTEKVYCLPDGYE
INDPSLDDIRYVLNPRFAKEQVEQLDKNKQWSRALDGSDYLPGMVGLNNIKETDFVNVTIQSLMRVTPLRNFFLIPENYQHCRSPLVHRFGELTRKIWHARNFKGQVSPH
EFLQAVMKTSKKRFRIGAQSDPVEFMSWFLNTLHSELRITKKSSSIIYECFQGELEVVKEIHSKALAEKKENGDDQDAVTEGSSVIMETSRMPFLMLGLDLPPPPLFKDV
MEKNIIPQVPLFNILKKFDGETITEVVRPRIARMRYRVTRLPQYLILHMRRFTKNNFFVEKNPTLVNFPVKNLELKDYIPLPTPKEKEKLRSKYDLIANIVHDGKPNEGY
YRVFVQRKSEELWYEMQDLHVSETLPQMVALSEAYMQIYERQQ