; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g210330 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g210330
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionSplicing factor-like protein
Genome locationCsor_Chr04:11280757..11290684
RNA-Seq ExpressionCsor.00g210330
SyntenyCsor.00g210330
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR000504 - RNA recognition motif domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR025640 - GYF domain 2
IPR034392 - TatSF1-like, RNA recognition motif 1
IPR034393 - TatSF1-like
IPR035979 - RNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601707.1 HIV Tat-specific factor 1-like protein, partial [Cucurbita argyrosperma subsp. sororia]0.0100Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPANDDDDELGKYQKGVGE
        MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPANDDDDELGKYQKGVGE
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPANDDDDELGKYQKGVGE

Query:  VATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSVPS
        VATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSVPS
Subjt:  VATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSVPS

Query:  TSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRET
        TSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRET
Subjt:  TSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRET

Query:  MKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAKISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVK
        MKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAKISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVK
Subjt:  MKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAKISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVK

Query:  LSVNDLHRFGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEAH
        LSVNDLHRFGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEAH
Subjt:  LSVNDLHRFGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEAH

XP_038882573.1 splicing factor U2AF-associated protein 2 isoform X1 [Benincasa hispida]8.23e-27079.49Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELR-------EHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDEL
        MDNNSVGNSEMVT+AGWYILGE+QQHVGPYAFSELR       EHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT K++EQ+S+ S AV AN+  DDDEL
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELR-------EHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDEL

Query:  GKYQKGVGEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTS
         KYQK VGE  AT EVS+PSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPE+MTFMQEEEVFPQLEADAPCTS
Subjt:  GKYQKGVGEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTS

Query:  IKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKP
        IK EGDSVPSTSIKEED H TKEA  KSE+I TK+NGKRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKP
Subjt:  IKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKP

Query:  RVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVIL
        RVKLYVDRET KKKGDALVSYLKEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDA++SIPATVIL
Subjt:  RVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVIL

Query:  RFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQ
        RFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH +VRDLEADAARLEQ
Subjt:  RFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQ

Query:  FGSELEA
        FGSELEA
Subjt:  FGSELEA

XP_038882574.1 splicing factor U2AF-associated protein 2 isoform X2 [Benincasa hispida]6.84e-27079.49Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELR-------EHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDEL
        MDNNSVGNSEMVT+AGWYILGE+QQHVGPYAFSELR       EHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT K++EQ+S+ S AV AN+  DDDEL
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELR-------EHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDEL

Query:  GKYQKGVGEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTS
         KYQK VGE  AT EVS+PSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPE+MTFMQEEEVFPQLEADAPCTS
Subjt:  GKYQKGVGEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTS

Query:  IKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKP
        IK EGDSVPSTSIKEED H TKEA  KSE+I TK+NGKRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKP
Subjt:  IKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKP

Query:  RVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVIL
        RVKLYVDRET KKKGDALVSYLKEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDA++SIPATVIL
Subjt:  RVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVIL

Query:  RFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQ
        RFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH +VRDLEADAARLEQ
Subjt:  RFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQ

Query:  FGSELEA
        FGSELEA
Subjt:  FGSELEA

XP_038882575.1 splicing factor U2AF-associated protein 2 isoform X3 [Benincasa hispida]1.16e-27280.6Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDELGKYQKGV
        MDNNSVGNSEMVT+AGWYILGE+QQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT K++EQ+S+ S AV AN+  DDDEL KYQK V
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDELGKYQKGV

Query:  GEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS
        GE  AT EVS+PSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPE+MTFMQEEEVFPQLEADAPCTSIK EGDS
Subjt:  GEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS

Query:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
        VPSTSIKEED H TKEA  KSE+I TK+NGKRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
Subjt:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD

Query:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA
        RET KKKGDALVSYLKEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDA++SIPATVILRFMFTPA
Subjt:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA

Query:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        EMRADENLASEIETDVKEESTRFGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH +VRDLEADAARLEQFGSELEA
Subjt:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

XP_038882576.1 splicing factor U2AF-associated protein 2 isoform X4 [Benincasa hispida]9.61e-27380.6Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDELGKYQKGV
        MDNNSVGNSEMVT+AGWYILGE+QQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT K++EQ+S+ S AV AN+  DDDEL KYQK V
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND--DDDELGKYQKGV

Query:  GEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS
        GE  AT EVS+PSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPE+MTFMQEEEVFPQLEADAPCTSIK EGDS
Subjt:  GEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS

Query:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
        VPSTSIKEED H TKEA  KSE+I TK+NGKRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
Subjt:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD

Query:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA
        RET KKKGDALVSYLKEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDA++SIPATVILRFMFTPA
Subjt:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA

Query:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        EMRADENLASEIETDVKEESTRFGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH +VRDLEADAARLEQFGSELEA
Subjt:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

TrEMBL top hitse value%identityAlignment
A0A0A0KZ32 Uncharacterized protein2.45e-26177.35Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND-DDDELGKYQKGVG
        MDNNSVGN EMVT+AGWYILGE+QQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT++++ Q+SN    VPAN+ DDDEL KYQK VG
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND-DDDELGKYQKGVG

Query:  EV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSV
        E  AT +VS+PSG RNFG+VEGDL+RPTTPPEGEEEFTDDDGT YKWDR LRAWVPQDDAFFKHEQY PE+MTFMQEEEVFPQL+ADAPCTSIK EGDSV
Subjt:  EV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSV

Query:  PSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR
        PSTSI  E DH TKE   KSEE  TK+N KRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR
Subjt:  PSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR

Query:  ETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPAE
        ET KKKGDALVSY+KEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDAK+SIPATVILRFMFTPAE
Subjt:  ETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPAE

Query:  MRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        MRADENLASEIETDVKEEST+FGPVDSVK+  N              D  +         FGG+QIHASE+DGLVNH +VRDLEADAARLEQFGSELEA
Subjt:  MRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

A0A1S3B523 splicing factor U2AF-associated protein 24.79e-27079.76Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND-DDDELGKYQKGVG
        MDNNSVGNSEMVT+AGWYILGE+QQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT+K++EQ+SN S AVPAN+ +DDEL KYQK VG
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND-DDDELGKYQKGVG

Query:  EV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSV
        E  ATAEVS PSGSRNFGMVEGDL+RPTTPPEGEEEFTDDDGTTYKWDR LRAWVPQDDAFFKHEQY PEDMTFMQEEEVFPQLEADAPCTSIK EGDSV
Subjt:  EV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSV

Query:  PSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR
        PSTS+  E DH TKE   KSEEI TK+N KRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR
Subjt:  PSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR

Query:  ETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPAE
        ET KKKGDALVSY+KEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDAK+SIPATVILRFMFTPAE
Subjt:  ETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPAE

Query:  MRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        MRADENLASEIETDVKEEST+FGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH +VRDLEADAARLEQFGSELEA
Subjt:  MRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

A0A5D3BUE8 Splicing factor U2AF-associated protein 23.01e-26979.76Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND-DDDELGKYQKGVG
        MDNNSVGNSEMVT+AGWYILGE+QQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLT+K++EQ+SN S AVPAN+ +DDEL KYQK VG
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPAND-DDDELGKYQKGVG

Query:  EV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSV
        E  ATAEVS PSGSRNFGMVEGDL+RPTTPPEGEEEFTDDDGTTYKWDR LRAWVPQDDAFFKHEQY PEDMTFMQEEEVFPQLEADAPCTSIK EGDSV
Subjt:  EV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSV

Query:  PSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR
        PSTS+  E DH TKE   KSEEI TK+N KRKL  NQVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR
Subjt:  PSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDR

Query:  ETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPAE
        ET KKKGDALVSY+KEPSV+L MQILDGTPLRPGGK+LMSV+QAKFEQKG                           GRDDAK+SIPATVILRFMFTPAE
Subjt:  ETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPAE

Query:  MRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        MRADENLASEIETDVKEEST+FGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH +VRDLEADAARLEQFGSELEA
Subjt:  MRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

A0A6J1DXR6 splicing factor U2AF-associated protein 24.63e-25776.2Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPA--NDDDDELGKYQKGV
        MD  SVGNSEMV +AGWYILGEDQQHVGPYAFSELREH+LNGYLLESTLAWSEGQSEW+PLSSIPGLT+K F+QE NS   VPA  N DDDEL KYQK V
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPA--NDDDDELGKYQKGV

Query:  GEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS
         E  ATAE  +PSGS+NFG VEGD DRPTTPPEGEEEFTDDDGT YKWDR LRAWVPQDDAFFKHEQYGPE+MTFMQEEEVFP L+ DA CT IK   DS
Subjt:  GEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS

Query:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
        VPSTSIKE+ D  TKEA  KSE+I TKQNGKRKLCD QVEKKEANKG DGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
Subjt:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD

Query:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA
        RET KKKGDALV+YLKEPSV+L +QILDGT LRPGGKILMSV+QAKFEQKG                           GRDDAK+SIPATVILR+MFTPA
Subjt:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA

Query:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        E+RADENL+SEIETDVKEES RFGPVDSVK+  N              D  +         FGGRQIHASE+DG VNH LVRDLEADAARLEQFGSELEA
Subjt:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

A0A6J1G3U1 HIV Tat-specific factor 13.48e-25475.6Show/hide
Query:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPANDD--DDELGKYQKGV
        MDNN  GN E VT+AGWYILGEDQQHVGPYAFSELR+HFLNGYL+ESTL WSEGQSEW PLSSI GLT+K+F+QES+ S AVP N++  DDEL KYQK V
Subjt:  MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPANDD--DDELGKYQKGV

Query:  GEVATA-EVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS
        GE  T  E S+PS SRNF MVEGDLDRPTTPPEGEEEFTDDDGTTY+WDRALRAWVPQDDAFFKHEQYG E+MTF+QEEEVFPQL  D  CTSIK E DS
Subjt:  GEVATA-EVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDS

Query:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
        VPSTSIKEED   TKEA  KSE++ TKQNGKRKLCD QVEKKEANKG DGWFELKINTHVY+TGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD
Subjt:  VPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVD

Query:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA
        +ET K KGDALV+YLKEPSV+L MQILDGTPLRPGGKILMSV+QAKFEQKG                           GRDDAKISIPATV+LRFMFTPA
Subjt:  RETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKG---------------------------GRDDAKISIPATVILRFMFTPA

Query:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
        EMR DENLASEIE DVKEESTRFGPVDSVK+  N              D  +         FGGRQIHASE+DGLVNH  VRDLEADA RLEQFGSELEA
Subjt:  EMRADENLASEIETDVKEESTRFGPVDSVKLSVN--------------DLHR---------FGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

SwissProt top hitse value%identityAlignment
O43120 Splicing factor U2AF-associated protein 21.6e-1326.15Show/hide
Query:  DMTFMQEEEVFPQLEAD------APCTSIKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGL
        ++ ++ EE+ +   + +      A  T  +    +  +T  KE  +   +  KR  E   T   G      N+  K E ++ S       IN  VY+ GL
Subjt:  DMTFMQEEEVFPQLEAD------APCTSIKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGL

Query:  PEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQK---------GGR---
        P DVT+DE+ EVF KCG+I ++ +   PR+K+Y   E    KGDAL+ + +  SV L  Q+ D T  R G    M V +A  + K         GG    
Subjt:  PEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQK---------GGR---

Query:  ------------------------DDAKISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVD-------------SVKLSVNDL-------
                                D  K      V+L+ +FT  E+     L  +++ D+ EE+ + G V              +V+ S N+        
Subjt:  ------------------------DDAKISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVD-------------SVKLSVNDL-------

Query:  ---HRFGGRQIHASEEDGLV-----NHGLVRDLEADAARLEQFGSELE
             F GR + AS  DG V         + D E +  RLE+F   LE
Subjt:  ---HRFGGRQIHASEEDGLV-----NHGLVRDLEADAARLEQFGSELE

O43719 HIV Tat-specific factor 15.0e-2026.81Show/hide
Query:  SSAAVPANDDDDELGKYQKGVGEVATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEE
        S   +  ND+ DE  + Q+  G+    +    +G                P    ++ TD   T Y+WD   +AW                         
Subjt:  SSAAVPANDDDDELGKYQKGVGEVATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEE

Query:  VFPQLEADAPCTSIKGEG---DSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELK--INTHVYVTGLPEDVTIDEVV
         FP++  D   T     G   D   S++   ED HA + A+   +E   +    RK       K E  K   GWF ++   NT+VYV+GLP D+T+DE +
Subjt:  VFPQLEADAPCTSIKGEG---DSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELK--INTHVYVTGLPEDVTIDEVV

Query:  EVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAK------------------
        ++ SK GII  DP+T++ +VKLY D +    KGD L  YLK  SV L +++LD   +R G K+ + V  AKF+ KG  D +K                  
Subjt:  EVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAK------------------

Query:  -------------ISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKL---------SVN--------------DLHRFGGRQIHASE
                     +     VI++ MF P +   D  + +EI  D++ E ++FG +  + L         SV+              D   FGGRQI A  
Subjt:  -------------ISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKL---------SVN--------------DLHRFGGRQIHASE

Query:  EDGLVNHGLVRDLEADAARLEQFGSELEA
         DG  ++ +         RL  + + L A
Subjt:  EDGLVNHGLVRDLEADAARLEQFGSELEA

Q5RB63 HIV Tat-specific factor 1 homolog5.0e-2026.81Show/hide
Query:  SSAAVPANDDDDELGKYQKGVGEVATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEE
        S   +  ND+ DE  + Q+  G+    +    +G                P    ++ TD   T Y+WD   +AW                         
Subjt:  SSAAVPANDDDDELGKYQKGVGEVATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEE

Query:  VFPQLEADAPCTSIKGEG---DSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELK--INTHVYVTGLPEDVTIDEVV
         FP++  D   T     G   D   S++   ED HA + A+   +E   +    RK       K E  K   GWF ++   NT+VYV+GLP D+T+DE +
Subjt:  VFPQLEADAPCTSIKGEG---DSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELK--INTHVYVTGLPEDVTIDEVV

Query:  EVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAK------------------
        ++ SK GII  DP+T++ +VKLY D +    KGD L  YLK  SV L +++LD   +R G K+ + V  AKF+ KG  D +K                  
Subjt:  EVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAK------------------

Query:  -------------ISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKL---------SVN--------------DLHRFGGRQIHASE
                     +     VI++ MF P +   D  + +EI  D++ E ++FG +  + L         SV+              D   FGGRQI A  
Subjt:  -------------ISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKL---------SVN--------------DLHRFGGRQIHASE

Query:  EDGLVNHGLVRDLEADAARLEQFGSELEA
         DG  ++ +         RL  + + L A
Subjt:  EDGLVNHGLVRDLEADAARLEQFGSELEA

Q61545 RNA-binding protein EWS1.1e-1136.67Show/hide
Query:  NTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAK
        N+ +YV GL ++VT+D++ + F +CG++K +  T +P + +Y+D+ET K KGDA VSY   P+    ++  DG   + G K+ +S+++ K
Subjt:  NTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAK

Q8BGC0 HIV Tat-specific factor 1 homolog1.3e-2028.17Show/hide
Query:  EGDLDRPTTPPEGEEEFTDD--DGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSVPSTSIKEEDDHATKEAKR
        EGD       P GE        D T Y+WD   +AW P+                    E+     +A+   +S   +G S  + ++++ +  A +E  +
Subjt:  EGDLDRPTTPPEGEEEFTDD--DGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSVPSTSIKEEDDHATKEAKR

Query:  KSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELK--INTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKE
        K  E+    + KR        K E  K   GWF ++   NT+VYV+GLP D+T+DE +++ SK GII  DP+T++ +VKLY D +    KGD L  YLK+
Subjt:  KSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELK--INTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKE

Query:  PSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAK-------------------------------ISIPATVILRFMFTPAEMRADENLASEIE
         SV L +++LD   +R G K+ + V  AKF+ KG  D +K                               +     VIL+ MF P +   D  + +EI 
Subjt:  PSVSLPMQILDGTPLRPGGKILMSVSQAKFEQKGGRDDAK-------------------------------ISIPATVILRFMFTPAEMRADENLASEIE

Query:  TDVKEESTRFGPVDSVKL---------SVN--------------DLHRFGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA
         D++ E ++FG +  + L         SV+              D   FGGRQI A   DG  ++ +         RL  + + L A
Subjt:  TDVKEESTRFGPVDSVKL---------SVN--------------DLHRFGGRQIHASEEDGLVNHGLVRDLEADAARLEQFGSELEA

Arabidopsis top hitse value%identityAlignment
AT1G50300.1 TBP-associated factor 152.1e-0539.39Show/hide
Query:  NTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSL
        N  VYV+ LP     + + + F   G++K D  T  P+V LY D+ET + KGDA V+Y ++P  +L
Subjt:  NTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSL

AT5G16260.1 RNA binding (RRM/RBD/RNP motifs) family protein1.8e-11848.45Show/hide
Query:  NSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQE--------------SNSSAAVPAND---
        +S G +   T  GWYILGE+QQ++GPY FSEL  HF NGYLLE+TL W++G+SEWQPLS+IP L S++   E              SN+       D   
Subjt:  NSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQE--------------SNSSAAVPAND---

Query:  ---DDDELGKYQKGVGEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDD-AFFKHEQYGPEDMTFMQEEEVFPQ
            +DE  K+Q+ + +  A AE           +VE D +R ++PPEGE+EFTDDDGT YKWDRA R WVPQDD      + YG E+MTF +E+EVFP 
Subjt:  ---DDDELGKYQKGVGEV-ATAEVSNPSGSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDD-AFFKHEQYGPEDMTFMQEEEVFPQ

Query:  LEADAPCTSIKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGII
        +  +   TS+  +  S    + K+E+D + + A+  S       NGKRKL + + EKKE NK  D WFELK+N H+YV GLP+DVTI+EV EVFSKCGII
Subjt:  LEADAPCTSIKGEGDSVPSTSIKEEDDHATKEAKRKSEEIVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGII

Query:  KEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQK---------------------------GGRDDAK
        KED +T KPR+KLY D+ T K KGDAL+SY+KEPSV L ++ILDG PLRP  K+LMSVS+AKFEQK                           GG DD+K
Subjt:  KEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLRPGGKILMSVSQAKFEQK---------------------------GGRDDAK

Query:  ISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVND-----LHRF------------------GGRQIHASEEDGLVNHGLVRDL
        +SIPATV+LR+MF+PAE+ ADE+L +E+E DVKEES + GP DSVK+  +      L RF                    RQIHAS +DG VNH  VRD 
Subjt:  ISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVND-----LHRF------------------GGRQIHASEEDGLVNHGLVRDL

Query:  EADAARLEQFGSELEA
        + +A RL+QF +ELEA
Subjt:  EADAARLEQFGSELEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAACAATTCTGTTGGGAACTCAGAAATGGTAACTAAAGCTGGATGGTATATTCTTGGTGAAGATCAGCAACATGTTGGTCCATATGCATTTTCTGAATTGCGTGA
GCATTTTCTAAATGGGTATCTCTTGGAAAGTACATTGGCCTGGTCTGAAGGACAAAGTGAATGGCAGCCATTATCTTCTATTCCTGGGTTGACATCGAAATTATTTGAAC
AAGAGTCCAATTCGTCAGCTGCAGTTCCCGCCAACGATGATGATGATGAGCTTGGGAAATATCAGAAGGGAGTTGGAGAAGTAGCCACAGCTGAAGTGTCTAATCCTTCT
GGTAGTAGGAATTTTGGTATGGTGGAAGGTGATCTGGACAGACCTACTACACCACCAGAGGGAGAGGAGGAATTCACCGATGATGATGGGACTACATACAAGTGGGACAG
GGCTCTCAGGGCCTGGGTACCTCAGGATGATGCATTTTTTAAACATGAGCAATATGGGCCTGAAGACATGACTTTCATGCAAGAAGAAGAAGTCTTTCCACAGCTTGAAG
CCGATGCTCCTTGCACGTCCATCAAGGGAGAAGGTGATTCTGTTCCTTCCACCTCCATCAAGGAAGAAGATGATCATGCCACAAAAGAAGCTAAGAGAAAAAGTGAGGAG
ATTGTAACCAAACAGAATGGGAAGAGAAAATTGTGTGACAATCAAGTTGAGAAAAAGGAAGCAAATAAAGGTTCTGATGGTTGGTTTGAACTAAAAATAAATACGCATGT
TTACGTAACAGGGCTGCCAGAGGATGTTACTATTGATGAAGTTGTGGAAGTTTTTTCAAAATGTGGAATAATCAAGGAGGATCCTGAAACCAAAAAACCCCGTGTTAAGT
TGTATGTTGATAGAGAGACAATGAAGAAGAAGGGTGATGCTTTAGTCTCCTATCTGAAGGAACCCTCTGTTTCTTTGCCTATGCAAATTTTGGATGGCACACCTCTTCGA
CCAGGTGGAAAAATTCTTATGTCTGTTAGTCAAGCTAAGTTTGAGCAAAAAGGCGGCCGAGACGATGCAAAAATTTCAATTCCTGCAACTGTCATTCTCCGTTTTATGTT
CACACCTGCTGAAATGAGGGCCGATGAAAATTTAGCTTCAGAAATAGAAACAGATGTCAAGGAGGAAAGCACAAGGTTTGGTCCTGTGGATTCAGTTAAGCTTAGTGTTA
ATGATTTGCACAGGTTTGGTGGAAGACAAATTCATGCAAGCGAGGAGGATGGTTTAGTGAACCATGGTTTGGTGAGGGATCTTGAAGCCGATGCTGCTCGTTTGGAGCAG
TTTGGCTCTGAACTTGAGGCTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATAACAATTCTGTTGGGAACTCAGAAATGGTAACTAAAGCTGGATGGTATATTCTTGGTGAAGATCAGCAACATGTTGGTCCATATGCATTTTCTGAATTGCGTGA
GCATTTTCTAAATGGGTATCTCTTGGAAAGTACATTGGCCTGGTCTGAAGGACAAAGTGAATGGCAGCCATTATCTTCTATTCCTGGGTTGACATCGAAATTATTTGAAC
AAGAGTCCAATTCGTCAGCTGCAGTTCCCGCCAACGATGATGATGATGAGCTTGGGAAATATCAGAAGGGAGTTGGAGAAGTAGCCACAGCTGAAGTGTCTAATCCTTCT
GGTAGTAGGAATTTTGGTATGGTGGAAGGTGATCTGGACAGACCTACTACACCACCAGAGGGAGAGGAGGAATTCACCGATGATGATGGGACTACATACAAGTGGGACAG
GGCTCTCAGGGCCTGGGTACCTCAGGATGATGCATTTTTTAAACATGAGCAATATGGGCCTGAAGACATGACTTTCATGCAAGAAGAAGAAGTCTTTCCACAGCTTGAAG
CCGATGCTCCTTGCACGTCCATCAAGGGAGAAGGTGATTCTGTTCCTTCCACCTCCATCAAGGAAGAAGATGATCATGCCACAAAAGAAGCTAAGAGAAAAAGTGAGGAG
ATTGTAACCAAACAGAATGGGAAGAGAAAATTGTGTGACAATCAAGTTGAGAAAAAGGAAGCAAATAAAGGTTCTGATGGTTGGTTTGAACTAAAAATAAATACGCATGT
TTACGTAACAGGGCTGCCAGAGGATGTTACTATTGATGAAGTTGTGGAAGTTTTTTCAAAATGTGGAATAATCAAGGAGGATCCTGAAACCAAAAAACCCCGTGTTAAGT
TGTATGTTGATAGAGAGACAATGAAGAAGAAGGGTGATGCTTTAGTCTCCTATCTGAAGGAACCCTCTGTTTCTTTGCCTATGCAAATTTTGGATGGCACACCTCTTCGA
CCAGGTGGAAAAATTCTTATGTCTGTTAGTCAAGCTAAGTTTGAGCAAAAAGGCGGCCGAGACGATGCAAAAATTTCAATTCCTGCAACTGTCATTCTCCGTTTTATGTT
CACACCTGCTGAAATGAGGGCCGATGAAAATTTAGCTTCAGAAATAGAAACAGATGTCAAGGAGGAAAGCACAAGGTTTGGTCCTGTGGATTCAGTTAAGCTTAGTGTTA
ATGATTTGCACAGGTTTGGTGGAAGACAAATTCATGCAAGCGAGGAGGATGGTTTAGTGAACCATGGTTTGGTGAGGGATCTTGAAGCCGATGCTGCTCGTTTGGAGCAG
TTTGGCTCTGAACTTGAGGCTCACTAA
Protein sequenceShow/hide protein sequence
MDNNSVGNSEMVTKAGWYILGEDQQHVGPYAFSELREHFLNGYLLESTLAWSEGQSEWQPLSSIPGLTSKLFEQESNSSAAVPANDDDDELGKYQKGVGEVATAEVSNPS
GSRNFGMVEGDLDRPTTPPEGEEEFTDDDGTTYKWDRALRAWVPQDDAFFKHEQYGPEDMTFMQEEEVFPQLEADAPCTSIKGEGDSVPSTSIKEEDDHATKEAKRKSEE
IVTKQNGKRKLCDNQVEKKEANKGSDGWFELKINTHVYVTGLPEDVTIDEVVEVFSKCGIIKEDPETKKPRVKLYVDRETMKKKGDALVSYLKEPSVSLPMQILDGTPLR
PGGKILMSVSQAKFEQKGGRDDAKISIPATVILRFMFTPAEMRADENLASEIETDVKEESTRFGPVDSVKLSVNDLHRFGGRQIHASEEDGLVNHGLVRDLEADAARLEQ
FGSELEAH