; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009033 (gene) of Snake gourd v1 genome

Gene IDTan0009033
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionS4 RNA-binding domain-containing protein
Genome locationLG01:7893926..7898452
RNA-Seq ExpressionTan0009033
SyntenyTan0009033
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR002942 - RNA-binding S4 domain
IPR012677 - Nucleotide-binding alpha-beta plait domain superfamily
IPR017506 - Photosystem II S4
IPR036986 - RNA-binding S4 domain superfamily
IPR040591 - YlmH, putative RNA-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600381.1 hypothetical protein SDJN03_05614, partial [Cucurbita argyrosperma subsp. sororia]4.8e-16393.48Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR AQSSSFRTPLA+NL K+SFHEA FPSSPSSPLG+SG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELT DPD++SALSITGNF FH C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFLVSSLRKVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD IS GDVRVNWTTITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEIN TKKGKFA+ELI+YV
Subjt:  LKIGEINSTKKGKFAVELIRYV

XP_022941560.1 uncharacterized protein LOC111446880 isoform X2 [Cucurbita moschata]1.5e-16192.86Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR A+SSSFRTPLA+NL K+SFHEA FPSSPSSPLG+SG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELT DPD++SALSITGNF F  C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFLVSSLRKVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD IS GDVRVNWTTITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEIN TKKGKFA+ELI+YV
Subjt:  LKIGEINSTKKGKFAVELIRYV

XP_022981895.1 uncharacterized protein LOC111480897 isoform X1 [Cucurbita maxima]4.1e-16293.48Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR AQSSS RTPLA+NL+K+SFHEA FP+SPSSPLGSSG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAI+KLADVKAIAQGGYPEAERCRISVGHADELT DPD++SALSITGNF FH C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFLVSSL KVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD ISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEIN TKKGKFAVELIRYV
Subjt:  LKIGEINSTKKGKFAVELIRYV

XP_023554595.1 uncharacterized protein LOC111811761 isoform X1 [Cucurbita pepo subsp. pepo]6.9e-16293.17Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR   SSSFRTPLA+NL+K+SFHEA FPSSPSSPLG+SG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELT DPDI+SALSITGNF FH C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFLVSSLRKVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD IS GDVRVNWTTITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEIN TKKGKFA+ELI+YV
Subjt:  LKIGEINSTKKGKFAVELIRYV

XP_038899605.1 putative RNA-binding protein YlmH [Benincasa hispida]8.2e-16395.27Show/hide
Query:  TSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTP
        TSIGA WSLRR AQSSSFR+PLA+NLNKLSFHEA FPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRR VLHTNFLTP
Subjt:  TSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTP

Query:  PVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELV
        PVVKES+LA+QKLADVKAIAQGGYPEAERCRISVGHADEL  DPDI+SALSITGNF FHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVV+VPELV
Subjt:  PVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELV

Query:  DFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGRLKIGE
        DFL SSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWT ITKNGTILKTGDIVSVSGKGRLKIGE
Subjt:  DFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGRLKIGE

Query:  INSTKKGKFAVELIRYV
        INSTKKGKFAVELIRYV
Subjt:  INSTKKGKFAVELIRYV

TrEMBL top hitse value%identityAlignment
A0A0A0LC56 S4 RNA-binding domain-containing protein1.2e-15993.17Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAAT TSI ALWSLRR AQ SSFR PLALNLN+L F+EA F   PSSPLG+SGICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELT DPDIISALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDI+LQEE GAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDA+ASAGFKISRSKLVDLISSGDVRVNWT+ITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEINSTKKGKFAVELIRYV
Subjt:  LKIGEINSTKKGKFAVELIRYV

A0A1S3C3I2 putative RNA-binding protein YlmH isoform X22.3e-15892.24Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAATTT+IGALWSLR  +Q SSFR PL +NLN+L FHE  F   PSSPLG+SGICQLVQAVKGDIDVLLNGVGDKGVIVDVK ILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGH+DELT DPDIISALSITGNFTFHPCSHGDFLG+ILGTGIAREKLGDII+QEE GAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFL+SSLRKVGNVTVSCTRIPLTAL+YEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWT ITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEINSTKKGKFAVELIRYV
Subjt:  LKIGEINSTKKGKFAVELIRYV

A0A6J1FSF6 uncharacterized protein LOC111446880 isoform X16.3e-16192.88Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR A+SSSFRTPLA+NL K+SFHEA FPSSPSSPLG+SG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELT DPD++SALSITGNF F  C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISS-GDVRVNWTTITKNGTILKTGDIVSVSGKG
        VPELVDFLVSSLRKVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD ISS GDVRVNWTTITKNGTILKTGDIVSVSGKG
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISS-GDVRVNWTTITKNGTILKTGDIVSVSGKG

Query:  RLKIGEINSTKKGKFAVELIRYV
        RLKIGEIN TKKGKFA+ELI+YV
Subjt:  RLKIGEINSTKKGKFAVELIRYV

A0A6J1FU11 uncharacterized protein LOC111446880 isoform X27.5e-16292.86Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR A+SSSFRTPLA+NL K+SFHEA FPSSPSSPLG+SG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELT DPD++SALSITGNF F  C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFLVSSLRKVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD IS GDVRVNWTTITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEIN TKKGKFA+ELI+YV
Subjt:  LKIGEINSTKKGKFAVELIRYV

A0A6J1J103 uncharacterized protein LOC111480897 isoform X12.0e-16293.48Show/hide
Query:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
        MAA T SIGALWSLRR AQSSS RTPLA+NL+K+SFHEA FP+SPSSPLGSSG+CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT
Subjt:  MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHT

Query:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
        NFLTPPVVKESMLAI+KLADVKAIAQGGYPEAERCRISVGHADELT DPD++SALSITGNF FH C+HGDFLGAILGTGIAREKLGDIILQEEKGAQVVI
Subjt:  NFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVI

Query:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
        VPELVDFLVSSL KVGNVTVSCTRIPLT LDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVD ISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR
Subjt:  VPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGR

Query:  LKIGEINSTKKGKFAVELIRYV
        LKIGEIN TKKGKFAVELIRYV
Subjt:  LKIGEINSTKKGKFAVELIRYV

SwissProt top hitse value%identityAlignment
P71020 Putative RNA-binding protein YlmH5.2e-1930.67Show/hide
Query:  TNFLTPPVVKESML--AIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDI-ISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGA
        T+FL P   +E ++  A+   ADV     GGY  AER R  +        + D  + A ++     F    H   LGA++G G+ R+K GDI+   E   
Subjt:  TNFLTPPVVKESML--AIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDI-ISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGA

Query:  QVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVS
        Q+++  +  DF+ + L + G   VS  +I L+ L+      +      +SLR+DA+ ++  + SR K   L+ +G V+VNW  +     I+  GD++S+ 
Subjt:  QVVIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVS

Query:  GKGRLKIGEI-NSTKKGKFAVELIR
        G GR  + +I   TKK K+ V   R
Subjt:  GKGRLKIGEI-NSTKKGKFAVELIR

Arabidopsis top hitse value%identityAlignment
AT1G53120.1 RNA-binding S4 domain-containing protein1.5e-11767.28Show/hide
Query:  TSIGALW-----SLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGI--CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVL
        TS+   W     + R VA SS   T     +  LS          S PL  S +  C   +A+KGD+D LL GVGD+ V  +VK IL MA+R+ S+REVL
Subjt:  TSIGALW-----SLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGI--CQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVL

Query:  HTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQV
        HT+FLTPP+VKES+  ++K ADVK +AQGGYPEAERCRIS+GH D LT DPDI++ALSITGNF F PCSHGDFLGAILGTGI+REKLGDI++QEEKGAQV
Subjt:  HTNFLTPPVVKESMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQV

Query:  VIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGK
        +IVPELVDF+V++L KVGNV V+C++IPL AL+YEPP+T +FKT+EASLR+DA+ASAGFKISRSKLVDLISS DVRVNW T+TKNGTI+KTGD+VSVSGK
Subjt:  VIVPELVDFLVSSLRKVGNVTVSCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGK

Query:  GRLKIGEINSTKKGKFAVELIRYV
        GRLKIGEIN TKKGKFAVE+IRY+
Subjt:  GRLKIGEINSTKKGKFAVELIRYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCAACGACAACGAGCATTGGAGCTCTGTGGAGTCTGAGAAGAGTAGCTCAATCTTCCAGTTTCCGAACTCCCCTTGCCCTTAACCTCAACAAGCTCTCCTTCCA
TGAGGCTCACTTTCCCTCTTCTCCATCTTCTCCTCTGGGCTCCTCAGGAATATGCCAATTAGTGCAAGCTGTAAAGGGAGACATTGATGTTTTACTCAATGGAGTTGGAG
ATAAAGGTGTTATTGTAGACGTGAAGCATATTCTTGTGATGGCCAAACGGTCATTATCTAGACGAGAAGTTCTCCATACGAACTTTCTCACCCCACCTGTGGTGAAAGAG
TCAATGCTAGCTATACAAAAACTAGCTGACGTGAAAGCAATAGCTCAGGGAGGATACCCAGAGGCAGAGCGCTGCCGAATTTCTGTTGGACACGCAGATGAACTAACAAG
GGATCCAGACATAATTTCAGCATTGAGTATCACAGGAAATTTTACGTTTCACCCTTGCTCCCATGGGGACTTCCTTGGAGCAATTCTTGGTACAGGCATCGCTAGGGAAA
AGCTTGGTGATATCATACTCCAGGAAGAAAAGGGAGCTCAGGTAGTCATTGTTCCAGAACTTGTTGACTTCCTTGTATCATCACTGCGCAAGGTTGGCAATGTCACAGTT
TCTTGTACGAGGATACCGTTGACAGCTCTTGATTATGAACCACCAAAGACTAAGACATTTAAAACCATTGAGGCATCTCTTAGGGTGGATGCTCTAGCAAGTGCTGGGTT
CAAGATTTCACGGTCTAAACTAGTGGATTTAATCAGTAGCGGTGATGTTCGTGTCAATTGGACGACAATTACCAAAAATGGAACCATACTAAAGACTGGTGATATTGTTT
CTGTCAGTGGGAAAGGGAGACTAAAGATTGGAGAAATAAATTCTACAAAAAAGGGAAAATTTGCTGTCGAGCTTATCAGGTACGTGTAA
mRNA sequenceShow/hide mRNA sequence
CACCGACTGATGAAGTTCAGAACGCATGTGCACTGACTAACCATGGCCGCAACGACAACGAGCATTGGAGCTCTGTGGAGTCTGAGAAGAGTAGCTCAATCTTCCAGTTT
CCGAACTCCCCTTGCCCTTAACCTCAACAAGCTCTCCTTCCATGAGGCTCACTTTCCCTCTTCTCCATCTTCTCCTCTGGGCTCCTCAGGAATATGCCAATTAGTGCAAG
CTGTAAAGGGAGACATTGATGTTTTACTCAATGGAGTTGGAGATAAAGGTGTTATTGTAGACGTGAAGCATATTCTTGTGATGGCCAAACGGTCATTATCTAGACGAGAA
GTTCTCCATACGAACTTTCTCACCCCACCTGTGGTGAAAGAGTCAATGCTAGCTATACAAAAACTAGCTGACGTGAAAGCAATAGCTCAGGGAGGATACCCAGAGGCAGA
GCGCTGCCGAATTTCTGTTGGACACGCAGATGAACTAACAAGGGATCCAGACATAATTTCAGCATTGAGTATCACAGGAAATTTTACGTTTCACCCTTGCTCCCATGGGG
ACTTCCTTGGAGCAATTCTTGGTACAGGCATCGCTAGGGAAAAGCTTGGTGATATCATACTCCAGGAAGAAAAGGGAGCTCAGGTAGTCATTGTTCCAGAACTTGTTGAC
TTCCTTGTATCATCACTGCGCAAGGTTGGCAATGTCACAGTTTCTTGTACGAGGATACCGTTGACAGCTCTTGATTATGAACCACCAAAGACTAAGACATTTAAAACCAT
TGAGGCATCTCTTAGGGTGGATGCTCTAGCAAGTGCTGGGTTCAAGATTTCACGGTCTAAACTAGTGGATTTAATCAGTAGCGGTGATGTTCGTGTCAATTGGACGACAA
TTACCAAAAATGGAACCATACTAAAGACTGGTGATATTGTTTCTGTCAGTGGGAAAGGGAGACTAAAGATTGGAGAAATAAATTCTACAAAAAAGGGAAAATTTGCTGTC
GAGCTTATCAGGTACGTGTAAGCTCTGGCGATATTGGATCGTTTGGTGTTGAACTGGGTAAGTTTGTTATGGGATCTCCTTGGAGGTGAACATCAGGATTGCTTCCTTAG
GAATAACGAAAGTAAGTCATTGTCCAAGTTCAACATGACATTAATCTAATTGAACATCTATTCATCCATTCTTATAAAATTTTCATTTGGAAAATCTGTCTCTCAGCATT
TCAATGACAGGGATTATTAGTTCTTTTCTTTTTGGGACCTGTAATTTTTTTTTCTGTGATGAATTCTCGTTCAAACTCTAATGTTGTCAAAGGAGGGAATGTATATACGA
AGTTGTAGCTTCACATTAACTATTTGTATATTCAACGATCTGTCTTATTTCCATCCCGTAATTTGTAATAATTGATGTGTTGTATTAGTTAGTCTGTTGTATTAGTTGGT
ATGTTAGTTGATGGACCTGAGTATTTAAGTCAGTTCCCTGCTCAATAAGATTGGCTCTATTTTTATCTTTTTCATATTTCATGTGCTCACA
Protein sequenceShow/hide protein sequence
MAATTTSIGALWSLRRVAQSSSFRTPLALNLNKLSFHEAHFPSSPSSPLGSSGICQLVQAVKGDIDVLLNGVGDKGVIVDVKHILVMAKRSLSRREVLHTNFLTPPVVKE
SMLAIQKLADVKAIAQGGYPEAERCRISVGHADELTRDPDIISALSITGNFTFHPCSHGDFLGAILGTGIAREKLGDIILQEEKGAQVVIVPELVDFLVSSLRKVGNVTV
SCTRIPLTALDYEPPKTKTFKTIEASLRVDALASAGFKISRSKLVDLISSGDVRVNWTTITKNGTILKTGDIVSVSGKGRLKIGEINSTKKGKFAVELIRYV