CuGenDBv2

Tan0021987 (gene) of Snake gourd v1 genome

Gene ID: Tan0021987
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Serine/arginine repetitive matrix protein 1-like
Genome location: LG05:1521599..1522611
RNA-Seq Expression: Tan0021987
Synteny: Tan0021987
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domains: IPR007592 - GLABROUS1 enhancer-binding protein family


Homology
GenBank top hits | e value | % identity | Alignment
KAA0037301.1 serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa] | 3.4e-72 | 70.09
Query:  MSTTRRSSPTNASSSDAG-PVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFA
        MST  RS P+NAS       +TLT  NN      FT+EDE++LL  YLQI+RS    KNS PTLDSPA DR++TA+GPKFSH  IADKLHRLKL YHKFA
Subjt:  MSTTRRSSPTNASSSDAG-PVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFA

Query:  RTKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREERV----DLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNE
        RTKSFIKTPH R+IL++GRSIWGKSPTP T KPQVI  SR++SRRIK++S  R+E V    DLKNFPVLV+EFS+QFPGNGVWREGL+RMEE++LK MNE
Subjt:  RTKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREERV----DLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNE

Query:  KWVLLHIEEAELKARRAALLQRQL
        KWVLLHIE AELKARRAALL++QL
Subjt:  KWVLLHIEEAELKARRAALLQRQL

KAF8039515.1 hypothetical protein BT93_B1894 [Corymbia citriodora subsp. variegata] | 3.4e-32 | 42.78
Query:  FTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVISR
        F+E DE+ LL    ++     S P  D+P LDRIE +LGP F+   IA+K+ RL+ +YH+ ARTK+ IKTPH R++ E+ R +WGK P+ R  + +    
Subjt:  FTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVISR

Query:  VISRRIKEKSVAREERV-----------------DLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEAELKARRAALLQ
             ++E+    E  +                 DL NFP LV E  R FPGNGVWREGLKR+EE  L+ MNE+ V L +EEA L A++A L Q
Subjt:  VISRRIKEKSVAREERV-----------------DLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEAELKARRAALLQ

KAG6570827.1 hypothetical protein SDJN03_29742, partial [Cucurbita argyrosperma subsp. sororia] | 9.5e-75 | 73.02
Query:  MSTTRRSSPTNASSSDAGPVTLTISNNFTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIK
        MST  R  P+NAS     P TLT   +F+EEDEL+LLNCYLQIARS     NS PTLDSPALDRIETAL PKF++ HIADKLHRLKLQYHK ARTKS IK
Subjt:  MSTTRRSSPTNASSSDAGPVTLTISNNFTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIK

Query:  TPHHRRILEIGRSIWGKSPTPRTTKPQVISRVISRRIKEKSVAREERVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEAELK
        TPHHRRILEIGRSIWGK PT R TKPQ I R I RRIK++S + ++ VDL +FPVL+SEFSR+FPGNGVW+EGL+ MEE+SL+ MNEKWVLLHIEEAELK
Subjt:  TPHHRRILEIGRSIWGKSPTPRTTKPQVISRVISRRIKEKSVAREERVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEAELK

Query:  ARRAALLQRQLGITD
        ARRAALLQ+QL + D
Subjt:  ARRAALLQRQLGITD

KAG6606028.1 hypothetical protein SDJN03_03345, partial [Cucurbita argyrosperma subsp. sororia] | 4.4e-80 | 76.36
Query:  MSTTRRSSPT--NASSSDAGPVTLTISN------NFTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTK
        MST RRS P+  N +SSDA P  LT S       +FT+EDELDLLNCYL++A SKNS PTLDSPALDRI TALG KF H HIADKLHRLK+QYHKFARTK
Subjt:  MSTTRRSSPT--NASSSDAGPVTLTISN------NFTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTK

Query:  SFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVISRVISRRIKEKSVAREERVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEE
        SFIKTPHH RILEIGRSIWGK   PR TKPQ    VISRRI+ +SVA+++ VDLKNFPVLVSEFSRQFPGNGVWREGLK MEE SLKGMNEKWVLLHIEE
Subjt:  SFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVISRVISRRIKEKSVAREERVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEE

Query:  AELKARRAALLQRQLGITDT
        AELKARR AL+QRQ+G T T
Subjt:  AELKARRAALLQRQLGITDT

KGN46890.1 hypothetical protein Csa_020607 [Cucumis sativus] | 1.4e-73 | 69
Query:  MSTTRRSSPTNASSSDAGPVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFAR
        MST  RS P++AS     P TLT SNN      F +EDE++LLN YLQI+RS    KNS PTLDS A DR++TA+GPKFSH  IADKLHRLKL YHKFAR
Subjt:  MSTTRRSSPTNASSSDAGPVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFAR

Query:  TKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREE----RVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEK
        TKSFIKTPH R+IL++GRSIWGKSPTP T KPQVI  SR++SRRIK++S+ R+E     VDLKNFPVLV+EFSRQFPGNGVWR+GL+RM E++LK MNEK
Subjt:  TKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREE----RVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEK

Query:  WVLLHIEEAELKARRAALLQRQLGITDTS
        WVLLHIE AELKARRAALL+ QL  T+T+
Subjt:  WVLLHIEEAELKARRAALLQRQLGITDTS

TrEMBL top hits | e value | % identity | Alignment
A0A059DA76 Uncharacterized protein | 2.8e-32 | 41.23
Query:  SSPTNASSSDAGPVTLTISNN-----FTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKTPHHR
        SSP +A SSD  P      N      F+EEDE+ LL      + SK ++PT D P LDRIE +LGP  +   IA+K+ RL+ +YH+ ARTK+ I+TPH R
Subjt:  SSPTNASSSDAGPVTLTISNN-----FTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKTPHHR

Query:  RILEIGRSIWGKSPTPRTTKPQVISRVISRRIKEKSVAREER-------------------------VDLKNFPVLVSEFSRQFPGNGVWREGLKRMEER
        ++ E+ R +WGK P+ R  + +   R      K+++   EER                          DL +FP LV E  R FPGNGVWREGLKR+EE 
Subjt:  RILEIGRSIWGKSPTPRTTKPQVISRVISRRIKEKSVAREER-------------------------VDLKNFPVLVSEFSRQFPGNGVWREGLKRMEER

Query:  SLKGMNEKWVLLHIEEAELKARRAALLQ
         L+ +NE+ V L +EEA L A++A L Q
Subjt:  SLKGMNEKWVLLHIEEAELKARRAALLQ

A0A0A0KAX7 Uncharacterized protein | 6.6e-74 | 69
Query:  MSTTRRSSPTNASSSDAGPVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFAR
        MST  RS P++AS     P TLT SNN      F +EDE++LLN YLQI+RS    KNS PTLDS A DR++TA+GPKFSH  IADKLHRLKL YHKFAR
Subjt:  MSTTRRSSPTNASSSDAGPVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFAR

Query:  TKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREE----RVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEK
        TKSFIKTPH R+IL++GRSIWGKSPTP T KPQVI  SR++SRRIK++S+ R+E     VDLKNFPVLV+EFSRQFPGNGVWR+GL+RM E++LK MNEK
Subjt:  TKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREE----RVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEK

Query:  WVLLHIEEAELKARRAALLQRQLGITDTS
        WVLLHIE AELKARRAALL+ QL  T+T+
Subjt:  WVLLHIEEAELKARRAALLQRQLGITDTS

A0A2H5MZL8 Uncharacterized protein | 1.4e-28 | 37.1
Query:  SSPTNASSSDAGPVTLTISNN---FTEEDELDLLNCYLQIAR---------SKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSF
        S+P         PV+   S N   FT+ DE+ LL  + ++A          + NS  TLDS + +RI ++LG KF+H  I DK+ RL+++YHK AR+KS 
Subjt:  SSPTNASSSDAGPVTLTISNN---FTEEDELDLLNCYLQIAR---------SKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSF

Query:  IKTPHHRRILEIGRSIWGKSPTPRTTKPQVISRVIS-----RRIKEKSVAREER----VDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKW
        +KTPH + + +I + IWGK  T R  + + +    S        ++K+   EER     +L  +PVL  E S+  P N VWRE LK +    LK MN+KW
Subjt:  IKTPHHRRILEIGRSIWGKSPTPRTTKPQVISRVIS-----RRIKEKSVAREER----VDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKW

Query:  VLLHIEEAELKARRAALLQRQ
        +L+ IEEA++ +++A L++ Q
Subjt:  VLLHIEEAELKARRAALLQRQ

A0A5A7T6T1 Serine/arginine repetitive matrix protein 1-like | 1.6e-72 | 70.09
Query:  MSTTRRSSPTNASSSDAG-PVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFA
        MST  RS P+NAS       +TLT  NN      FT+EDE++LL  YLQI+RS    KNS PTLDSPA DR++TA+GPKFSH  IADKLHRLKL YHKFA
Subjt:  MSTTRRSSPTNASSSDAG-PVTLTISNN------FTEEDELDLLNCYLQIARS----KNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFA

Query:  RTKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREERV----DLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNE
        RTKSFIKTPH R+IL++GRSIWGKSPTP T KPQVI  SR++SRRIK++S  R+E V    DLKNFPVLV+EFS+QFPGNGVWREGL+RMEE++LK MNE
Subjt:  RTKSFIKTPHHRRILEIGRSIWGKSPTPRTTKPQVI--SRVISRRIKEKSVAREERV----DLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNE

Query:  KWVLLHIEEAELKARRAALLQRQL
        KWVLLHIE AELKARRAALL++QL
Subjt:  KWVLLHIEEAELKARRAALLQRQL

A0A6P3Z773 probable transcription factor At1g61730 | 1.4e-31 | 44.02
Query:  STTRRSSPTNASSSD----AGPVTLTISNNFTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKT
        +TT    P+++S S     +G   +     FTEEDE  LL  +  I +S      ++SP LDRIE +LG +FSH  I DKL RL+L+YHK ARTKS IKT
Subjt:  STTRRSSPTNASSSD----AGPVTLTISNNFTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKT

Query:  PHHRRILEIGRSIWG-KSPTPRTTKPQVISRVISRRIKEKSVAREER---VDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEA
         H R + +I R IWG K    R  K +          +EK    E+R   V L+ FPVL  E      GN VW++GL  +EE++LKG+NEKW+LL +EEA
Subjt:  PHHRRILEIGRSIWG-KSPTPRTTKPQVISRVISRRIKEKSVAREER---VDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEA

Query:  ELKARRAAL
        E+ A RA L
Subjt:  ELKARRAAL

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGTCAACGACCCGACGCTCGTCGCCGACCAACGCGTCCTCTTCCGACGCCGGTCCCGTAACCCTAACCATATCCAACAATTTCACCGAGGAAGACGAACTCGATCTCCT
CAATTGCTACCTCCAAATCGCCAGATCCAAGAATTCTCACCCCACTCTCGACTCTCCGGCTTTGGACCGCATCGAGACCGCTCTCGGCCCCAAATTCAGCCACTTCCACA
TCGCCGATAAGCTCCACAGGCTGAAGCTCCAGTACCACAAATTCGCGAGAACCAAGTCCTTCATCAAAACCCCTCACCACCGTCGGATTCTCGAGATCGGCCGCAGCATC
TGGGGAAAATCCCCCACACCCAGAACAACTAAACCACAGGTAATTTCACGGGTAATTTCACGCAGAATCAAAGAAAAGTCAGTTGCAAGAGAAGAAAGGGTTGATCTGAA
GAATTTTCCTGTTCTTGTGAGTGAATTTTCTCGGCAGTTTCCAGGAAATGGTGTGTGGAGAGAGGGGCTGAAGAGAATGGAGGAGAGAAGTTTGAAAGGTATGAATGAGA
AGTGGGTTTTGTTGCATATTGAAGAGGCTGAGCTTAAGGCAAGAAGAGCTGCGTTATTACAGCGGCAACTTGGAATTACAGACACTTCTTAA
mRNA sequence
ATTTGTGGAGCCTGGGGCGGGAAATTTTCTTTTCCTTAAAATGAGTCCCCGTCCGATTGGAGAAATCTGAGACCCTTCAGCAAATTTCACACTCAAAACCTTTCTTTGCC
CGTCATTGACTTCCTCACAGAAACCGCCCAACCAAAACAAAATCAAATCCCAACAATTCCGATCCCCATGTCAACGACCCGACGCTCGTCGCCGACCAACGCGTCCTCTT
CCGACGCCGGTCCCGTAACCCTAACCATATCCAACAATTTCACCGAGGAAGACGAACTCGATCTCCTCAATTGCTACCTCCAAATCGCCAGATCCAAGAATTCTCACCCC
ACTCTCGACTCTCCGGCTTTGGACCGCATCGAGACCGCTCTCGGCCCCAAATTCAGCCACTTCCACATCGCCGATAAGCTCCACAGGCTGAAGCTCCAGTACCACAAATT
CGCGAGAACCAAGTCCTTCATCAAAACCCCTCACCACCGTCGGATTCTCGAGATCGGCCGCAGCATCTGGGGAAAATCCCCCACACCCAGAACAACTAAACCACAGGTAA
TTTCACGGGTAATTTCACGCAGAATCAAAGAAAAGTCAGTTGCAAGAGAAGAAAGGGTTGATCTGAAGAATTTTCCTGTTCTTGTGAGTGAATTTTCTCGGCAGTTTCCA
GGAAATGGTGTGTGGAGAGAGGGGCTGAAGAGAATGGAGGAGAGAAGTTTGAAAGGTATGAATGAGAAGTGGGTTTTGTTGCATATTGAAGAGGCTGAGCTTAAGGCAAG
AAGAGCTGCGTTATTACAGCGGCAACTTGGAATTACAGACACTTCTTAAGATGTCAATTACATTACTAATTGAAGATTTTATTTCTTTTCCGATTTGGGAAAATTGTTCT
GTACCATAGTTTGAGAAATGAGAGCTTTTTTGAATGTCAGTGTTGTTCCTTGTATTTCAATTATGGAGCTAATGGTAGAAGTTACAAGTAGGTGTGTTTGGCTGCAGTTG
TATTATATTGTTGTTCAAATCAA
Protein sequence
MSTTRRSSPTNASSSDAGPVTLTISNNFTEEDELDLLNCYLQIARSKNSHPTLDSPALDRIETALGPKFSHFHIADKLHRLKLQYHKFARTKSFIKTPHHRRILEIGRSI
WGKSPTPRTTKPQVISRVISRRIKEKSVAREERVDLKNFPVLVSEFSRQFPGNGVWREGLKRMEERSLKGMNEKWVLLHIEEAELKARRAALLQRQLGITDTS
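
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal sketch of that check, not part of the CuGenDB record: it assumes the standard nuclear genetic code, and the helper name `translate` is hypothetical. The two test fragments are the first 30 and last 21 nucleotides of the CDS shown above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored, a trailing partial codon is
    dropped, and codons with non-ACGT characters are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt and last 21 nt of the Tan0021987 CDS listed above.
    print(translate("ATGTCAACGACCCGACGCTCGTCGCCGACC"))
    print(translate("GGAATTACAGACACTTCTTAA"))
```

Run as a script, this prints `MSTTRRSSPT` and `GITDTS`, which match the start and end of the protein sequence listed above. Passing the full CDS (line breaks included) to `translate` should reproduce the whole protein under the same assumption.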