CuGenDBv2

Tan0020890 (gene) of Snake gourd v1 genome

Gene ID: Tan0020890
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA binding
Genome location: LG01:112519679..112521745
RNA-Seq Expression: Tan0020890
Synteny: Tan0020890
Gene Ontology terms: NA
InterPro domains: NA
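
The genome location field can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG01:112519679..112521745")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, start, end, span)  # LG01 112519679 112521745 2067
```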


Homology
GenBank top hits (e value, % identity, alignment)
KAG6591445.1 hypothetical protein SDJN03_13791, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.5e-107; % identity: 94.5)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        MVEFS+SDP +W EALSSYPSQIEALGKPNLVSLD+FYRNELP LLH RNPS YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        QLDPKNGGKTG+KRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

XP_022936826.1 uncharacterized protein LOC111443296 [Cucurbita moschata] (e value: 1.2e-104; % identity: 93.09)
Query:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP
        +EFS+SDP +W EALSSYPS+IEALGKPNLVSLDEFYRNELP LLH RNPS YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCLP
Subjt:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP

Query:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ
        DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQSQ
Subjt:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ

Query:  LDPKNGGKTGAKRKRKT
        L  KNGGKTG+KRKRKT
Subjt:  LDPKNGGKTGAKRKRKT

XP_022977105.1 uncharacterized protein LOC111477273 [Cucurbita maxima] (e value: 3.7e-106; % identity: 93.12)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        MVEF++SDP +W EALSSYPSQIEALGKPNLVSLD+FYRNELP LLH RNPS YITTSELSKLMQWKLTRGKWRPRL DFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL+FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        QLDPKNGGK GAKRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

XP_023536029.1 uncharacterized protein LOC111797290 [Cucurbita pepo subsp. pepo] (e value: 7.4e-107; % identity: 94.04)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        MVEFS+S+P +W EAL SYPSQIEALGKPNLVSLD+FYRNELP LLH RNPS YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        QLDPKNGGKTGAKRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

XP_038898858.1 uncharacterized protein LOC120086335 isoform X1 [Benincasa hispida] (e value: 2.2e-103; % identity: 90.83)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        M+EFSNSDP++W EALSSY SQIEALGKPNL SLD+FYRNELP LLH RNP+ YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDI KAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEG+IFTPSDVERALWS AIGEKLKG +S
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        +LDP NGGKTG KRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3W6 Uncharacterized protein (e value: 7.0e-103; % identity: 90.37)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        M+EFS SDP++W EALS+Y SQIEALGKPNLVSLD+FYRNELPL+LH RNPS YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FANKL+ KAKELS EG+IFTPSDVERALWS AIGEKLKGS+S
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        QLDP NGGKTG KRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

A0A1S3BUF7 uncharacterized protein LOC103493629 isoform X1 (e value: 9.1e-103; % identity: 90.32)
Query:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP
        ++FSNSDP++W EALSSY SQIEALGKPNLVSLD+FYRNELPL+LH R PS YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCLP
Subjt:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP

Query:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ
        DISKAVSELTPLKGVGPATASA+LAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL FANKLQ+KAKELS EG+ FTPSDVERALWS AIGEKLKGSQSQ
Subjt:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ

Query:  LDPKNGGKTGAKRKRKT
        LDP NGGKTG KRKRKT
Subjt:  LDPKNGGKTGAKRKRKT

A0A6J1CCK9 uncharacterized protein LOC111010256 (e value: 6.5e-101; % identity: 90.37)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        MVEF NSDP +W EALSSYPSQIEALGKPNLVSLD FYRNELP LLH RNP+ YITT ELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFAN LQ+KAKELSS+GQIFTPSDVERALWS+A GEKLK SQS
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        QL+PKNG  +GAKRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

A0A6J1F8K0 uncharacterized protein LOC111443296 (e value: 5.7e-105; % identity: 93.09)
Query:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP
        +EFS+SDP +W EALSSYPS+IEALGKPNLVSLDEFYRNELP LLH RNPS YITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVK ASQKAFQCLP
Subjt:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP

Query:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ
        DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQSQ
Subjt:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ

Query:  LDPKNGGKTGAKRKRKT
        L  KNGGKTG+KRKRKT
Subjt:  LDPKNGGKTGAKRKRKT

A0A6J1ILC1 uncharacterized protein LOC111477273 (e value: 1.8e-106; % identity: 93.12)
Query:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL
        MVEF++SDP +W EALSSYPSQIEALGKPNLVSLD+FYRNELP LLH RNPS YITTSELSKLMQWKLTRGKWRPRL DFVSSLDESLVK ASQKAFQCL
Subjt:  MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCL

Query:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS
        PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYL+FA KLQDKAKELSSEGQIFTPSDVERALWSTAIGEK+KGSQS
Subjt:  PDISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQS

Query:  QLDPKNGGKTGAKRKRKT
        QLDPKNGGK GAKRKRKT
Subjt:  QLDPKNGGKTGAKRKRKT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT3G12210.1 DNA binding (e value: 9.1e-55; % identity: 76.81)
Query:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP
        +EF  SD  VW EALSSY S+IE+L KP LVSLD+FYR +LP LLH+R+P+ Y+TTSELS+LM+WKL+RGKWRPRLLDFVSSLD+S+VKSAS+KAF+ LP
Subjt:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP

Query:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDE
        DISKAV ELT LKGVG ATASAVLAAYAPD+APFMSDE
Subjt:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDE

AT3G12210.2 DNA binding (e value: 1.1e-76; % identity: 70.23)
Query:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP
        +EF  SD  VW EALSSY S+IE+L KP LVSLD+FYR +LP LLH+R+P+ Y+TTSELS+LM+WKL+RGKWRPRLLDFVSSLD+S+VKSAS+KAF+ LP
Subjt:  VEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLP

Query:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ
        DISKAV ELT LKGVG ATASAVLAAYAPD+APFMSDEAME ALGNSKDYSLKQYL FA KLQDKAKEL  +G+   PSD+ERALWS  +  K +  +S 
Subjt:  DISKAVSELTPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQ

Query:  LDPKNGGKTGAKRKR
                +G KRKR
Subjt:  LDPKNGGKTGAKRKR
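
In the alignments above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these per block; the function name `midline_stats` is hypothetical, and the per-block figures are only illustrative, since the % identity reported for each hit is computed over the whole alignment rather than a single block.

```python
def midline_stats(midline):
    """Return (identities, positives, columns) for one BLAST midline.

    Pass only the aligned columns: strip the row's left padding but keep
    every internal space, because each space is a non-matching column.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Last block of the KAG6591445.1 alignment.
print(midline_stats("QLDPKNGGKTG+KRKRKT"))  # (17, 18, 18)

# Last block of the XP_022936826.1 alignment (two unmatched columns).
print(midline_stats("L  KNGGKTG+KRKRKT"))   # (14, 15, 17)
```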


Sequences
CDS sequence
ATGGTGGAGTTCTCAAACTCCGATCCAAGTGTTTGGTTAGAAGCTCTCTCTTCTTATCCATCTCAAATCGAAGCCTTAGGCAAGCCCAATTTAGTTTCTCTCGACGAATT
TTACCGGAACGAACTCCCTCTTCTTCTCCACAACCGAAACCCTAGCTCTTACATTACTACTTCCGAGCTCTCAAAACTCATGCAGTGGAAGCTCACTCGCGGAAAATGGA
GGCCGCGTCTCCTGGACTTCGTTTCATCATTGGACGAATCGCTCGTCAAATCGGCCTCTCAGAAGGCCTTTCAATGCCTTCCTGATATTTCCAAAGCTGTGTCTGAGCTT
ACGCCGCTCAAAGGCGTTGGTCCGGCCACTGCCTCGGCGGTTCTGGCTGCTTACGCGCCGGATGTTGCGCCTTTTATGTCCGACGAGGCTATGGAGGCGGCTCTTGGAAA
CTCCAAGGATTATTCGTTGAAGCAGTACTTATCGTTCGCAAATAAGTTGCAAGATAAAGCCAAGGAATTAAGCTCAGAAGGACAAATTTTCACACCATCTGACGTAGAGA
GGGCTTTGTGGAGTACAGCTATAGGGGAAAAATTAAAAGGTTCGCAATCACAATTAGATCCCAAGAATGGAGGCAAAACTGGCGCCAAAAGAAAGAGAAAAACTTGA
mRNA sequence
GCCAAGAGCAAGAGTCGCAGCCGCGAAGAAACACCCATTTTCAAATATCAAAACGCCTGTGGAGTTTAGCTGAGGAACATCGCATAAGCAGCAATGGTGGAGTTCTCAAA
CTCCGATCCAAGTGTTTGGTTAGAAGCTCTCTCTTCTTATCCATCTCAAATCGAAGCCTTAGGCAAGCCCAATTTAGTTTCTCTCGACGAATTTTACCGGAACGAACTCC
CTCTTCTTCTCCACAACCGAAACCCTAGCTCTTACATTACTACTTCCGAGCTCTCAAAACTCATGCAGTGGAAGCTCACTCGCGGAAAATGGAGGCCGCGTCTCCTGGAC
TTCGTTTCATCATTGGACGAATCGCTCGTCAAATCGGCCTCTCAGAAGGCCTTTCAATGCCTTCCTGATATTTCCAAAGCTGTGTCTGAGCTTACGCCGCTCAAAGGCGT
TGGTCCGGCCACTGCCTCGGCGGTTCTGGCTGCTTACGCGCCGGATGTTGCGCCTTTTATGTCCGACGAGGCTATGGAGGCGGCTCTTGGAAACTCCAAGGATTATTCGT
TGAAGCAGTACTTATCGTTCGCAAATAAGTTGCAAGATAAAGCCAAGGAATTAAGCTCAGAAGGACAAATTTTCACACCATCTGACGTAGAGAGGGCTTTGTGGAGTACA
GCTATAGGGGAAAAATTAAAAGGTTCGCAATCACAATTAGATCCCAAGAATGGAGGCAAAACTGGCGCCAAAAGAAAGAGAAAAACTTGATGTCCGCTGCTTCAAGTGAC
AGCCCTACTACTGCTTTTGAGTTTTGTACTTCTAGTTTTCGACACGCAGTTTGTTCAGGACACGCGGTTAGGTTTATCCATCTCTATTCCTAAC
Protein sequence
MVEFSNSDPSVWLEALSSYPSQIEALGKPNLVSLDEFYRNELPLLLHNRNPSSYITTSELSKLMQWKLTRGKWRPRLLDFVSSLDESLVKSASQKAFQCLPDISKAVSEL
TPLKGVGPATASAVLAAYAPDVAPFMSDEAMEAALGNSKDYSLKQYLSFANKLQDKAKELSSEGQIFTPSDVERALWSTAIGEKLKGSQSQLDPKNGGKTGAKRKRKT
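
The protein sequence corresponds to a translation of the CDS. As a rough check, the sketch below translates the first and last codons of the CDS shown above, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and the record does not say which tool produced the protein sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS.
print(translate("ATGGTGGAGTTCTCAAACTCCGATCCAAGT"))  # MVEFSNSDPS

# Last 27 nt of the CDS, ending in the TGA stop codon.
print(translate("GGCGCCAAAAGAAAGAGAAAAACTTGA"))     # GAKRKRKT
```

Both fragments agree with the start (`MVEFSNSDPS`) and end (`GAKRKRKT`) of the protein sequence listed above.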