; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016256 (gene) of Snake gourd v1 genome

Gene IDTan0016256
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA transcription factor-like protein
Genome locationLG06:5867922..5868729
RNA-Seq ExpressionTan0016256
SyntenyTan0016256
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575793.1 hypothetical protein SDJN03_26432, partial [Cucurbita argyrosperma subsp. sororia]7.3e-9780.08Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NW FSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD+AK+NYERDD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVLD VSC     G  P  ++NR  DRKEQE+D+REYYKHHKASPLAEIEF DTRKP    TDGTAYDGGGKDVIGWLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSL+RATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

KAG7014335.1 hypothetical protein SDJN02_24512 [Cucurbita argyrosperma subsp. argyrosperma]1.2e-9680.51Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NW FSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD+AK+NYERDD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVL  VSC     G  P  ++NRE DRKEQEED REYYKHHKASPLAEIEF DTRKP    TDGTAYDGGGKDVIGWLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSL+RATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

XP_022953997.1 uncharacterized protein LOC111456388 [Cucurbita moschata]4.7e-9679.24Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NW FSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD+AK+NYERDD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVL  VSC     G  P  ++NRE DRKEQ++D+REYYKHHKASPLAEIEF DTRKP    TDGTAYDGGGKD+IGWLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSL+RATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

XP_022991237.1 uncharacterized protein LOC111487953 [Cucurbita maxima]4.3e-9781.36Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NWAFSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD AKANY  DD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVL  VSC     G  PE ++NRE DRKEQEED+REYYKHHKASPLAEIEFADTRKP    TDGTAYDGGGKDVI WLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSLRRATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

XP_023548846.1 uncharacterized protein LOC111807374 [Cucurbita pepo subsp. pepo]2.5e-9780.93Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NW+FSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD+AKANYERDD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVL  VSC     G  P  ++NRE DRKEQEED+REYYKHHKASPLAEIEF DTRKP    TDGTAYDGGGKDVIGWLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSLRRA EIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

TrEMBL top hitse value%identityAlignment
A0A0A0K9G7 Uncharacterized protein9.6e-8774.38Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRRG---LTTCRTADPSVHAN---DDNQPAVLSGEPEKSQDNLEPDNAKANYE-RDDVKEADLNGPFGPPKAQYASS
        MQS L AIAPKSNWAF + Q + LRRG   LTT RTADPS+HAN   DDN PAVLSGEPE+SQDNLEPDNAKANY+ RDD K+ D  GPFG P AQ+ASS
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRRG---LTTCRTADPSVHAN---DDNQPAVLSGEPEKSQDNLEPDNAKANYE-RDDVKEADLNGPFGPPKAQYASS

Query:  PRLETTPVGQASKPITQQKRANVTVLDGVSCFGA----LPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGW
        PRLETT VGQASKPITQQKRA+   +D VSC G     L +GKENR T+ KE+EED+R+YYKHHKASPLAEIEFADTRKP    TDGTAYDG    VIGW
Subjt:  PRLETTPVGQASKPITQQKRANVTVLDGVSCFGA----LPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGW

Query:  LPEQLDTAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        LPEQ+DT +DSLRRATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  LPEQLDTAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

A0A1S3BR22 uncharacterized protein LOC1034927781.4e-8572.58Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDD--NQPAVLSGEPEKSQDNLEPDNAKANYE-RDDVKEADLNGPFGPPKAQYASSPRL
        MQSRL AIAP+SNWA  + Q + LRR GLTT RTADPSVHANDD  N P+VLSGEPE+SQDNLEPDNAKANYE RDD K+ D NGPFGP KAQ+ASSPRL
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDD--NQPAVLSGEPEKSQDNLEPDNAKANYE-RDDVKEADLNGPFGPPKAQYASSPRL

Query:  ETTPVGQASKPITQQKRANVTVLDGVSCFGA----LPEGKENRETDRKEQE---------EDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGG
        ETT VGQASKPITQQKRA+   +D VSC G     L EGKE+R T+ K++E         ED+R+YYKHHKASPLAEIEF DTRKP    TDGTA  G G
Subjt:  ETTPVGQASKPITQQKRANVTVLDGVSCFGA----LPEGKENRETDRKEQE---------EDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGG

Query:  KDVIGWLPEQLDTAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        K VIGWLPEQ+DT +DSLRRATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  KDVIGWLPEQLDTAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

A0A6J1DCP1 uncharacterized protein LOC1110191285.4e-9076.05Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRRGLT---TCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLE
        MQSRLTAIAP S WAFSLAQL RLRRGL    T RTADPSVHA DDN PAV SGEPEKSQ+  EPDNAKANY+R+D  +   NGPFGP KAQY SSPRLE
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRRGLT---TCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLE

Query:  TTPVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQ
        +T VGQ SKPITQQKR + TV+D VSC     G  PE K+ R T R+E+EED+REYYKHHKASPLAEIEFADTRKP    TDGTAYDGGGKDVIGWLPEQ
Subjt:  TTPVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQ

Query:  LDTAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        +DTAEDSLRR TEIWK+NA+ GDPDAPQSRVLRALRG+
Subjt:  LDTAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

A0A6J1GPT4 uncharacterized protein LOC1114563882.3e-9679.24Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NW FSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD+AK+NYERDD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVL  VSC     G  P  ++NRE DRKEQ++D+REYYKHHKASPLAEIEF DTRKP    TDGTAYDGGGKD+IGWLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSL+RATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

A0A6J1JL82 uncharacterized protein LOC1114879532.1e-9781.36Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT
        M SRLTAIA K NWAFSLAQ +RLRR GLTTCRTADPSVHANDDN PAV SGEPE+SQDNLEPD AKANY  DD K+ D NGPF PPKAQYASSPRLETT
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRR-GLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETT

Query:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD
        PV QASKPITQQKRA+ TVL  VSC     G  PE ++NRE DRKEQEED+REYYKHHKASPLAEIEFADTRKP    TDGTAYDGGGKDVI WLPEQ D
Subjt:  PVGQASKPITQQKRANVTVLDGVSCF----GALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIGWLPEQLD

Query:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD
        T +DSLRRATEIWKQNAM GDPDAPQSRVLRALRG+
Subjt:  TAEDSLRRATEIWKQNAMCGDPDAPQSRVLRALRGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02700.1 unknown protein9.6e-4748.77Show/hide
Query:  MQSRLTAIAPKSNWAFSLAQLRRLRRGLTTC-RTADPSVHA-NDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLET
        MQSRL A A  +         RRL  G +T  RTADP +HA ND   PA+   +PE   D   P  A      D  + +    P  PPK+  A++ +LE+
Subjt:  MQSRLTAIAPKSNWAFSLAQLRRLRRGLTTC-RTADPSVHA-NDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLET

Query:  TPVGQASKPITQQKRANVTV----LDGVSCFG------ALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIG
        TPVG  S+P  QQKR N T     LD VSC G         EG+   +  R+++ E D+E+YKHHKASPL+EIEFADTRKP    TDGTAY   GKDVIG
Subjt:  TPVGQASKPITQQKRANVTV----LDGVSCFG------ALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKP----TDGTAYDGGGKDVIG

Query:  WLPEQLDTAEDSLRRATEIWKQNAMCGDPDA-PQSRVLRALRGD
        WLPEQLDTAE+SL +AT I+K+NA  GDP+  P SR+LR +RG+
Subjt:  WLPEQLDTAEDSLRRATEIWKQNAMCGDPDA-PQSRVLRALRGD

AT4G02140.1 unknown protein3.1e-0535.42Show/hide
Query:  QLRRLRRGLTTCRTADPSVHA-NDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEAD-LNGPFGPPKAQYASSPRLETTPVGQASKPITQQKR
        +L   R   +T RTADP +HA ND ++P++   +PE   D   P    A+ +  D++    +  P  PPK    +S +LE+TPVG  +    QQKR
Subjt:  QLRRLRRGLTTCRTADPSVHA-NDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEAD-LNGPFGPPKAQYASSPRLETTPVGQASKPITQQKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCCAGATTGACGGCGATCGCACCAAAGTCCAATTGGGCCTTCTCTCTGGCCCAATTACGACGCCTCCGGCGAGGTCTGACGACATGTCGTACAGCTGACCCTTC
CGTTCACGCCAATGACGACAATCAACCCGCCGTTTTATCCGGTGAACCCGAAAAATCACAGGATAATTTAGAGCCAGATAATGCGAAAGCGAATTACGAAAGAGATGATG
TTAAAGAGGCAGATTTAAATGGGCCGTTTGGGCCGCCAAAGGCCCAATACGCCTCCTCCCCTCGGTTAGAAACCACGCCGGTGGGTCAGGCCTCGAAGCCCATTACTCAA
CAAAAGAGAGCCAACGTGACGGTTCTCGACGGCGTCAGCTGCTTCGGCGCGTTGCCGGAGGGGAAAGAAAACAGAGAAACCGACAGGAAAGAACAGGAAGAAGATGACAG
AGAGTATTACAAGCACCACAAGGCGTCGCCGTTAGCCGAGATCGAGTTTGCGGATACGCGGAAGCCGACGGACGGGACGGCGTACGACGGCGGCGGGAAGGATGTGATTG
GGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCACTTCGGAGAGCGACGGAGATTTGGAAGCAAAACGCCATGTGTGGGGATCCCGATGCTCCACAATCGAGGGTT
CTTAGGGCTTTGCGAGGTGATTCAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAATCCAGATTGACGGCGATCGCACCAAAGTCCAATTGGGCCTTCTCTCTGGCCCAATTACGACGCCTCCGGCGAGGTCTGACGACATGTCGTACAGCTGACCCTTC
CGTTCACGCCAATGACGACAATCAACCCGCCGTTTTATCCGGTGAACCCGAAAAATCACAGGATAATTTAGAGCCAGATAATGCGAAAGCGAATTACGAAAGAGATGATG
TTAAAGAGGCAGATTTAAATGGGCCGTTTGGGCCGCCAAAGGCCCAATACGCCTCCTCCCCTCGGTTAGAAACCACGCCGGTGGGTCAGGCCTCGAAGCCCATTACTCAA
CAAAAGAGAGCCAACGTGACGGTTCTCGACGGCGTCAGCTGCTTCGGCGCGTTGCCGGAGGGGAAAGAAAACAGAGAAACCGACAGGAAAGAACAGGAAGAAGATGACAG
AGAGTATTACAAGCACCACAAGGCGTCGCCGTTAGCCGAGATCGAGTTTGCGGATACGCGGAAGCCGACGGACGGGACGGCGTACGACGGCGGCGGGAAGGATGTGATTG
GGTGGTTGCCGGAGCAGCTGGATACGGCGGAGGATTCACTTCGGAGAGCGACGGAGATTTGGAAGCAAAACGCCATGTGTGGGGATCCCGATGCTCCACAATCGAGGGTT
CTTAGGGCTTTGCGAGGTGATTCAGTTTAA
Protein sequenceShow/hide protein sequence
MQSRLTAIAPKSNWAFSLAQLRRLRRGLTTCRTADPSVHANDDNQPAVLSGEPEKSQDNLEPDNAKANYERDDVKEADLNGPFGPPKAQYASSPRLETTPVGQASKPITQ
QKRANVTVLDGVSCFGALPEGKENRETDRKEQEEDDREYYKHHKASPLAEIEFADTRKPTDGTAYDGGGKDVIGWLPEQLDTAEDSLRRATEIWKQNAMCGDPDAPQSRV
LRALRGDSV