; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021642 (gene) of Snake gourd v1 genome

Gene IDTan0021642
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGATA zinc finger domain-containing protein 8-like isoform X2
Genome locationLG04:1820036..1823685
RNA-Seq ExpressionTan0021642
SyntenyTan0021642
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600026.1 hypothetical protein SDJN03_05259, partial [Cucurbita argyrosperma subsp. sororia]2.9e-5582.99Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LVVERE+  DVG GKVGNPDNASVNGMN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTT
        GGQENYSPRTK+SESHHNFKKEE DNDAIGSNSNSASRGNW+  + T
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTT

KAG7030695.1 hypothetical protein SDJN02_04732 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-5786.11Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LVVERE+  DVG GKVGNPDNASVNGMN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        GGQENYSPRTK+SESHHNFKKEE DNDAIGSNSNSASRGNWWKG
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

XP_022996680.1 uncharacterized protein LOC111491851 isoform X1 [Cucurbita maxima]2.9e-5584.03Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LVVERE+  DVG GKVGNPD ASVNGMN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        GGQENYSP TK+SESHHN KKEE DNDAIGSNSNSASRGNWWKG
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

XP_023546730.1 uncharacterized protein LOC111805742 [Cucurbita pepo subsp. pepo]1.5e-5685.42Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LVVERE+  DVG GKVGNPDNASVN MN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        GGQENYSPRTK+SESHHNFKKEE DNDAIGSNSNSASRGNWWKG
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

XP_038892195.1 uncharacterized protein LOC120081420 isoform X1 [Benincasa hispida]4.5e-6480Show/hide
Query:  MENKK-QVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVG----TGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFS
        MENKK QVATSSSSSLDH+FG +DS SASSTSTTGYFGSIFPP VVER RK DVG    +G++GNPDNAS+NG   T GANGKDESS+YQNE+ME  YFS
Subjt:  MENKK-QVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVG----TGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFS

Query:  SSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTTSMDLLLDKEEMMMVKTNF
        SSIFYGGQENYSPRT  S+SHHNFKKE KDNDA  SNSNS SRGNWWKGMTTSMDLL DKEE++MVKTNF
Subjt:  SSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTTSMDLLLDKEEMMMVKTNF

TrEMBL top hitse value%identityAlignment
A0A0A0KNZ9 Uncharacterized protein1.9e-5279.05Show/hide
Query:  MENKK-QVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGT----GKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFS
        MENKK QVATSSSSSLDH+FG MDS SASSTSTTGYFGS+FPP VVERERK DVG      +VGNPDNA++NG   TGGA+GKDESSMYQNE+ME  YFS
Subjt:  MENKK-QVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGT----GKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFS

Query:  SSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWK
        SSIFYGGQENYSPRT  S+SH NFKKE KDNDA  SNSN ASRGNWWK
Subjt:  SSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWK

A0A5A7U8T9 TPRXL protein1.7e-5378.81Show/hide
Query:  MENKK-QVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVG----TGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFS
        ME+KK QVATSSSSSLDH+FG MDS SASSTSTTGYFGS+FPP VVERERK DVG    T +VGNPDNA++NG   TG A+GKDESSMYQNE+ME  YFS
Subjt:  MENKK-QVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVG----TGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFS

Query:  SSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMT
        SSIFYGGQENYSPRT  S+SH NFKKE KDNDA  SNSN ASRGNWWKGMT
Subjt:  SSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMT

A0A6J1FNY5 uncharacterized protein LOC1114471801.6e-5482.31Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LVVERE+  DVG GKVGNP NASVNGMN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTT
        GGQENYSPRTK+SESHHNFKKEE DNDAIGSNSNSASRGNW+  + T
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTT

A0A6J1K7H1 uncharacterized protein LOC111491851 isoform X11.4e-5584.03Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LVVERE+  DVG GKVGNPD ASVNGMN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        GGQENYSP TK+SESHHN KKEE DNDAIGSNSNSASRGNWWKG
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

A0A6J1KBQ3 uncharacterized protein LOC111491851 isoform X23.1e-5583.33Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        MENKKQVA SSSS+ D LFG MDS SASSTSTTGYFGSIFP LV+ERE+  DVG GKVGNPD ASVNGMN TGGANGK ESS+Y NE+ME  YFSSSIFY
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        GGQENYSP TK+SESHHN KKEE DNDAIGSNSNSASRGNWWKG
Subjt:  GGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39855.1 unknown protein9.8e-0944.83Show/hide
Query:  MENKKQV---ATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPP--LVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELG-YF
        M+ KK V   ++SSSSSLDH+FG   S S SS STTG F SIFPP   V +       G  K   P N      NE G  +   E   YQ+E  +     
Subjt:  MENKKQV---ATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPP--LVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELG-YF

Query:  SSSIFYGGQENYSPRT
        SSSI+YGGQ+NYS  T
Subjt:  SSSIFYGGQENYSPRT

AT2G39855.2 unknown protein1.0e-1343.33Show/hide
Query:  MENKKQV---ATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPP--LVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELG-YF
        M+ KK V   ++SSSSSLDH+FG   S S SS STTG F SIFPP   V +       G  K   P N      NE G  +   E   YQ+E  +     
Subjt:  MENKKQV---ATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPP--LVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELG-YF

Query:  SSSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        SSSI+YGGQ+NYS  T   ++   +KK+ ++ D     S SASRGNWW+G
Subjt:  SSSIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

AT3G55646.1 unknown protein4.5e-1438.51Show/hide
Query:  ENKKQVATSSS-----SSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSS
        +NKK++ ++SS     SS DH+FG   S S+SS+S TG F SIFPP   ++  +      + G+    S N   E   +N K++ S Y  E+    + SS
Subjt:  ENKKQVATSSS-----SSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSS

Query:  SIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        S++YGGQE YS  T  + +H  +KK+ ++ D     S  ASRGNWW+G
Subjt:  SIFYGGQENYSPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

AT5G02020.1 Encodes a protein involved in salt tolerance, names SIS (Salt Induced Serine rich).4.8e-0835.71Show/hide
Query:  MENKKQVAT-----SSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPP--LVVERER-KHDVGTGKVGNPDNASVNGMNETGGANGKDESSMY-QNESMEL
        ME +K+ A+     SSSS    LFG+ +  + SS S++G  GSIFPP   V+ RE  + +  TG   N   +   G  +      ++  S Y Q++ ++ 
Subjt:  MENKKQVAT-----SSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPP--LVVERER-KHDVGTGKVGNPDNASVNGMNETGGANGKDESSMY-QNESMEL

Query:  GYFSSSIFYGGQENY-SPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
         + SSSI+YGG + Y  P+   S S +  KK+  ++D     S SASRGNWW+G
Subjt:  GYFSSSIFYGGQENY-SPRTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG

AT5G05965.1 unknown protein1.2e-1139.86Show/hide
Query:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY
        ++  K  ++S SSS D+LFG   S SASS ST     SIFPP V  ++  H     + GN   A+             DE S   +E  E  Y+SSSI+Y
Subjt:  MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFY

Query:  GGQENYSP----RTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG
        GGQ++YSP     +  S SH    KE  D   I   + S SRGNWWKG
Subjt:  GGQENYSP----RTKVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACAAGAAGCAAGTGGCTACCTCTTCATCTTCAAGTTTGGATCATCTTTTTGGTACCATGGACTCAAAATCAGCTTCTTCAACATCCACCACTGGATATTTTGG
CTCTATTTTCCCACCGCTGGTGGTGGAGAGAGAGAGGAAACATGATGTTGGCACTGGCAAAGTTGGGAATCCAGATAATGCTTCTGTTAATGGCATGAATGAAACTGGAG
GTGCAAATGGGAAAGATGAAAGCTCCATGTATCAGAATGAATCTATGGAACTAGGTTACTTTAGTTCCTCCATCTTTTATGGTGGCCAAGAAAATTACTCCCCAAGAACC
AAAGTCTCAGAATCTCACCATAATTTTAAGAAAGAAGAGAAAGACAATGATGCAATAGGAAGCAATTCAAACTCTGCTTCAAGAGGAAACTGGTGGAAGGGTATGACAAC
TTCCATGGATTTACTGTTGGACAAAGAAGAGATGATGATGGTTAAGACAAATTTTTAA
mRNA sequenceShow/hide mRNA sequence
CTTCAATTAAAGAATAAAAAAAATGTTGTTACCAACTATTCTTTAGGGAACGCTCTTGAATAAGATAAAATATCCTTTCCTAAATAATTCTCCTGGATTTGTTCCACAGT
ATCTTTGTTCAGTCATCATTCTTGCCTTTAAAACTTGGAATCTTATCCCTAAATCTGAATCCCTTCATCATGTTCATCCGATCAAAAAAACGCCACCCAATCCTCTCCCA
ATTTGGCACAACTCTCCCCAATCACAAAACTCATAGATACTCAAAACAATAATTCATAGCGGGTGGCAAATTCCTCCACAAAACCATATATTCCACTTCCCAATTCTCAG
AATTTTTTCTTCCATGGATATTAGAAGAAGATGCAATGGGGAGTGTATTTGATTGATTGAAGACTTTAGCATATCAGAAGAAATGGGAATCCTTCAAGTTGCCCTTAACC
CCCTTATCCAACGAGACCACCAAAACCCTTTTTCCCTCAAGGATCAATAAGGCACAGTTCGTGAGGACGGGGGAAAAAGAGGGATTTTGTGAACCCAAAAGAAGGATTTG
GATGGGACAATGAAACATGCACATCAGACAGAGAAAAAACACAGGTCAAAGAGATTGTTATGGGACATGTGGCGGATTGAATTCAGGGAATTGCAGAAGGACCACTTGGA
ACCATGTTAGAATTGAATGTGGGCACATGGAGCTGAAGAATCATCGATTTATACTGTCCTTTTTGGGGCTTATAAATCGGATTGAACTGTTTCAGGTCTTGCGTCTGAAA
TCTTCACGTGGGTCTGCTTAATCTCTCTGTTTTTTGATGCATATTATCTGTTTGTTTGTTTGTCTGACTGGAAAATCTTGTCAAGTTCCCTGTTTTTTTTGCTTTTTGTG
TATATGGGTGTGGGCCGTTAAGAGATAACTATCAGGTTTGGGGAGTAGTTTTGTTCGTTGGCTGACAGAGAAAGCTCCAAGTGTTAGTACTATGGAGAACAAGAAGCAAG
TGGCTACCTCTTCATCTTCAAGTTTGGATCATCTTTTTGGTACCATGGACTCAAAATCAGCTTCTTCAACATCCACCACTGGATATTTTGGCTCTATTTTCCCACCGCTG
GTGGTGGAGAGAGAGAGGAAACATGATGTTGGCACTGGCAAAGTTGGGAATCCAGATAATGCTTCTGTTAATGGCATGAATGAAACTGGAGGTGCAAATGGGAAAGATGA
AAGCTCCATGTATCAGAATGAATCTATGGAACTAGGTTACTTTAGTTCCTCCATCTTTTATGGTGGCCAAGAAAATTACTCCCCAAGAACCAAAGTCTCAGAATCTCACC
ATAATTTTAAGAAAGAAGAGAAAGACAATGATGCAATAGGAAGCAATTCAAACTCTGCTTCAAGAGGAAACTGGTGGAAGGGTATGACAACTTCCATGGATTTACTGTTG
GACAAAGAAGAGATGATGATGGTTAAGACAAATTTTTAACTTAGCAATCTGTTTTTACAGGTTCTCTGTATTACTAACACTCCCATCGTAATGGGATTTCCCACCTCCAT
AGGATTTGGTGACAGTATATTCTACATTATATTATTATTTACAAGTGAAACTACCATAGAAATAAAGGAGTAGAGTTTGAAGAGGTCCACAAGGCTCTGGTCAAAGAATG
AAGCTTCTATGGTTCATATTGTTCCATGTTAGTCAATTTTGACAAGTTGTGTGTTTCAAAGCTGAAGAGTGTTGATGTAGCTTTCTTCTGCAAGTGGAACTCTTTTCAAT
ATCTCCATCTTAATAGATCCTTATTATCTGTTATTGTAATAACTATGTTCACACAGTATATGATGAGTCTAGTTCGTTTAGTATATTCCGTGCATCCGAAAGGTAAAACG
TCTACTTTCTTCTTTTATTCGTGCTGCTCCTGGTTGTGATTCGATGCTGATCTCTAAAGTCCATGAACGCAAAAAGCGCTTGAGAGAGCAACAAGAAGAAAGATATGTTA
GGTATGTGTATTGGCTTCACATGGAAAACAGGGAGAACTACAAAATGGAAACGGTTCATCAAAGTTGAAAGAGAGCAATGCTAAGAACAACCAAATGAGCAGCTACTGCA
AGAGTACTTAAACCACATAATGGTGCAAGATTCGATAATGCACGAAGAGTTTCGTCAACGACTGCAAAGACGATAACACCTCCACATTGGTAACTAACGGGACGATCCTC
AATGTAATACCGTTCTGGTATCGATGCTGCAAGTTCTGGTTTAATACCGTCCTGCTTGACAGTATCAGAAAGAAGAATTGCACAAAGACAAGTCATTTTTTTGAACCATT
TCAACAGCCAAACTTACAACATTCAACCGAGAATTGAACACTCAAATCCTTCGCTGCAGGGAGCATGATTCAAGCCTCCGTTGCTTTTCAAGTGTCGTGTGATTCCCACC
CATGATAAGTGGTCAAAATTTTAAAGAAAAGAAAGAGACAAAAAAAAAACTCGCTCTCGCTTTAACTCTTTTATGATTTAATTTTTTTTTGTCTCTTTTTTTTTTCCTTT
AAATTTTCGGCCACTCATTTGAATGAGAGCAAGAGAGCTAGAGTGATAGCATTTTCACGTACGAGAACTTCTAACTCTTCTCTGACTTAATTTTTTTTCTTGTACTTTTT
TCTTTAAATTTTCGATCACTCATTTGAGTGAGAGCGAGTGGGC
Protein sequenceShow/hide protein sequence
MENKKQVATSSSSSLDHLFGTMDSKSASSTSTTGYFGSIFPPLVVERERKHDVGTGKVGNPDNASVNGMNETGGANGKDESSMYQNESMELGYFSSSIFYGGQENYSPRT
KVSESHHNFKKEEKDNDAIGSNSNSASRGNWWKGMTTSMDLLLDKEEMMMVKTNF