CuGenDBv2

Tan0021289 (gene) of Snake gourd v1 genome

Gene ID: Tan0021289
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Retrotransposon protein
Genome location: LG11:19119851..19120370
RNA-Seq Expression: Tan0021289
Synteny: Tan0021289
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain


Homology
GenBank top hits (e value, % identity, alignment)
XP_038887234.1 uncharacterized protein LOC120077425 [Benincasa hispida]; e value: 1.4e-45; % identity: 64.75
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE
        MTG SK SKH W+KVEDARLVE+L+ LV+ GW+SDN TFRPGYLQHL+++L EK+   +L +NTI CKVR+LKK+YN V E L    SGF WNE FKCV+
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE

Query:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRA
         E+++FD WV+S+ N KGM  KPF HYDDL+ VFGKDRA
Subjt:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRA

XP_038889264.1 uncharacterized protein At2g29880-like [Benincasa hispida]; e value: 1.5e-42; % identity: 57.93
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE
        M    K +KH WTKVEDARLVESL+ LV+ GWQ+D ETF+PGYLQ LQ +  EK+ + S+  +TI CKVR LK++Y  +VE L N  +GF WN+ FKCV+
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE

Query:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATGIGRR
         EK+VFD WV S+ N KG+R+KPF H D+L+ VFGKDRAT  G +
Subjt:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATGIGRR

XP_038892629.1 uncharacterized protein At2g29880-like [Benincasa hispida]; e value: 4.3e-42; % identity: 60.28
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE
        M G  K SKH W+KVEDA+LVE+L+ LV+ GW+ DN TFRPGYLQHL+++L EK+   +L  NTI CKVR+LKK+YN V E L    SG  WNE FKCV 
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE

Query:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
         E+++FD WV S+ N K M NKPF HYDDL+ +FGKDRA G
Subjt:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG

XP_038896380.1 uncharacterized protein LOC120084641 [Benincasa hispida]; e value: 3.0e-43; % identity: 62.59
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE
        M G+ K SKH W+KVED +LVE+L+ LV+ GW+SDN TFR GYLQ+L+++L EK+   +L QNTI CKVR+LKK+YN V E L    SGF WNE FKCV+
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE

Query:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRA
         EK++FD WV+S+ N KGM NK FLHYDDL+ VFGKDRA
Subjt:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRA

XP_038902479.1 uncharacterized protein At2g29880-like [Benincasa hispida]; e value: 3.5e-44; % identity: 60.27
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE
        MT   K SKH W+KVEDA+LVE+L+ LV+ GW+SDN TFRPGYLQHL+++L EK+   +L QNTI CKVR+LKK+YN+V E L    SGF WNE FKCV+
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE

Query:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATGIGRRD
         E+++FD WV S+ N K M NKPF HYDD + VFGKDR  G    D
Subjt:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATGIGRRD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4L3 uncharacterized protein LOC103485953; e value: 2.2e-36; % identity: 53.15
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQN-GWQSDNETFRPGYLQHLQKMLAEKLANSSL-EQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKC
        M   S+  KHTWTK E+ + VE LV LV + GW+SDN TF+PGYL  LQ+M+AEKL  +++ E +TI+C V++LKK Y+ + E  G   SGF WNE F+C
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQN-GWQSDNETFRPGYLQHLQKMLAEKLANSSL-EQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKC

Query:  VEAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
        + AE+ +FD+W+KS+   KG+ +K F +YDDL++VFGKDRATG
Subjt:  VEAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG

A0A5A7U0H7 Retrotransposon protein; e value: 2.2e-36; % identity: 53.15
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQN-GWQSDNETFRPGYLQHLQKMLAEKLANSSL-EQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKC
        M   S+  KHTWTK E+ + VE LV LV + GW+SDN TF+PGYL  LQ+M+AEKL  +++ E +TI+C V++LKK Y+ + E  G   SGF WNE F+C
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQN-GWQSDNETFRPGYLQHLQKMLAEKLANSSL-EQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKC

Query:  VEAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
        + AE+ +FD+W+KS+   KG+ +K F +YDDL++VFGKDRATG
Subjt:  VEAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG

A0A5A7VFF1 Retrotransposon protein; e value: 3.2e-35; % identity: 52.11
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQ-NGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCV
        M  +S+  KH WTK E+A LVE L+ LV   GW+SDNETFRPGYL  L +M+A K+  S++  +TI+ +++ LK+ ++ +V   G   SGF WN+  KC+
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQ-NGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCV

Query:  EAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
         AEK+V D WVKS+T  KG+ NK F HYD+L++VFGKDRATG
Subjt:  EAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG

A0A5D3CM34 Retrotransposon protein; e value: 2.5e-35; % identity: 51.75
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQ-NGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNT-INCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKC
        M+ +++  +H WTK E+   VE L+ LV   GW+SDN TFRPGYL  L +M+AEKL    +   T I+C+++TLK+ + V+VE  G  YSGF WN+  KC
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQ-NGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNT-INCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKC

Query:  VEAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
        + AEK++FD WV+SY   KG+ NKPF +YD+LT+VFG+DRATG
Subjt:  VEAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG

A0A5D3D513 Retrotransposon protein; e value: 3.2e-35; % identity: 52.11
Query:  MTGTSKNSKHTWTKVEDARLVESLVSLVQ-NGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCV
        M  +S+  KH WTK E+A LVE L+ LV   GW+SDNETFRPGYL  L +M+A K+  S++  +TI+ +++ LK+ ++ +V   G   SGF WN+  KC+
Subjt:  MTGTSKNSKHTWTKVEDARLVESLVSLVQ-NGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCV

Query:  EAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
         AEK+V D WVKS+T  KG+ NK F HYD+L++VFGKDRATG
Subjt:  EAEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G30140.1 unknown protein; e value: 2.0e-05; % identity: 26.24
Query:  GTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTIN--CKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE
        G  K   + WT  E     + L+ L++  W+  +     G L    K+L          +N  N   +++ LK  Y   ++ L    SGF W+   K   
Subjt:  GTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTIN--CKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVE

Query:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
        A  +V+  ++K++ N K M+ +   H++DL  +FG   ATG
Subjt:  AEKKVFDAWVKSYTNTKGMRNKPFLHYDDLTFVFGKDRATG
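
The % identity values listed with each hit can be related to the alignment text itself. The following is a minimal sketch, assuming the usual BLAST midline convention in which a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_identity` is hypothetical and not part of CuGenDB; the midline must keep its original leading spaces so that it stays column-aligned with the query.

```python
def midline_identity(query_blocks, midline_blocks):
    """Percent identity estimated from BLAST-style alignment text.

    Letters in the midline mark identical residues, '+' marks a
    conservative substitution, and a space marks a mismatch or gap.
    """
    # Alignment length is the total number of columns in the query rows.
    aligned_length = sum(len(q) for q in query_blocks)
    # Count only letters in the midline; '+' and spaces are not identities.
    identical = sum(ch.isalpha() for m in midline_blocks for ch in m)
    return round(100.0 * identical / aligned_length, 2)


# First ten alignment columns of the XP_038887234.1 hit, copied from above.
query = ["MTGTSKNSKH"]
midline = ["MTG SK SKH"]
print(midline_identity(query, midline))  # 8 identical of 10 columns -> 80.0
```

Passing every Query and midline block of a hit, rather than the ten-column excerpt used here, gives the figure for the whole alignment.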


Sequences
CDS sequence
ATGACAGGTACTTCGAAAAACTCCAAACATACGTGGACGAAGGTCGAGGATGCAAGGTTGGTAGAGTCATTGGTATCTTTAGTACAAAATGGGTGGCAATCAGACAACGA
GACCTTCAGGCCTGGATATTTGCAACATCTCCAAAAGATGCTAGCGGAGAAATTGGCTAATTCATCATTAGAACAAAACACTATAAATTGTAAGGTGAGAACTCTGAAAA
AGAAATACAATGTTGTTGTAGAGAAGCTTGGTAATGTTTATAGTGGATTTAGATGGAATGAGGTATTTAAGTGTGTTGAGGCAGAGAAGAAGGTATTTGATGCATGGGTT
AAAAGCTATACAAACACAAAGGGGATGAGGAACAAACCATTTCTACACTATGATGATCTGACATTTGTCTTCGGAAAAGATAGAGCTACAGGAATTGGACGTAGAGACCT
CAGTTGA
mRNA sequence
ATGACAGGTACTTCGAAAAACTCCAAACATACGTGGACGAAGGTCGAGGATGCAAGGTTGGTAGAGTCATTGGTATCTTTAGTACAAAATGGGTGGCAATCAGACAACGA
GACCTTCAGGCCTGGATATTTGCAACATCTCCAAAAGATGCTAGCGGAGAAATTGGCTAATTCATCATTAGAACAAAACACTATAAATTGTAAGGTGAGAACTCTGAAAA
AGAAATACAATGTTGTTGTAGAGAAGCTTGGTAATGTTTATAGTGGATTTAGATGGAATGAGGTATTTAAGTGTGTTGAGGCAGAGAAGAAGGTATTTGATGCATGGGTT
AAAAGCTATACAAACACAAAGGGGATGAGGAACAAACCATTTCTACACTATGATGATCTGACATTTGTCTTCGGAAAAGATAGAGCTACAGGAATTGGACGTAGAGACCT
CAGTTGA
Protein sequence
MTGTSKNSKHTWTKVEDARLVESLVSLVQNGWQSDNETFRPGYLQHLQKMLAEKLANSSLEQNTINCKVRTLKKKYNVVVEKLGNVYSGFRWNEVFKCVEAEKKVFDAWV
KSYTNTKGMRNKPFLHYDDLTFVFGKDRATGIGRRDLS
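
The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 24 and last 12 nucleotides of the CDS as copied from this record. The helper name `translate` is hypothetical.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Walk the sequence codon by codon; ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Start and end of the Tan0021289 CDS, copied from the record above.
print(translate("ATGACAGGTACTTCGAAAAACTCC"))  # MTGTSKNS
print(translate("GACCTCAGTTGA"))              # DLS
```

Both fragments agree with the start (MTGTSKNS) and end (DLS) of the listed protein sequence. Joining the five CDS lines into one string and passing it to `translate` extends the check to the full sequence.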