; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001301 (gene) of Snake gourd v1 genome

Gene IDTan0001301
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationLG07:51309457..51309807
RNA-Seq ExpressionTan0001301
SyntenyTan0001301
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055407.1 uncharacterized protein E6C27_scaffold80G002530 [Cucumis melo var. makuwa]2.3e-2250Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS+     K+  DRLVE+EEQ+L+L + PD++RFLE +++ + E    +  +  R++GL I+EL  RVDTLE    R  S E GDSSTGS AHIEERV+E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        LDN+QKT++++ ND++
Subjt:  LDNTQKTMMKLFNDLT

KAA0062476.1 uncharacterized protein E6C27_scaffold130G00660 [Cucumis melo var. makuwa]6.0e-2349.14Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS TKQL K   DRLVE++EQ+ +L +  D++ FL+ +++ + E    + A+ +RLDGLPI+EL  RVDTLE K  R  ++EHGDSST   +HIEERV E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        LD++QK++M++ N ++
Subjt:  LDNTQKTMMKLFNDLT

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]1.1e-2960.34Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS TKQLSKSHVDRLVEIEEQLL+LR+ PD +R LE ++    E  G++ A+NAR+DGLPI+++ +RV+TLE KATRP SFE GDSST     IE R+ E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        L+N+   MM+LFN++T
Subjt:  LDNTQKTMMKLFNDLT

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]2.3e-3061.21Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MSATKQLSKSHVDRLVEIEEQLL+LR+ PD++R LE ++    E  G++ A+NAR+DGLPI+++ +RV+T E KATRP SFE GDSST     IE R+ E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        L+N+  TMM+LFN++T
Subjt:  LDNTQKTMMKLFNDLT

XP_022155185.1 uncharacterized protein LOC111022320 [Momordica charantia]1.5e-2654.31Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS TKQL KSH+DRLVEIEE+LLFLR+ PD +R++E ++  +      +  +NAR+DGL IREL LRV+TLEDK  R  + E G+SS+ S AH+EERVEE
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        +D ++KT++++ ++LT
Subjt:  LDNTQKTMMKLFNDLT

TrEMBL top hitse value%identityAlignment
A0A5A7ULF5 Retrotrans_gag domain-containing protein1.1e-2250Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS+     K+  DRLVE+EEQ+L+L + PD++RFLE +++ + E    +  +  R++GL I+EL  RVDTLE    R  S E GDSSTGS AHIEERV+E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        LDN+QKT++++ ND++
Subjt:  LDNTQKTMMKLFNDLT

A0A5D3DUL1 Uncharacterized protein2.9e-2349.14Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS TKQL K   DRLVE++EQ+ +L +  D++ FL+ +++ + E    + A+ +RLDGLPI+EL  RVDTLE K  R  ++EHGDSST   +HIEERV E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        LD++QK++M++ N ++
Subjt:  LDNTQKTMMKLFNDLT

A0A6J1D906 Reverse transcriptase5.5e-3060.34Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS TKQLSKSHVDRLVEIEEQLL+LR+ PD +R LE ++    E  G++ A+NAR+DGLPI+++ +RV+TLE KATRP SFE GDSST     IE R+ E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        L+N+   MM+LFN++T
Subjt:  LDNTQKTMMKLFNDLT

A0A6J1DK29 uncharacterized protein LOC1110218291.1e-3061.21Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MSATKQLSKSHVDRLVEIEEQLL+LR+ PD++R LE ++    E  G++ A+NAR+DGLPI+++ +RV+T E KATRP SFE GDSST     IE R+ E
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        L+N+  TMM+LFN++T
Subjt:  LDNTQKTMMKLFNDLT

A0A6J1DLQ6 uncharacterized protein LOC1110223207.4e-2754.31Show/hide
Query:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE
        MS TKQL KSH+DRLVEIEE+LLFLR+ PD +R++E ++  +      +  +NAR+DGL IREL LRV+TLEDK  R  + E G+SS+ S AH+EERVEE
Subjt:  MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEE

Query:  LDNTQKTMMKLFNDLT
        +D ++KT++++ ++LT
Subjt:  LDNTQKTMMKLFNDLT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCGACAAAACAACTAAGCAAGTCGCATGTCGACCGACTAGTTGAAATTGAAGAGCAGTTGCTATTCTTGAGGGACACACCTGACACGATGCGATTCCTCGAAGA
CCAAATGAAAAATGTCCAAGAGGCAGTGGGTAAAGTAAGGGCCTTGAATGCCCGACTCGATGGGTTACCAATACGAGAACTGACGTTGAGGGTTGACACCCTAGAAGATA
AAGCTACACGTCCCAGTAGCTTCGAACATGGAGATAGCTCCACCGGCTCTGCCGCACACATAGAAGAACGTGTGGAAGAGCTAGATAATACACAAAAGACCATGATGAAG
TTGTTCAACGATCTCACATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGGCGACAAAACAACTAAGCAAGTCGCATGTCGACCGACTAGTTGAAATTGAAGAGCAGTTGCTATTCTTGAGGGACACACCTGACACGATGCGATTCCTCGAAGA
CCAAATGAAAAATGTCCAAGAGGCAGTGGGTAAAGTAAGGGCCTTGAATGCCCGACTCGATGGGTTACCAATACGAGAACTGACGTTGAGGGTTGACACCCTAGAAGATA
AAGCTACACGTCCCAGTAGCTTCGAACATGGAGATAGCTCCACCGGCTCTGCCGCACACATAGAAGAACGTGTGGAAGAGCTAGATAATACACAAAAGACCATGATGAAG
TTGTTCAACGATCTCACATAG
Protein sequenceShow/hide protein sequence
MSATKQLSKSHVDRLVEIEEQLLFLRDTPDTMRFLEDQMKNVQEAVGKVRALNARLDGLPIRELTLRVDTLEDKATRPSSFEHGDSSTGSAAHIEERVEELDNTQKTMMK
LFNDLT