; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017733 (gene) of Snake gourd v1 genome

Gene IDTan0017733
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG04:10677190..10678429
RNA-Seq ExpressionTan0017733
SyntenyTan0017733
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]6.3e-1041.94Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF
        GDFN IR  S+ FG +P   EM  FD  +   DL+E  V+ NW TW++K+ GS  +LRRL RVL  ++W+ +        L W ISD++ ++F
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF

XP_008466769.1 PREDICTED: uncharacterized protein LOC103504100 [Cucumis melo]1.8e-0941.94Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF
        GDFN IR  S+ FG +P   EM  FD  +   DL+E  V+ NW TW++K+ GS  +LRRL RVL  + W+ +        L W ISD++ ++F
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF

XP_022159081.1 uncharacterized protein LOC111025522 [Momordica charantia]3.7e-1042.11Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALVFEV
        GDFN IR+SS+  G +P + ++  FD  L+Q DL+E RV  +W TW+NK      IL+ L RVL    W       E Q  +W +SD+  LVF V
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALVFEV

XP_038874927.1 uncharacterized protein LOC120067439 [Benincasa hispida]1.3e-1042.27Show/hide
Query:  SWFGHGDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALV
        SW    DFN I  SS+ FG  P++ EM  FD  L++ DL+EL +  NW TW++K+ G+  ILRRL   L  E W       E +   W ISD+N L+
Subjt:  SWFGHGDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALV

XP_038895804.1 uncharacterized protein LOC120083970 [Benincasa hispida]6.3e-1045.65Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALV
        GDFN IR SS+ F   P++ EM  FD  L++ DL+EL V  NW TW++KL G+  IL RL R L  EKW       E +   W ISD++ L+
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALV

TrEMBL top hitse value%identityAlignment
A0A1S3CRZ6 uncharacterized protein LOC1035041008.9e-1041.94Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF
        GDFN IR  S+ FG +P   EM  FD  +   DL+E  V+ NW TW++K+ GS  +LRRL RVL  + W+ +        L W ISD++ ++F
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF

A0A5A7SPE5 Reverse transcriptase domain-containing protein1.1e-0735.85Show/hide
Query:  KVECSWFGHG----DFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISD
        ++  +W   G    DFN IR  S+ F  +P   EM  F+  +   DL+E  V+ NW TW++K+ GS  +LRRL RVL  + W+ +        L W ISD
Subjt:  KVECSWFGHG----DFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISD

Query:  NNALVF
        +  ++F
Subjt:  NNALVF

A0A5A7TZS0 Reverse transcriptase domain-containing protein3.1e-1041.94Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF
        GDFN IR  S+ FG +P   EM  FD  +   DL+E  V+ NW TW++K+ GS  +LRRL RVL  ++W+ +        L W ISD++ ++F
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF

A0A5A7V275 Reverse transcriptase8.9e-1041.94Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF
        GDFN IR  S+ FG +P   EM  FD  +   DL+E  V+ NW TW++K+ GS  +LRRL RVL  + W+ +        L W ISD++ ++F
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVES------QALDWDISDNNALVF

A0A6J1E2U5 uncharacterized protein LOC1110255221.8e-1042.11Show/hide
Query:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALVFEV
        GDFN IR+SS+  G +P + ++  FD  L+Q DL+E RV  +W TW+NK      IL+ L RVL    W       E Q  +W +SD+  LVF V
Subjt:  GDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKW------VESQALDWDISDNNALVFEV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCTAATACTGAGCCTGAAAGGAAAAGTTGAATGCTCTTGGTTTGGTCATGGGGATTTTAATACTATTCGGTATTCTTCGAAAGGGTTTGGCGATAATCCTAATAT
TGTTGAGATGGTTGTGTTTGATAGTACTCTGGTGCAGATTGATCTTCTGGAGTTACGTGTTGAGAGTAATTGGGTTACTTGGAGTAATAAACTCTCGGGATCTACATATA
TTCTTAGGCGTCTGGCTCGTGTCTTGGATAAGGAAAAATGGGTGGAGTCTCAAGCCCTTGATTGGGATATTTCTGATAATAACGCTTTGGTTTTTGAGGTTAGGTTGAGT
AAGAAACTGAGCATTTGGAGTAAAACTGGAGAACTATGCGGACAAAATTGA
mRNA sequenceShow/hide mRNA sequence
GCAATCACCAATACCTCGCCAACCAGATGTTTATTTATGGTATTCTGACTGATATTTTCTCCCAAGTTAAGATTGAAATGATGGTTGTGTATGCAGCTAATACTGAGCCT
GAAAGGAAAAGTTGAATGCTCTTGGTTTGGTCATGGGGATTTTAATACTATTCGGTATTCTTCGAAAGGGTTTGGCGATAATCCTAATATTGTTGAGATGGTTGTGTTTG
ATAGTACTCTGGTGCAGATTGATCTTCTGGAGTTACGTGTTGAGAGTAATTGGGTTACTTGGAGTAATAAACTCTCGGGATCTACATATATTCTTAGGCGTCTGGCTCGT
GTCTTGGATAAGGAAAAATGGGTGGAGTCTCAAGCCCTTGATTGGGATATTTCTGATAATAACGCTTTGGTTTTTGAGGTTAGGTTGAGTAAGAAACTGAGCATTTGGAG
TAAAACTGGAGAACTATGCGGACAAAATTGAGTAACTTAAAACTATATTTGAGGTTCCAAGAGTTGAAGAGAAAGGACAAAGCTACTGGACCCGAGGGCATAACGTTGTA
ATGCTCCTTACAGTATTGCAGTGTCATTTGAGGGGTCTTTTAACCTTAAAAATCTACAAACGTAGCGTTTATATCCATTGATTTTTTTTCCTTTTTTATAAAATTCTTGA
GACAACATTACAACGTTGTTTCGACTCTATTTAAAGAAACTTTGTGTATTCAATTGGGGGAGTCATTTTTTACGACACTTTGGAGAACTAAGGAGGAGACTTGAGGCTCA
AGATCGTGGAACATCACGTCGGAAACGAGGGTAGGCACACATCGTCCAATTTTTCTCTTTCCCTTTATTTTGTTCCGG
Protein sequenceShow/hide protein sequence
MQLILSLKGKVECSWFGHGDFNTIRYSSKGFGDNPNIVEMVVFDSTLVQIDLLELRVESNWVTWSNKLSGSTYILRRLARVLDKEKWVESQALDWDISDNNALVFEVRLS
KKLSIWSKTGELCGQN