; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019959 (gene) of Snake gourd v1 genome

Gene IDTan0019959
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationLG06:51265546..51267689
RNA-Seq ExpressionTan0019959
SyntenyTan0019959
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025406.1 reverse transcriptase [Cucumis melo var. makuwa]7.7e-3580.65Show/hide
Query:  MCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGYVRKHES
        MCKG CL   P SND LP  FKSLLQEFNDMFP EDA T LP LRGIEHQIDFIPGAT+PNMAAYRTNP ETKEIQRQVEELMDKGY+R+  S
Subjt:  MCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGYVRKHES

KAA0048154.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa]1.1e-3370.37Show/hide
Query:  EVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDK
        EVRN+   +    VLMCKG CL     SND LP  FKSLLQ+FND FP EDA T LP LR IEHQIDFIP AT+PNMAAYRTNP ETKEIQRQVEELMDK
Subjt:  EVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDK

Query:  GYVRKHES
        GY+R+  S
Subjt:  GYVRKHES

KAA0062085.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]8.5e-3483.15Show/hide
Query:  VLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGY
        VLMCKG CL   P SND LP  FKSLLQEFNDMFP EDA T LP LRGIEHQIDFIPGAT+PNMAAYRTNP ETKEIQRQVEELM+KGY
Subjt:  VLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGY

XP_023520950.1 uncharacterized protein LOC111784506, partial [Cucurbita pepo subsp. pepo]8.2e-3772.32Show/hide
Query:  MRKREVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEE
        +R  ++RN +L Q+   VLMCKGMCL   PK++D LP  FKSLLQEFND+FP EDA T LP LRGIEHQIDF+PGAT+PNMAAYRTNP ETKEIQRQVEE
Subjt:  MRKREVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEE

Query:  LMDKGYVRKHES
        LMDKGYVR+  S
Subjt:  LMDKGYVRKHES

XP_038880394.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120072043 [Benincasa hispida]2.6e-3571.43Show/hide
Query:  MRKREVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEE
        MRK E RN    +    VLMCKGMCL + P SND LP  FKSLLQEFNDMFP ED  T LP LRGIEH+IDFIPGA +PNMAAYRTNPIE KEIQR VEE
Subjt:  MRKREVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEE

Query:  LMDKGYVRKHES
        LM KGYVR+  S
Subjt:  LMDKGYVRKHES

TrEMBL top hitse value%identityAlignment
A0A1U8IBS7 uncharacterized protein LOC1078926868.3e-2757.81Show/hide
Query:  REVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMD
        RE+R  L + Q   +LM K  CL     +N  LP    SLLQEF D+FP+E     LP L GIEHQIDFIPGATIPN  AYRTNP ETKE+QRQV ELMD
Subjt:  REVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMD

Query:  KG-YVRKHESLLGSGDFGTQEGWHMENV
        KG Y RK ES+  SG   T+E W +E+V
Subjt:  KG-YVRKHESLLGSGDFGTQEGWHMENV

A0A5A7SHU3 Reverse transcriptase3.7e-3580.65Show/hide
Query:  MCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGYVRKHES
        MCKG CL   P SND LP  FKSLLQEFNDMFP EDA T LP LRGIEHQIDFIPGAT+PNMAAYRTNP ETKEIQRQVEELMDKGY+R+  S
Subjt:  MCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGYVRKHES

A0A5A7TYP0 Retrovirus-related Pol polyprotein from transposon 17.65.4e-3470.37Show/hide
Query:  EVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDK
        EVRN+   +    VLMCKG CL     SND LP  FKSLLQ+FND FP EDA T LP LR IEHQIDFIP AT+PNMAAYRTNP ETKEIQRQVEELMDK
Subjt:  EVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDK

Query:  GYVRKHES
        GY+R+  S
Subjt:  GYVRKHES

A0A5A7V857 Transposon Ty3-I Gag-Pol polyprotein4.1e-3483.15Show/hide
Query:  VLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGY
        VLMCKG CL   P SND LP  FKSLLQEFNDMFP EDA T LP LRGIEHQIDFIPGAT+PNMAAYRTNP ETKEIQRQVEELM+KGY
Subjt:  VLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGY

A0A5D3CEW5 Transposon Ty3-I Gag-Pol polyprotein1.5e-3172.28Show/hide
Query:  MCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKE--------IQRQVEELMDKGYVRKHE
        MCKG CL   P SND LP  FKSLLQEFNDMFP EDA   LP LRGIEHQIDFIPGAT+PNM AYRTNP ETKE        IQRQVEELMDKGY+R+  
Subjt:  MCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKE--------IQRQVEELMDKGYVRKHE

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAGAGGGAAGTAAGGAATGTGTTGTTGACACAACAACAAGCATGTGTACTTATGTGCAAAGGAATGTGTTTAACAACTAACCCAAAATCTAATGATGTTTTGCC
ACATTTTTTTAAGTCTCTTTTGCAGGAATTTAATGATATGTTTCCACGTGAAGATGCACATACTTGTTTACCTGCTTTAAGAGGGATTGAACATCAAATTGATTTCATAC
CCGGTGCAACTATTCCAAATATGGCAGCTTATAGGACCAATCCAATCGAGACTAAGGAGATTCAAAGGCAAGTGGAAGAACTCATGGATAAAGGTTATGTTAGAAAGCAT
GAGTCCTTGCTCGGTTCCGGTGATTTTGGTACCCAAGAAGGATGGCACATGGAGAATGTGTGTTGA
mRNA sequenceShow/hide mRNA sequence
GAAGATTTCGAGGTAGGATCGAGCTCCCGGTGCCTGATAACCTGCCATAGTCTGCTAGAAGCTTCACAAGTATAACATGTTGTACGTGGGTGGAGATGTATAGGAAGTCT
ATATGTATTGTAGAAGGACTATAAGGTTATGTTTATGTGTACATTTTAGGTTGTGGTTGTGTGATCAATGGTTGGGTATGTTTTTGGTTGTGATGTTCTTATCTCTTTCA
GTTTTCCAGAGAGTCATAAGTAGGGTTACCCCTTACGTAGGTTACGTAATTGCCTATTTTCCGCTGTGTTATGTTTAAGTGTTTCTTATACAAGTTTAGTATGCTTATAA
GGTAAAATAGGGTCAACAGGTATCGTTGGAGAGGTGAACGATGTCTGTTGGCTTCACGCCGTCTACCGGGCTAAGTTAGCAGGTAGTTCGGGAGGGGGTGTGACATCTCA
AACATTGTGCAATCAAGTTTTAAATTCCTAGGTCAAAAGGTGAGTGAAAACCAACAGAGAAAAAAGAGTGATTTTGAGGAAAAAAAATAATGTGAGTTTGGTTATGAGAA
AGAGGGAAGTAAGGAATGTGTTGTTGACACAACAACAAGCATGTGTACTTATGTGCAAAGGAATGTGTTTAACAACTAACCCAAAATCTAATGATGTTTTGCCACATTTT
TTTAAGTCTCTTTTGCAGGAATTTAATGATATGTTTCCACGTGAAGATGCACATACTTGTTTACCTGCTTTAAGAGGGATTGAACATCAAATTGATTTCATACCCGGTGC
AACTATTCCAAATATGGCAGCTTATAGGACCAATCCAATCGAGACTAAGGAGATTCAAAGGCAAGTGGAAGAACTCATGGATAAAGGTTATGTTAGAAAGCATGAGTCCT
TGCTCGGTTCCGGTGATTTTGGTACCCAAGAAGGATGGCACATGGAGAATGTGTGTTGATTGTCGAATCATCAACAAGATAACGGTAAAGTATCGACATCCCATTCTTAG
AGTGGATGACATGCTAGATGAATGACATGGTGGCAATCTGTTTTCAAAAATAGATCTTAAAAGTGGTTATCATCAAATTCGAATGCATGTGGGAGATGAGTGGAAAACGA
CTTTCAAAACTAAATTTGG
Protein sequenceShow/hide protein sequence
MRKREVRNVLLTQQQACVLMCKGMCLTTNPKSNDVLPHFFKSLLQEFNDMFPREDAHTCLPALRGIEHQIDFIPGATIPNMAAYRTNPIETKEIQRQVEELMDKGYVRKH
ESLLGSGDFGTQEGWHMENVC