CuGenDBv2

Tan0022667 (gene) of Snake gourd v1 genome

Gene ID: Tan0022667
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Reverse transcriptase
Genome location: LG03:16869194..16872567
RNA-Seq Expression: Tan0022667
Synteny: Tan0022667
Gene Ontology terms: GO:0006259 - DNA metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits: e value, % identity, alignment
KAA0035808.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value: 4.5e-09; % identity: 50.67
Query:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        +L E     ++S +K +S VE + G ST S FRGREQRRFTPG+N+SSRQDFKNR+ GQ SR ++    ++ +S+
Subjt:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

KAA0042349.1 uncharacterized protein E6C27_scaffold795G00510 [Cucumis melo var. makuwa]; e value: 4.5e-09; % identity: 39.71
Query:  LSTRSSRSLIFWST--GIVREVNDVCWLYAVCRAK------LAGGSGGVLEVVQ--IELAEYG----SSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRR
        L+ RS    + W T  GI +   D  +    C AK      L  GS  V E  +   EL+ Y     +S+  S ++ +S VE + G STTS F G EQRR
Subjt:  LSTRSSRSLIFWST--GIVREVNDVCWLYAVCRAK------LAGGSGGVLEVVQ--IELAEYG----SSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRR

Query:  FTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        FTPG+N+SSRQDFKNR+ GQ+SR ++    ++ +S+
Subjt:  FTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

KAA0045363.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value: 4.5e-09; % identity: 50.67
Query:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        +L E     ++S +K +S VE + G ST S FRGREQRRFTPG+N+SSRQDFKNR+ GQ SR ++    ++ +S+
Subjt:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

KAA0053247.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]; e value: 4.1e-10; % identity: 52.7
Query:  LAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        +  Y S + +S +K +SVVE + G ST S FRGREQRRFTPG+N+SSRQDFKNR+ G+ SR M+    ++ +S+
Subjt:  LAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

TYK00845.1 zf-CCHC domain-containing protein [Cucumis melo var. makuwa]; e value: 2.4e-10; % identity: 32.74
Query:  IVEIELPVPDVLPLSTRSSRSLIFWSTGIVREVNDVCWLYAVCRAKLAGGSGGVL----------EVVQIE----------LAEYGSSKDKSKSKA----
        + EIELPVPD L  S  SS S    S GIVR  +DVCWL+A+ RAK+ GG GG +          +  Q E           +E GSS  + +++A    
Subjt:  IVEIELPVPDVLPLSTRSSRSLIFWSTGIVREVNDVCWLYAVCRAKLAGGSGGVL----------EVVQIE----------LAEYGSSKDKSKSKA----

Query:  ----------------------------------------------------------QSVVEP------TCGPSTTSSFRGREQRRFTPGVNVSSRQDF
                                                                  QS+VE       + G STTS  RGREQRRFTPGV+VS  QDF
Subjt:  ----------------------------------------------------------QSVVEP------TCGPSTTSSFRGREQRRFTPGVNVSSRQDF

Query:  KNRASGQTSRQMNVSGAYRGESK
        K R+ G+  RQM+   AY+ +S+
Subjt:  KNRASGQTSRQMNVSGAYRGESK

TrEMBL top hits: e value, % identity, alignment
A0A5A7SZ16 Reverse transcriptase; e value: 2.2e-09; % identity: 50.67
Query:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        +L E     ++S +K +S VE + G ST S FRGREQRRFTPG+N+SSRQDFKNR+ GQ SR ++    ++ +S+
Subjt:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

A0A5A7TLX3 Retrotrans_gag domain-containing protein; e value: 2.2e-09; % identity: 39.71
Query:  LSTRSSRSLIFWST--GIVREVNDVCWLYAVCRAK------LAGGSGGVLEVVQ--IELAEYG----SSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRR
        L+ RS    + W T  GI +   D  +    C AK      L  GS  V E  +   EL+ Y     +S+  S ++ +S VE + G STTS F G EQRR
Subjt:  LSTRSSRSLIFWST--GIVREVNDVCWLYAVCRAK------LAGGSGGVLEVVQ--IELAEYG----SSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRR

Query:  FTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        FTPG+N+SSRQDFKNR+ GQ+SR ++    ++ +S+
Subjt:  FTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

A0A5A7TP97 Reverse transcriptase; e value: 2.2e-09; % identity: 50.67
Query:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        +L E     ++S +K +S VE + G ST S FRGREQRRFTPG+N+SSRQDFKNR+ GQ SR ++    ++ +S+
Subjt:  ELAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

A0A5A7UFH6 DNA/RNA polymerases superfamily protein; e value: 2.0e-10; % identity: 52.7
Query:  LAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK
        +  Y S + +S +K +SVVE + G ST S FRGREQRRFTPG+N+SSRQDFKNR+ G+ SR M+    ++ +S+
Subjt:  LAEYGSSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESK

A0A5D3BNX1 Zf-CCHC domain-containing protein; e value: 1.2e-10; % identity: 32.74
Query:  IVEIELPVPDVLPLSTRSSRSLIFWSTGIVREVNDVCWLYAVCRAKLAGGSGGVL----------EVVQIE----------LAEYGSSKDKSKSKA----
        + EIELPVPD L  S  SS S    S GIVR  +DVCWL+A+ RAK+ GG GG +          +  Q E           +E GSS  + +++A    
Subjt:  IVEIELPVPDVLPLSTRSSRSLIFWSTGIVREVNDVCWLYAVCRAKLAGGSGGVL----------EVVQIE----------LAEYGSSKDKSKSKA----

Query:  ----------------------------------------------------------QSVVEP------TCGPSTTSSFRGREQRRFTPGVNVSSRQDF
                                                                  QS+VE       + G STTS  RGREQRRFTPGV+VS  QDF
Subjt:  ----------------------------------------------------------QSVVEP------TCGPSTTSSFRGREQRRFTPGVNVSSRQDF

Query:  KNRASGQTSRQMNVSGAYRGESK
        K R+ G+  RQM+   AY+ +S+
Subjt:  KNRASGQTSRQMNVSGAYRGESK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGTTGGATTTGTTTTGAAAAGAAAAGAAAATAAAAGAAAAGAAAGGAAAAGGAAAAGAATGAAAAAGGAAAGTGGAAGGACAAGAGGAGATAGGTCGAGTGTACGGGC
CAAAGTTCTAGTTCTAACACTAATAGTGGAGATCGAGCTCCCAGTGCCTGACGTCCTGCCATTGTCTACTAGAAGCTCCAGGAGCTTGATATTTTGGTCAACAGGTATCG
TTAGAGAGGTGAACGATGTCTGTTGGCTTTACGCCGTCTGTCGGGCTAAGTTAGCAGGTGGTTCAGGAGGGGTTCTTGAGGTAGTTCAGATTGAACTAGCAGAGTATGGT
TCAAGTAAGGATAAGAGCAAGTCCAAGGCGCAGTCAGTAGTGGAGCCAACTTGTGGGCCTTCAACAACGAGTAGTTTTCGGGGTCGCGAACAGCGAAGATTCACACCTGG
AGTGAATGTTTCAAGTCGTCAGGACTTTAAGAATCGAGCTAGCGGCCAGACATCGAGGCAGATGAATGTTAGTGGTGCCTATCGAGGCGAGTCAAAGAGCACCTAG
mRNA sequence
ATGGTTGGATTTGTTTTGAAAAGAAAAGAAAATAAAAGAAAAGAAAGGAAAAGGAAAAGAATGAAAAAGGAAAGTGGAAGGACAAGAGGAGATAGGTCGAGTGTACGGGC
CAAAGTTCTAGTTCTAACACTAATAGTGGAGATCGAGCTCCCAGTGCCTGACGTCCTGCCATTGTCTACTAGAAGCTCCAGGAGCTTGATATTTTGGTCAACAGGTATCG
TTAGAGAGGTGAACGATGTCTGTTGGCTTTACGCCGTCTGTCGGGCTAAGTTAGCAGGTGGTTCAGGAGGGGTTCTTGAGGTAGTTCAGATTGAACTAGCAGAGTATGGT
TCAAGTAAGGATAAGAGCAAGTCCAAGGCGCAGTCAGTAGTGGAGCCAACTTGTGGGCCTTCAACAACGAGTAGTTTTCGGGGTCGCGAACAGCGAAGATTCACACCTGG
AGTGAATGTTTCAAGTCGTCAGGACTTTAAGAATCGAGCTAGCGGCCAGACATCGAGGCAGATGAATGTTAGTGGTGCCTATCGAGGCGAGTCAAAGAGCACCTAG
Protein sequence
MVGFVLKRKENKRKERKRKRMKKESGRTRGDRSSVRAKVLVLTLIVEIELPVPDVLPLSTRSSRSLIFWSTGIVREVNDVCWLYAVCRAKLAGGSGGVLEVVQIELAEYG
SSKDKSKSKAQSVVEPTCGPSTTSSFRGREQRRFTPGVNVSSRQDFKNRASGQTSRQMNVSGAYRGESKST
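
As a quick consistency check, the CDS sequence can be translated and compared against the protein sequence listed above. The following is a minimal sketch, assuming the standard nuclear genetic code applies; the function name `translate` and the table construction are illustrative and not part of CuGenDBv2.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted in exactly as it is wrapped on the page. Codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last three codons of the CDS listed above.
print(translate("ATGGTTGGATTTGTTTTG"))
print(translate("AGCACCTAG"))
```

Passing the full CDS to `translate` and comparing the result with the listed protein sequence is one way to confirm the two records agree.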