; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004253 (gene) of Snake gourd v1 genome

Gene IDTan0004253
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationLG04:63781788..63799591
RNA-Seq ExpressionTan0004253
SyntenyTan0004253
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037515.1 uncharacterized protein E6C27_scaffold277G001260 [Cucumis melo var. makuwa]1.6e-1050.53Show/hide
Query:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSG-KAGSTGIVREVNDVCWLHAVCRTKLAGSLGGGL
        + PRGNR S++   L + +        FLL TL+VL DN+A+V IELPV D LP     SG  + S GIVRE +DVCWLH V R K AG  GGG+
Subjt:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSG-KAGSTGIVREVNDVCWLHAVCRTKLAGSLGGGL

KAA0042035.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]4.0e-0943.24Show/hide
Query:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSGK-----------------AGSTGIVREVNDVCWLHAVCRTK
        PRGNR S++   L + +        F LSTLSVL DN+ VV IELP+PD LP     S                    GS GI+R  +DVCWLHAV R K
Subjt:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSGK-----------------AGSTGIVREVNDVCWLHAVCRTK

Query:  LAGSLGGGLIP
          G  GGG++P
Subjt:  LAGSLGGGLIP

KAA0060199.1 uncharacterized protein E6C27_scaffold386G00120 [Cucumis melo var. makuwa]7.8e-1349.55Show/hide
Query:  RPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIV-CLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSLGGG----LIP
        RPRGNR S++   L + +        F LSTLSVL DN+AVV IELPVPD LP         + STGIVR  +DVCWLHAV R K AG  GGG    +  
Subjt:  RPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIV-CLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSLGGG----LIP

Query:  GLEQRGPTLSL
        G  Q+G T ++
Subjt:  GLEQRGPTLSL

KAA0067117.1 hypothetical protein E6C27_scaffold38G001760 [Cucumis melo var. makuwa]2.8e-1042.52Show/hide
Query:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLP---------------------YIVCLSGK------------AGSTG
        + PRGNRSS++   L + +        FLLSTLSVL+DNNAVV IELPVPD LP                     ++  L  K             GS G
Subjt:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLP---------------------YIVCLSGK------------AGSTG

Query:  IVREVNDVCWLHAVCRTKLAGSLGGGL
        IVR  +DVCWLH V R K+AG  GGG+
Subjt:  IVREVNDVCWLHAVCRTKLAGSLGGGL

TYK11725.1 uncharacterized protein E5676_scaffold304G00370 [Cucumis melo var. makuwa]9.2e-1451.89Show/hide
Query:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYI------------VCLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSL
        PRGNR S++   L + +        FLLSTLSVL DN+AVV IELPVPD LP +            VC     GS GIVR  +DVCWLHAVCR K+AG  
Subjt:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYI------------VCLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSL

Query:  GGGLIP
        GGG++P
Subjt:  GGGLIP

TrEMBL top hitse value%identityAlignment
A0A5A7TKT0 Reverse transcriptase1.9e-0943.24Show/hide
Query:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSGK-----------------AGSTGIVREVNDVCWLHAVCRTK
        PRGNR S++   L + +        F LSTLSVL DN+ VV IELP+PD LP     S                    GS GI+R  +DVCWLHAV R K
Subjt:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSGK-----------------AGSTGIVREVNDVCWLHAVCRTK

Query:  LAGSLGGGLIP
          G  GGG++P
Subjt:  LAGSLGGGLIP

A0A5A7VIY9 Uncharacterized protein1.3e-1042.52Show/hide
Query:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLP---------------------YIVCLSGK------------AGSTG
        + PRGNRSS++   L + +        FLLSTLSVL+DNNAVV IELPVPD LP                     ++  L  K             GS G
Subjt:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLP---------------------YIVCLSGK------------AGSTG

Query:  IVREVNDVCWLHAVCRTKLAGSLGGGL
        IVR  +DVCWLH V R K+AG  GGG+
Subjt:  IVREVNDVCWLHAVCRTKLAGSLGGGL

A0A5D3BAD1 Retrotrans_gag domain-containing protein3.8e-1349.55Show/hide
Query:  RPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIV-CLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSLGGG----LIP
        RPRGNR S++   L + +        F LSTLSVL DN+AVV IELPVPD LP         + STGIVR  +DVCWLHAV R K AG  GGG    +  
Subjt:  RPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIV-CLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSLGGG----LIP

Query:  GLEQRGPTLSL
        G  Q+G T ++
Subjt:  GLEQRGPTLSL

A0A5D3BSA6 Retrotrans_gag domain-containing protein7.9e-1150.53Show/hide
Query:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSG-KAGSTGIVREVNDVCWLHAVCRTKLAGSLGGGL
        + PRGNR S++   L + +        FLL TL+VL DN+A+V IELPV D LP     SG  + S GIVRE +DVCWLH V R K AG  GGG+
Subjt:  DRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYIVCLSG-KAGSTGIVREVNDVCWLHAVCRTKLAGSLGGGL

A0A5D3CJL4 Retrotrans_gag domain-containing protein4.5e-1451.89Show/hide
Query:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYI------------VCLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSL
        PRGNR S++   L + +        FLLSTLSVL DN+AVV IELPVPD LP +            VC     GS GIVR  +DVCWLHAVCR K+AG  
Subjt:  PRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVV-IELPVPDNLPYI------------VCLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSL

Query:  GGGLIP
        GGG++P
Subjt:  GGGLIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGGATTTGGTTTGTTTGACAGGCCAAGAGGAAATAGGTCGAGTGTTCGGGCCAAGGTTCTACCCGAGGCTATATGGTACCGTGTGCACACAGGTCATTTTCTTTT
GTCGACGTTGAGTGTACTCCATGACAACAATGCTGTGGTGATCGAGCTCCCGGTGCCTGATAACCTGCCATATATAGTATGCTTATCAGGTAAAGCAGGGTCAACAGGTA
TCGTTAGGGAGGTGAACGATGTCTGTTGGCTTCACGCCGTCTGTCGGACTAAGTTAGCTGGTAGCTTGGGAGGGGGGTTGATTCCAGGACTTGAACAACGGGGCCCCACC
CTCTCACTTGCCCGAGAGGGAGGTTGTTTATGGTCCTTAAGGAGTAAGGAGCAACTTTTCATTAGAGGAGCAGTGGCACTTAAGAAGGAAGAGTCATGCGAAAAACCGAC
TAACTTAAATGCATACCAAAACTTATGGGAATGCACCCAAAGGTCACAAATGACCCAAACTCACGGTAACTACATGGTCCCAACATACATGGTACACAACAAAACAGGGT
CGCGAAAACTTACCCATAACTATGTTAGTCAATCACGGAAAACTGAGCTTCCAATCACTTGCCCTTGCCAAAACTCTTTGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGGATTTGGTTTGTTTGACAGGCCAAGAGGAAATAGGTCGAGTGTTCGGGCCAAGGTTCTACCCGAGGCTATATGGTACCGTGTGCACACAGGTCATTTTCTTTT
GTCGACGTTGAGTGTACTCCATGACAACAATGCTGTGGTGATCGAGCTCCCGGTGCCTGATAACCTGCCATATATAGTATGCTTATCAGGTAAAGCAGGGTCAACAGGTA
TCGTTAGGGAGGTGAACGATGTCTGTTGGCTTCACGCCGTCTGTCGGACTAAGTTAGCTGGTAGCTTGGGAGGGGGGTTGATTCCAGGACTTGAACAACGGGGCCCCACC
CTCTCACTTGCCCGAGAGGGAGGTTGTTTATGGTCCTTAAGGAGTAAGGAGCAACTTTTCATTAGAGGAGCAGTGGCACTTAAGAAGGAAGAGTCATGCGAAAAACCGAC
TAACTTAAATGCATACCAAAACTTATGGGAATGCACCCAAAGGTCACAAATGACCCAAACTCACGGTAACTACATGGTCCCAACATACATGGTACACAACAAAACAGGGT
CGCGAAAACTTACCCATAACTATGTTAGTCAATCACGGAAAACTGAGCTTCCAATCACTTGCCCTTGCCAAAACTCTTTGAATTGA
Protein sequenceShow/hide protein sequence
MLGFGLFDRPRGNRSSVRAKVLPEAIWYRVHTGHFLLSTLSVLHDNNAVVIELPVPDNLPYIVCLSGKAGSTGIVREVNDVCWLHAVCRTKLAGSLGGGLIPGLEQRGPT
LSLAREGGCLWSLRSKEQLFIRGAVALKKEESCEKPTNLNAYQNLWECTQRSQMTQTHGNYMVPTYMVHNKTGSRKLTHNYVSQSRKTELPITCPCQNSLN