; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001715 (gene) of Snake gourd v1 genome

Gene IDTan0001715
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG08:34526687..34527010
RNA-Seq ExpressionTan0001715
SyntenyTan0001715
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.9e-3265.71Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DL FVL E+CP  P ++A QSV+DAYD W KANDKA+++ILAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-3569.52Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DLRFVL E+CP  P +NA QSVKDAYDHW KANDKA +Y+LAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa]8.2e-3569.52Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DLRFVL E+CP  P +NA QSVKDAYDHW KANDKA +Y+LAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]7.7e-3367.62Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MS+SIIALL  ++L GENY  WK+NLN ILV+DDLRFVL E CPQ PV NA  +V++AYD WIK+NDKAKVYILAS+ +V AKKHE  V+ +EIM SLQ+
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

XP_038889308.1 uncharacterized protein LOC120079220 [Benincasa hispida]1.0e-3265.71Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSS IAL K+E+LT +NY T K NL+ IL++DDLRF+LTEKCPQVP+ NA  +VKDAYDHW KAN+KA+VYILAS+++V  K+HE M++A EI+ SLQ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein1.4e-3265.71Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DL FVL E+CP  P ++A QSV+DAYD W KANDKA+++ILAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

A0A5A7VA67 Gag/pol protein4.0e-3569.52Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DLRFVL E+CP  P +NA QSVKDAYDHW KANDKA +Y+LAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

A0A5D3BUN8 Gag/pol protein1.4e-3265.71Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DL FVL E+CP  P ++A QSV+DAYD W KANDKA+++ILAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

A0A5D3D0D9 Gag/pol protein4.0e-3569.52Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MSSSIIALLK+++LTGENY TWK+ LNMILV+ DLRFVL E+CP  P +NA QSVKDAYDHW KANDKA +Y+LAS+ ++ +KKHE MV+AR+IM SL+ 
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

A0A6J1DWL0 uncharacterized protein LOC1110247343.7e-3367.62Show/hide
Query:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN
        MS+SIIALL  ++L GENY  WK+NLN ILV+DDLRFVL E CPQ PV NA  +V++AYD WIK+NDKAKVYILAS+ +V AKKHE  V+ +EIM SLQ+
Subjt:  MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQN

Query:  IFGQP
        +FGQP
Subjt:  IFGQP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCGATAATAGCCTTACTCAAAAGGGAACGTTTAACTGGTGAGAATTATACTACGTGGAAGGCCAACCTGAATATGATTCTAGTTGTTGACGACTTACGGTT
TGTACTTACTGAGAAATGTCCTCAGGTCCCTGTTCGAAACGCTCCTCAATCTGTTAAGGATGCGTACGACCATTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTATGAAGTTTTTGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGAATATATTTGGACAACCGTATGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCGATAATAGCCTTACTCAAAAGGGAACGTTTAACTGGTGAGAATTATACTACGTGGAAGGCCAACCTGAATATGATTCTAGTTGTTGACGACTTACGGTT
TGTACTTACTGAGAAATGTCCTCAGGTCCCTGTTCGAAACGCTCCTCAATCTGTTAAGGATGCGTACGACCATTGGATCAAGGCCAATGACAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTATGAAGTTTTTGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGAATATATTTGGACAACCGTATGGATAA
Protein sequenceShow/hide protein sequence
MSSSIIALLKRERLTGENYTTWKANLNMILVVDDLRFVLTEKCPQVPVRNAPQSVKDAYDHWIKANDKAKVYILASVYEVFAKKHEGMVSAREIMSSLQNIFGQPYG