; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022017 (gene) of Snake gourd v1 genome

Gene IDTan0022017
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:52189744..52190109
RNA-Seq ExpressionTan0022017
SyntenyTan0022017
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-4068.64Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+SLI+ LL S++L  +N+TTWKSNLNTILVVDDLRFVL+EECPQ PA NA +T +EAYDRWIKAN+KA+VYILAS+S+VLAKKHE + +A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVY
        MFGQP   +RH+++KY+Y
Subjt:  MFGQPSGPIRHESLKYVY

XP_022143540.1 uncharacterized protein LOC111013417 [Momordica charantia]5.6e-4067.23Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S I+ L  S++L G N++TWK+NLNTILVVDDLRFVL+EECPQ PA NA + V+EA+DRW+KANDKA+VYILAS+++VLAKKHE +++A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFG+PS  +RHE+LKYVYN
Subjt:  MFGQPSGPIRHESLKYVYN

XP_022157632.1 uncharacterized protein LOC111024294 [Momordica charantia]4.3e-4067.23Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S I+ LL S++L G N++TWK+NLNTILVVDDL+FVL+EECPQ PA NA + V+EA+DRW+KANDKA+VYILAS+++VLAKKHE +++A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFG+PS  +RHE+LKYVYN
Subjt:  MFGQPSGPIRHESLKYVYN

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]1.9e-4068.07Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S I+ LL S++L G N++TWK+NLNTILVVDDLRFVL+EECPQ PA NA + V+EA+DRW+KANDKA+VYILAS+++VLAKKHE +++A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFG+PS  +RHE+LKYVYN
Subjt:  MFGQPSGPIRHESLKYVYN

XP_038891685.1 uncharacterized protein LOC120081079 [Benincasa hispida]9.6e-4068.64Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+SLII LL S++L G+N++ WKSNLNTIL+VDDLRFVLSEECPQ PA NA +TV+EAYDRW+KAN+KA VYILAS+S+VLAKKHE + +A+EI+ SL+E
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVY
        MFGQ S  +RHE++K++Y
Subjt:  MFGQPSGPIRHESLKYVY

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein2.7e-4068.64Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+SLI+ LL S++L  +N+TTWKSNLNTILVVDDLRFVL+EECPQ PA NA +T +EAYDRWIKAN+KA+VYILAS+S+VLAKKHE + +A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVY
        MFGQP   +RH+++KY+Y
Subjt:  MFGQPSGPIRHESLKYVY

A0A6J1CP29 uncharacterized protein LOC1110134172.7e-4067.23Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S I+ L  S++L G N++TWK+NLNTILVVDDLRFVL+EECPQ PA NA + V+EA+DRW+KANDKA+VYILAS+++VLAKKHE +++A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFG+PS  +RHE+LKYVYN
Subjt:  MFGQPSGPIRHESLKYVYN

A0A6J1DFZ2 uncharacterized protein LOC1110200957.9e-4069.75Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        MS+ IIALL + RL GEN+  WKSNLNTILV+DDL+FVL E+CPQ  A NA   V+ AYDRWIKANDKAKVYILAS+S+VLAKKHE  ++A+EIM SLQ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFGQPS   RHE+LK++YN
Subjt:  MFGQPSGPIRHESLKYVYN

A0A6J1DUZ9 uncharacterized protein LOC1110242942.1e-4067.23Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S I+ LL S++L G N++TWK+NLNTILVVDDL+FVL+EECPQ PA NA + V+EA+DRW+KANDKA+VYILAS+++VLAKKHE +++A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFG+PS  +RHE+LKYVYN
Subjt:  MFGQPSGPIRHESLKYVYN

A0A6J1DXQ5 uncharacterized protein LOC1110244579.4e-4168.07Show/hide
Query:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE
        M+S I+ LL S++L G N++TWK+NLNTILVVDDLRFVL+EECPQ PA NA + V+EA+DRW+KANDKA+VYILAS+++VLAKKHE +++A+EIM SL+ 
Subjt:  MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQE

Query:  MFGQPSGPIRHESLKYVYN
        MFG+PS  +RHE+LKYVYN
Subjt:  MFGQPSGPIRHESLKYVYN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTTAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TGTACTGTCTGAGGAATGTCCTCAGATCCCTGCTCGTAACGCTCCTCAAACTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCCGGACCGATTCGA
CACGAATCCCTCAAATACGTTTATAACCCCGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTTAATAATAGCCTTACTTAAAAGCGACCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACGACCTACGGTT
TGTACTGTCTGAGGAATGTCCTCAGATCCCTGCTCGTAACGCTCCTCAAACTGTTAAGGAGGCGTACGACCGCTGGATCAAGGCCAATGATAAGGCCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGCTGCAGGAAATGTTTGGACAACCGTCCGGACCGATTCGA
CACGAATCCCTCAAATACGTTTATAACCCCGTATGA
Protein sequenceShow/hide protein sequence
MSSLIIALLKSDRLTGENFTTWKSNLNTILVVDDLRFVLSEECPQIPARNAPQTVKEAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSAREIMSSLQEMFGQPSGPIR
HESLKYVYNPV