; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001209 (gene) of Snake gourd v1 genome

Gene IDTan0001209
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:44782989..44783535
RNA-Seq ExpressionTan0001209
SyntenyTan0001209
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035676.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-3470.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL +KKL G+N  TWK+ LNTILVVDDL+F+LTEECP+ P SNA+R  R+AYDRWIKAN+KA+VY+LASMSD+L KKHE + T KEI+DS+KG
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

XP_022143540.1 uncharacterized protein LOC111013417 [Momordica charantia]3.9e-3569.23Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ L  ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P +NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

XP_022157095.1 uncharacterized protein LOC111023904 [Momordica charantia]3.0e-3570.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P  NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

XP_022157632.1 uncharacterized protein LOC111024294 [Momordica charantia]2.3e-3570.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P  NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]1.3e-3570.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P +NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

TrEMBL top hitse value%identityAlignment
A0A5A7T0E9 Gag/pol protein7.2e-3570.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL +KKL G+N  TWK+ LNTILVVDDL+F+LTEECP+ P SNA+R  R+AYDRWIKAN+KA+VY+LASMSD+L KKHE + T KEI+DS+KG
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

A0A6J1CP29 uncharacterized protein LOC1110134171.9e-3569.23Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ L  ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P +NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

A0A6J1DS54 uncharacterized protein LOC1110239041.4e-3570.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P  NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

A0A6J1DUZ9 uncharacterized protein LOC1110242941.1e-3570.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P  NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

A0A6J1DXQ5 uncharacterized protein LOC1110244576.5e-3670.19Show/hide
Query:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG
        M+SSI+ LL ++KL G N  TWKN LNTILVVDDL+FVLTEECP+ P +NA+RNVR+A+DRW+KANDKA+VY+LASM+D+L KKHE ++TAKEIMDS+K 
Subjt:  MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKG

Query:  MFGQ
        MFG+
Subjt:  MFGQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGAAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGACGATTTGAAGTT
TGTGCTAACTGAGGAGTGTCCTCGGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATGC
TGGCAAGTATGTCTGACATGTTAGGCAAGAAGCATGAGGGCATGATTACCGCTAAGGAAATCATGGATTCTGTGAAGGGTATGTTTGGACAACAGCCACACAAGCCCGAC
ATAATGCCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGAAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGACGATTTGAAGTT
TGTGCTAACTGAGGAGTGTCCTCGGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGCCAAGGTCTACATGC
TGGCAAGTATGTCTGACATGTTAGGCAAGAAGCATGAGGGCATGATTACCGCTAAGGAAATCATGGATTCTGTGAAGGGTATGTTTGGACAACAGCCACACAAGCCCGAC
ATAATGCCCTAAAGTACATATTCAACTCGAGGATGCCAGAGGGTACATCTGTGCGGGATCATGTCCTAGATATGATGGTGCACTTTAACATCGCGGAGTCGAATGGTGCT
TCCATCGATGAGTCGAGCAGTCGACTTCATTCTCGAAACCCTTCCGAGATAGTTTCTGCAGATTTAGAAGTAATCTTTGTTATGAACGAGCTTACTTTTAATCTTAC
Protein sequenceShow/hide protein sequence
MSSSIISLLGAKKLTGENMMTWKNKLNTILVVDDLKFVLTEECPRVPGSNASRNVRDAYDRWIKANDKAKVYMLASMSDMLGKKHEGMITAKEIMDSVKGMFGQQPHKPD
IMP