; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017637 (gene) of Snake gourd v1 genome

Gene IDTan0017637
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:23646700..23647044
RNA-Seq ExpressionTan0017637
SyntenyTan0017637
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-3167.33Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+S I+ LL +EKL  +N  TWK+ LNTILVVDDL+FVLTEECPQ P SNA+R  R+AYDRWIKAN+K +VY+LASM D+LAKKHE + TAKEIMD + G
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

XP_022143540.1 uncharacterized protein LOC111013417 [Momordica charantia]3.7e-3368.32Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ L  +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P +NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

XP_022157095.1 uncharacterized protein LOC111023904 [Momordica charantia]2.8e-3369.31Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ LL +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P  NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

XP_022157632.1 uncharacterized protein LOC111024294 [Momordica charantia]2.2e-3369.31Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ LL +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P  NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]1.3e-3369.31Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ LL +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P +NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein5.7e-3267.33Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+S I+ LL +EKL  +N  TWK+ LNTILVVDDL+FVLTEECPQ P SNA+R  R+AYDRWIKAN+K +VY+LASM D+LAKKHE + TAKEIMD + G
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

A0A6J1CP29 uncharacterized protein LOC1110134171.8e-3368.32Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ L  +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P +NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

A0A6J1DS54 uncharacterized protein LOC1110239041.4e-3369.31Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ LL +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P  NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

A0A6J1DUZ9 uncharacterized protein LOC1110242941.0e-3369.31Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ LL +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P  NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

A0A6J1DXQ5 uncharacterized protein LOC1110244576.1e-3469.31Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG
        M+SSI+ LL +EKL G N  TWKN LNTILVVDDL+FVLTEECPQ P +NA+RNVR+A+DRW+KANDK +VY+LASM D+LAKKHE ++TAKEIMD +  
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISG

Query:  V
        +
Subjt:  V

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCAGAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGATGATCTGAAGTT
TGTGCTAACTGAGGAGTGTCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGACCAAGGTCTACATGC
TGGCAAGTATGTGCGACATATTAGCCAAGAAGCATGAGGGCATGATTACCGCTAAGGAAATCATGGATTACATCAGCGGGGTATGTTTGGACAACAGTCCACACAAGCCC
GACATAATGCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCAGAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGATGATCTGAAGTT
TGTGCTAACTGAGGAGTGTCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGACCAAGGTCTACATGC
TGGCAAGTATGTGCGACATATTAGCCAAGAAGCATGAGGGCATGATTACCGCTAAGGAAATCATGGATTACATCAGCGGGGTATGTTTGGACAACAGTCCACACAAGCCC
GACATAATGCCATAA
Protein sequenceShow/hide protein sequence
MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECPQVPGSNASRNVRDAYDRWIKANDKTKVYMLASMCDILAKKHEGMITAKEIMDYISGVCLDNSPHKP
DIMP