; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017159 (gene) of Snake gourd v1 genome

Gene IDTan0017159
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationContig00117:130783..131853
RNA-Seq ExpressionTan0017159
SyntenyTan0017159
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]8.3e-2164.63Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC  H+TSD+N +SLA EYNG+E + VGNGQT PIS++G
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]2.0e-1960.98Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC   +TSD+N +SLA EYNG+E + +GNGQT P+S++G
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]2.2e-2157.61Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPR
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC   +TSD+N +SLA EYNG+E + +GNGQT P+S++GQ+ G   +P+
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPR

XP_011658579.1 uncharacterized protein LOC105436058 [Cucumis sativus]8.8e-2360.87Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPR
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC  H+TSD+N +SLA EYNG+E + VGNGQT PIS++GQ+ G   +P+
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPR

XP_022150845.1 uncharacterized protein LOC111018892 [Momordica charantia]1.4e-2043.64Show/hide
Query:  NGCRGVYPNNPTNGGRG-NNGGTRSN--------GSFSNAGNVVLPSQMDESCVKFVIGMV-IQPLDCYNRMNYHFQGRHPPMQLATMAVVQNQQFA---
        N     +PN   + GRG NNG  ++N        G   ++GN     Q D      + G +    LDCYNRMN+HFQGRHPP QLA M  VQN  +    
Subjt:  NGCRGVYPNNPTNGGRG-NNGGTRSN--------GSFSNAGNVVLPSQMDESCVKFVIGMV-IQPLDCYNRMNYHFQGRHPPMQLATMAVVQNQQFA---

Query:  -SSASPWLTDSGCTAHVTSDLNQL---SLASEYNGDELISVGNGQTLPISN--TGQMLGSNPIPR
         SS + WL DS C  H+T+DL+ L   S+AS+YNG+E ISVG+GQ+ PI++   GQ+ GSN +P+
Subjt:  -SSASPWLTDSGCTAHVTSDLNQL---SLASEYNGDELISVGNGQTLPISN--TGQMLGSNPIPR

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X29.8e-2060.98Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC   +TSD+N +SLA EYNG+E + +GNGQT P+S++G
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X31.1e-2157.61Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPR
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC   +TSD+N +SLA EYNG+E + +GNGQT P+S++GQ+ G   +P+
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPR

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X19.8e-2060.98Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC   +TSD+N +SLA EYNG+E + +GNGQT P+S++G
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG

A0A5D3CLI6 T4.54.9e-1960.49Show/hide
Query:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNT
        LDC+NRMNY+FQGRHPP QLA M   QN  F S   S  LTDSGC   +TSD+N +SLA EYNG+E + +GNGQT P+S++
Subjt:  LDCYNRMNYHFQGRHPPMQLATMAVVQNQQFASSA-SPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNT

A0A6J1D9L6 uncharacterized protein LOC1110188926.8e-2143.64Show/hide
Query:  NGCRGVYPNNPTNGGRG-NNGGTRSN--------GSFSNAGNVVLPSQMDESCVKFVIGMV-IQPLDCYNRMNYHFQGRHPPMQLATMAVVQNQQFA---
        N     +PN   + GRG NNG  ++N        G   ++GN     Q D      + G +    LDCYNRMN+HFQGRHPP QLA M  VQN  +    
Subjt:  NGCRGVYPNNPTNGGRG-NNGGTRSN--------GSFSNAGNVVLPSQMDESCVKFVIGMV-IQPLDCYNRMNYHFQGRHPPMQLATMAVVQNQQFA---

Query:  -SSASPWLTDSGCTAHVTSDLNQL---SLASEYNGDELISVGNGQTLPISN--TGQMLGSNPIPR
         SS + WL DS C  H+T+DL+ L   S+AS+YNG+E ISVG+GQ+ PI++   GQ+ GSN +P+
Subjt:  -SSASPWLTDSGCTAHVTSDLNQL---SLASEYNGDELISVGNGQTLPISN--TGQMLGSNPIPR

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.2e-0752.08Show/hide
Query:  SASPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG
        S++ WL DSG T H+TSD N LSL   Y G + + V +G T+PIS+TG
Subjt:  SASPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAACAATCCCACAACAATCTCAATACTCTCGAGGACGAGGTAATGGTGGTGGTCGATCTCGCTTTCAGGGTTGTGGTAATACCAATCGATGTGGTAGTAATGGATG
CAGAGGGGTATACCCGAATAATCCTACAAATGGTGGTCGTGGAAATAATGGGGGAACTCGATCAAATGGATCTTTCTCAAATGCTGGAAATGTTGTTCTACCCAGTCAGA
TGGACGAATCATGTGTCAAATTTGTAATAGGTATGGTCATTCAACCCCTGGATTGTTATAATAGAATGAATTATCATTTTCAAGGCAGGCATCCTCCTATGCAACTTGCT
ACTATGGCAGTAGTTCAAAACCAACAATTTGCCTCTTCTGCATCTCCTTGGCTCACTGATTCAGGCTGCACTGCGCATGTTACGTCTGATTTGAATCAACTTTCGTTAGC
CTCGGAATATAATGGTGATGAACTAATATCAGTAGGAAATGGCCAAACTCTTCCCATATCTAATACAGGACAAATGCTCGGGTCAAATCCTATACCAAGGACCTACTGTA
AATGGTCTTTACCCCATTCCCAGGCGGCCTACAGCCTTCACCAACCCTACTACACAACGTTTTGCTCATGTCAACAAGGTGTCTTTATCCTCTTTCAGGCAGAATCAGTT
AGGACATCCTAA
mRNA sequenceShow/hide mRNA sequence
TCTCGAAAATCAGAACTAGCGTGAAGAATATGTTCTTCCGCAATCTGCGCAATGGTAACAATCCCACAACAATCTCAATACTCTCGAGGACGAGGTAATGGTGGTGGTCG
ATCTCGCTTTCAGGGTTGTGGTAATACCAATCGATGTGGTAGTAATGGATGCAGAGGGGTATACCCGAATAATCCTACAAATGGTGGTCGTGGAAATAATGGGGGAACTC
GATCAAATGGATCTTTCTCAAATGCTGGAAATGTTGTTCTACCCAGTCAGATGGACGAATCATGTGTCAAATTTGTAATAGGTATGGTCATTCAACCCCTGGATTGTTAT
AATAGAATGAATTATCATTTTCAAGGCAGGCATCCTCCTATGCAACTTGCTACTATGGCAGTAGTTCAAAACCAACAATTTGCCTCTTCTGCATCTCCTTGGCTCACTGA
TTCAGGCTGCACTGCGCATGTTACGTCTGATTTGAATCAACTTTCGTTAGCCTCGGAATATAATGGTGATGAACTAATATCAGTAGGAAATGGCCAAACTCTTCCCATAT
CTAATACAGGACAAATGCTCGGGTCAAATCCTATACCAAGGACCTACTGTAAATGGTCTTTACCCCATTCCCAGGCGGCCTACAGCCTTCACCAACCCTACTACACAACG
TTTTGCTCATGTCAACAAGGTGTCTTTATCCTCTTTCAGGCAGAATCAGTTAGGACATCCTAACCCTGTTATTCTTTTGTCAATCCTCTTATGTTTGTGAACACTATTTA
CATGGTAAAATGCATAAACTCTCCTTCCCACATTCTTCTACTACTTCCCTGTATCCGCTAGAAATTTTGCATTCTGATATATGGGGCCCTGCCCCTGAAACTTCTGTTAA
TGGCCATAAATACTATGTTGCTTTTGTTGATGATATGTC
Protein sequenceShow/hide protein sequence
MVTIPQQSQYSRGRGNGGGRSRFQGCGNTNRCGSNGCRGVYPNNPTNGGRGNNGGTRSNGSFSNAGNVVLPSQMDESCVKFVIGMVIQPLDCYNRMNYHFQGRHPPMQLA
TMAVVQNQQFASSASPWLTDSGCTAHVTSDLNQLSLASEYNGDELISVGNGQTLPISNTGQMLGSNPIPRTYCKWSLPHSQAAYSLHQPYYTTFCSCQQGVFILFQAESV
RTS