; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001760 (gene) of Snake gourd v1 genome

Gene IDTan0001760
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:47124932..47125285
RNA-Seq ExpressionTan0001760
SyntenyTan0001760
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]4.2e-4069.23Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        M+SS +QLLAS+KLN  N+ TWK+NLNTILV+DDLRFVLTEECP  P+++A + VR+AFD+W +ANDKARVYILA+++DVL+KKHE ++TAKEIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        MFG+PSS++R++ALKYV
Subjt:  MFGQPSSSVRYDALKYV

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]1.7e-4172.65Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S IQLLASDKLN DN+G WKSNLNTILVIDDLRFVLTEECPP P+ +A +TVRDA+D+W +AN+KARVYILA+IS+VLSKKHE + T +EIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        +FGQPS+++ +DA+KYV
Subjt:  MFGQPSSSVRYDALKYV

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia]4.5e-4275.21Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S IQLLASDKLN DN+G WKSNLNTILVIDDLRFVLTEECPP  + ++ QTVRDA D+W +AN+KARVYILA+ISDVLSKKHEG+ TA+EIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        +FGQPS+S+ +DA+KYV
Subjt:  MFGQPSSSVRYDALKYV

XP_022159023.1 uncharacterized protein LOC111025468 [Momordica charantia]5.4e-4070.94Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S IQLLASDKLN DN+G WKSNLNTILVIDDLR VLTEECPP P+ +A +TVR+A+D+W +ANDKARVYILA+ISDVLSKKHE + TA+E+M SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        + GQP +S+ +DA++YV
Subjt:  MFGQPSSSVRYDALKYV

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida]4.5e-4271.79Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S +Q LAS+KLNDDN+GTWKSNLNTILVIDDL+FVLTEECPP+P+ +  +T+ DA D+WT+AN+KA+VYILA+ISD+LSKKHE M+ AKEIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        +FGQPSSS  +DA+KYV
Subjt:  MFGQPSSSVRYDALKYV

TrEMBL top hitse value%identityAlignment
A0A6J1CP29 uncharacterized protein LOC1110134175.9e-4068.38Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        M+SS +QL AS+KLN  N+ TWK+NLNTILV+DDLRFVLTEECP  P+++A + VR+AFD+W +ANDKARVYILA+++DVL+KKHE ++TAKEIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        MFG+PSS++R++ALKYV
Subjt:  MFGQPSSSVRYDALKYV

A0A6J1DWG6 uncharacterized protein LOC1110250218.2e-4272.65Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S IQLLASDKLN DN+G WKSNLNTILVIDDLRFVLTEECPP P+ +A +TVRDA+D+W +AN+KARVYILA+IS+VLSKKHE + T +EIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        +FGQPS+++ +DA+KYV
Subjt:  MFGQPSSSVRYDALKYV

A0A6J1DXP1 uncharacterized protein LOC1110254682.6e-4070.94Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S IQLLASDKLN DN+G WKSNLNTILVIDDLR VLTEECPP P+ +A +TVR+A+D+W +ANDKARVYILA+ISDVLSKKHE + TA+E+M SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        + GQP +S+ +DA++YV
Subjt:  MFGQPSSSVRYDALKYV

A0A6J1DXQ5 uncharacterized protein LOC1110244572.0e-4069.23Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        M+SS +QLLAS+KLN  N+ TWK+NLNTILV+DDLRFVLTEECP  P+++A + VR+AFD+W +ANDKARVYILA+++DVL+KKHE ++TAKEIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        MFG+PSS++R++ALKYV
Subjt:  MFGQPSSSVRYDALKYV

A0A6J1E205 uncharacterized protein LOC1110252582.2e-4275.21Show/hide
Query:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA
        MS+S IQLLASDKLN DN+G WKSNLNTILVIDDLRFVLTEECPP  + ++ QTVRDA D+W +AN+KARVYILA+ISDVLSKKHEG+ TA+EIM SL A
Subjt:  MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLA

Query:  MFGQPSSSVRYDALKYV
        +FGQPS+S+ +DA+KYV
Subjt:  MFGQPSSSVRYDALKYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCTGACAAACTTAACGACGATAACTTTGGTACTTGGAAATCAAACTTGAATACGATTCTTGTGATTGATGATCTAAGGTT
CGTTTTAACGGAGGAATGTCCTCCCCTTCCCAGCTCGTCTGCAACCCAAACTGTTCGGGATGCATTTGACAAATGGACTAGGGCTAATGATAAGGCCCGAGTCTACATCT
TAGCCAACATATCTGATGTGTTGTCTAAGAAACATGAGGGCATGATCACCGCAAAGGAGATCATGGGTTCATTATTGGCCATGTTTGGACAACCGTCCTCGTCGGTCCGT
TATGATGCTCTCAAGTACGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCTGACAAACTTAACGACGATAACTTTGGTACTTGGAAATCAAACTTGAATACGATTCTTGTGATTGATGATCTAAGGTT
CGTTTTAACGGAGGAATGTCCTCCCCTTCCCAGCTCGTCTGCAACCCAAACTGTTCGGGATGCATTTGACAAATGGACTAGGGCTAATGATAAGGCCCGAGTCTACATCT
TAGCCAACATATCTGATGTGTTGTCTAAGAAACATGAGGGCATGATCACCGCAAAGGAGATCATGGGTTCATTATTGGCCATGTTTGGACAACCGTCCTCGTCGGTCCGT
TATGATGCTCTCAAGTACGTTTAG
Protein sequenceShow/hide protein sequence
MSSSFIQLLASDKLNDDNFGTWKSNLNTILVIDDLRFVLTEECPPLPSSSATQTVRDAFDKWTRANDKARVYILANISDVLSKKHEGMITAKEIMGSLLAMFGQPSSSVR
YDALKYV