; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013001 (gene) of Snake gourd v1 genome

Gene IDTan0013001
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:38620072..38620395
RNA-Seq ExpressionTan0013001
SyntenyTan0013001
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032529.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-3569.52Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS +QLLASEKLNGDN+E WKSNLNTILV+DDLRF+LT+ECP  P+S+A RT R+A+++W +AN+KARVYILA+++DVL+KKHES+ T K+IM +L+A
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQL
        MFGQL
Subjt:  MFGQL

XP_022157095.1 uncharacterized protein LOC111023904 [Momordica charantia]2.6e-3672.64Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+ +A R VREAF++W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+A
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        MFG+LS
Subjt:  MFGQLS

XP_022157844.1 uncharacterized protein LOC111024457 [Momordica charantia]7.4e-3671.7Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+++A R VREAF++W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+A
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        MFG+ S
Subjt:  MFGQLS

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]9.7e-3672.64Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        MS+S IQLLAS+KLNGDN+  WKSNLNTILVIDDLRFVLT+ECPP P+ +A RTVR+A+++W +AN+KARVYILAS+S+VLSKKHE + T +EIM SLQA
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        +FGQ S
Subjt:  MFGQLS

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida]5.7e-3671.7Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS IQLL SEKLNGDN+  WKSNLNTILV+DDLRFVLT+ECP  P+S+A RTVREA+++W +AN+KAR+YILAS+SDVL+KKHES+ TAKEI+ SL+ 
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        +FGQ S
Subjt:  MFGQLS

TrEMBL top hitse value%identityAlignment
A0A5A7STL5 Gag/pol protein8.0e-3669.52Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS +QLLASEKLNGDN+E WKSNLNTILV+DDLRF+LT+ECP  P+S+A RT R+A+++W +AN+KARVYILA+++DVL+KKHES+ T K+IM +L+A
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQL
        MFGQL
Subjt:  MFGQL

A0A6J1DS54 uncharacterized protein LOC1110239041.2e-3672.64Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+ +A R VREAF++W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+A
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        MFG+LS
Subjt:  MFGQLS

A0A6J1DWG6 uncharacterized protein LOC1110250214.7e-3672.64Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        MS+S IQLLAS+KLNGDN+  WKSNLNTILVIDDLRFVLT+ECPP P+ +A RTVR+A+++W +AN+KARVYILAS+S+VLSKKHE + T +EIM SLQA
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        +FGQ S
Subjt:  MFGQLS

A0A6J1DXP1 uncharacterized protein LOC1110254688.0e-3674.04Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        MS+S IQLLAS+KLNGDN+  WKSNLNTILVIDDLR VLT+ECPP P+ +A RTVREA+++W +ANDKARVYILAS+SDVLSKKHE + TA+E+M SLQA
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQ
        + GQ
Subjt:  MFGQ

A0A6J1DXQ5 uncharacterized protein LOC1110244573.6e-3671.7Show/hide
Query:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA
        M+SS +QLLASEKLNG N+ TWK+NLNTILV+DDLRFVLT+ECP  P+++A R VREAF++W +ANDKARVYILAS++DVL+KKHE ++TAKEIM SL+A
Subjt:  MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQA

Query:  MFGQLS
        MFG+ S
Subjt:  MFGQLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGAAAAACTTAACGGTGATAACTTTGAAACTTGGAAATCAAACCTGAATACGATTCTTGTGATTGATGATCTAAGGTT
CGTCTTGACGGATGAATGTCCTCCCCTTCCCAGTTCGTCTGCAATTCGAACAGTTCGGGAAGCATTTGAAAAATGGACTAGGGCTAATGATAAAGCTCGGGTCTACATCT
TAGCAAGCCTATCTGATGTGTTGTCTAAGAAACATGAGAGCATGATTACCGCAAAGGAGATCATGGGATCATTACAAGCCATGTTTGGACAACTGTCCTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGCTCTTTTATTCAATTACTCGCCTCCGAAAAACTTAACGGTGATAACTTTGAAACTTGGAAATCAAACCTGAATACGATTCTTGTGATTGATGATCTAAGGTT
CGTCTTGACGGATGAATGTCCTCCCCTTCCCAGTTCGTCTGCAATTCGAACAGTTCGGGAAGCATTTGAAAAATGGACTAGGGCTAATGATAAAGCTCGGGTCTACATCT
TAGCAAGCCTATCTGATGTGTTGTCTAAGAAACATGAGAGCATGATTACCGCAAAGGAGATCATGGGATCATTACAAGCCATGTTTGGACAACTGTCCTTGTAG
Protein sequenceShow/hide protein sequence
MSSSFIQLLASEKLNGDNFETWKSNLNTILVIDDLRFVLTDECPPLPSSSAIRTVREAFEKWTRANDKARVYILASLSDVLSKKHESMITAKEIMGSLQAMFGQLSL