CuGenDBv2

Tan0008528 (gene) of Snake gourd v1 genome

Gene ID: Tan0008528
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG01:66620471..66620806
RNA-Seq Expression: Tan0008528
Synteny: Tan0008528
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
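The genome location above uses a `SEQID:START..END` form. As a minimal sketch, the snippet below parses that string and computes the length of the span. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive on both ends, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location):
    """Number of bases covered by the location.

    Assumption: coordinates are 1-based and inclusive on both ends.
    """
    _, start, end = parse_location(location)
    return abs(end - start) + 1


seqid, start, end = parse_location("LG01:66620471..66620806")
print(seqid, start, end, span_length("LG01:66620471..66620806"))
```

Under that assumption the span is 336 bases, which is a multiple of three and matches the length of the CDS listed for this gene.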


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia] | e-value: 1.8e-40 | identity: 77.06%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLRFVLTEECPP P+  ANR VRDAYDRW++ANEKARVYILASIS+VLSKKHE + T +EIM SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        +FGQPS+++
Subjt:  MFGQPSSSV
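The % identity figure listed for a hit can be cross-checked against the alignment midline, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch. The sketch below does this for the XP_022158568.1 alignment above. The function name `percent_identity` is hypothetical, and the code assumes identity is defined as identical columns divided by total aligned columns.

```python
def percent_identity(query_blocks, midline_blocks):
    """Percent identity from paired query and midline alignment blocks.

    Assumption: identity = identical columns / aligned columns, where a
    letter in the midline marks an identical column.
    """
    aligned = 0
    identical = 0
    for query, midline in zip(query_blocks, midline_blocks):
        # Pad the midline in case trailing spaces were trimmed.
        midline = midline.ljust(len(query))
        aligned += len(query)
        identical += sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned


query_blocks = [
    "MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA",
    "MFGQPSSSV",
]
midline_blocks = [
    "MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLRFVLTEECPP P+  ANR VRDAYDRW++ANEKARVYILASIS+VLSKKHE + T +EIM SLQA",
    "+FGQPS+++",
]

print(round(percent_identity(query_blocks, midline_blocks), 2))
```

For this hit the midline contains 84 identical columns out of 109, which agrees with the 77.06 reported in the hit line.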

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia] | e-value: 5.7e-39 | identity: 76.15%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLRFVLTEECPP  +  +N+ VRDA+DRW +ANEKARVYILASISDVLSKKHE + TA+EIM SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        +FGQPS+S+
Subjt:  MFGQPSSSV

XP_022159023.1 uncharacterized protein LOC111025468 [Momordica charantia] | e-value: 1.7e-38 | identity: 74.31%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLR VLTEECPP P+  ANR VR+AYDRW++AN+KARVYILASISDVLSKKHE + TA+E+M SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        + GQP +S+
Subjt:  MFGQPSSSV

XP_038880476.1 uncharacterized protein LOC120072136 [Benincasa hispida] | e-value: 1.7e-38 | identity: 75%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S +Q LA  KL+ DNYGTWKSNLNTILV+DDL+FVLTEECPP+P+   NR + DA+DRW +ANEKA+VYILASISD+LSKKHE MV AKEIM SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSS
        +FGQPSSS
Subjt:  MFGQPSSS

XP_038902401.1 uncharacterized protein LOC120089040 [Benincasa hispida] | e-value: 9.7e-39 | identity: 76.15%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MSNS IQLL  +K +G++Y  WKSNLNTILV+DDLRFVLTEECPP+PS T NR V+DAYDRW+RANEKAR YIL SISDVL+KK+ETM+ A EIM SLQ 
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        MFGQPSSSV
Subjt:  MFGQPSSSV

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T0E9 Gag/pol protein | e-value: 2.0e-37 | identity: 71.56%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        M++S +QLLA  KL+GDNY TWKSNLNTILV+DDLRF+LTEECP  P+S ANR  R+AYDRWI+ANEKARVYILAS+SDVL+KKHE + T KEI+ SL+ 
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        MFGQP  S+
Subjt:  MFGQPSSSV

A0A5A7TXW7 Gag/pol protein | e-value: 2.3e-38 | identity: 73.39%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        M+NS +QLLA  KL+GDNY TWK NLNTILV++DLRFVLTEECP  P+STANR VR+AYDRW++ANEKARVYI+A++SDVL+KKHE++ TAKEIM SL  
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        MFGQPS S+
Subjt:  MFGQPSSSV

A0A6J1DWG6 uncharacterized protein LOC111025021 | e-value: 8.6e-41 | identity: 77.06%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLRFVLTEECPP P+  ANR VRDAYDRW++ANEKARVYILASIS+VLSKKHE + T +EIM SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        +FGQPS+++
Subjt:  MFGQPSSSV

A0A6J1DXP1 uncharacterized protein LOC111025468 | e-value: 8.0e-39 | identity: 74.31%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLR VLTEECPP P+  ANR VR+AYDRW++AN+KARVYILASISDVLSKKHE + TA+E+M SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        + GQP +S+
Subjt:  MFGQPSSSV

A0A6J1E205 uncharacterized protein LOC111025258 | e-value: 2.8e-39 | identity: 76.15%
Query:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA
        MS S IQLLA +KL+GDNYG WKSNLNTILV+DDLRFVLTEECPP  +  +N+ VRDA+DRW +ANEKARVYILASISDVLSKKHE + TA+EIM SLQA
Subjt:  MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQA

Query:  MFGQPSSSV
        +FGQPS+S+
Subjt:  MFGQPSSSV

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCGAACTCTTTTATCCAGTTACTCGCCTTCAATAAACTTAGCGGCGATAATTATGGAACCTGGAAATCAAACTTGAATACGATTCTTGTTCTTGATGATCTGAGGTT
CGTCTTAACAGAGGAATGTCCTCCCCTTCCCAGCTCGACTGCAAACCGAATTGTTCGGGATGCTTATGATAGATGGATTAGGGCTAATGAGAAGGCCCGAGTCTATATCT
TAGCCAGCATATCTGATGTGTTGTCTAAGAAACATGAGACCATGGTCACCGCAAAGGAGATCATGGGATCATTACAGGCGATGTTTGGACAACCGTCCTCATCGGTCCCA
TTATGA
mRNA sequence
ATGTCGAACTCTTTTATCCAGTTACTCGCCTTCAATAAACTTAGCGGCGATAATTATGGAACCTGGAAATCAAACTTGAATACGATTCTTGTTCTTGATGATCTGAGGTT
CGTCTTAACAGAGGAATGTCCTCCCCTTCCCAGCTCGACTGCAAACCGAATTGTTCGGGATGCTTATGATAGATGGATTAGGGCTAATGAGAAGGCCCGAGTCTATATCT
TAGCCAGCATATCTGATGTGTTGTCTAAGAAACATGAGACCATGGTCACCGCAAAGGAGATCATGGGATCATTACAGGCGATGTTTGGACAACCGTCCTCATCGGTCCCA
TTATGA
Protein sequence
MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVP
L
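As a consistency check between the sequences above, the sketch below translates the CDS and compares the result with the listed protein sequence. It assumes the standard nuclear genetic code applies to this gene. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard genetic code in the usual compact TCAG ordering
# (assumption: the standard nuclear code applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from the record above (line wraps removed).
CDS = (
    "ATGTCGAACTCTTTTATCCAGTTACTCGCCTTCAATAAACTTAGCGGCGATAATTATGGAACCTGGAAATCAAACTTGAATACGATTCTTGTTCTTGATGATCTGAGGTT"
    "CGTCTTAACAGAGGAATGTCCTCCCCTTCCCAGCTCGACTGCAAACCGAATTGTTCGGGATGCTTATGATAGATGGATTAGGGCTAATGAGAAGGCCCGAGTCTATATCT"
    "TAGCCAGCATATCTGATGTGTTGTCTAAGAAACATGAGACCATGGTCACCGCAAAGGAGATCATGGGATCATTACAGGCGATGTTTGGACAACCGTCCTCATCGGTCCCA"
    "TTATGA"
)
PROTEIN = (
    "MSNSFIQLLAFNKLSGDNYGTWKSNLNTILVLDDLRFVLTEECPPLPSSTANRIVRDAYDRWIRANEKARVYILASISDVLSKKHETMVTAKEIMGSLQAMFGQPSSSVP"
    "L"
)

translated = translate(CDS)
# The translation carries a trailing '*' for the stop codon.
print(len(CDS), translated.rstrip("*") == PROTEIN)
```

The CDS and mRNA sequences listed for this gene are identical, so the same check applies to either.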