; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035321 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035321
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGag/pol protein
Genome locationchr3:18818695..18819015
RNA-Seq ExpressionLag0035321
SyntenyLag0035321
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-3063.21Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MSSS+IALLK ++LT EN+ TWKS LN ILV+ DLRFVL EECPP P   A+Q++KDA+ HWTKAN+KA +Y+LASL ++L+K++E + TAR+IM+SL+E
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        M G PS
Subjt:  MSGLPS

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa]2.1e-3063.21Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MSSS+IALLK ++LT EN+ TWKS LN ILV+ DLRFVL EECPP P   A+Q++KDA+ HWTKAN+KA +Y+LASL ++L+K++E + TAR+IM+SL+E
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        M G PS
Subjt:  MSGLPS

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]5.1e-2961.32Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MS+S I LL ++KL  +N+  WKSNLNTILV++DLRFVLTEECPP P   A + ++DA+  W KANEKARVYILAS+ EVL+K++E + T REIM+SLQ 
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        + G PS
Subjt:  MSGLPS

XP_022158791.1 uncharacterized protein LOC111025258 [Momordica charantia]1.5e-2861.32Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MS+S I LL ++KL  +N+  WKSNLNTILV++DLRFVLTEECPP     + Q ++DAH  W KANEKARVYILAS+ +VL+K++E + TAREIM+SLQ 
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        + G PS
Subjt:  MSGLPS

XP_038882358.1 uncharacterized protein LOC120073622 [Benincasa hispida]8.7e-2959.43Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        M+SS+I LL +EKL  +N+  WKSNLNTILVV+DLRFVLTEECP  P   A + +++A+  W KANEKAR+YILAS+ +VLAK++E++ TA+EI++SL+E
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        + G PS
Subjt:  MSGLPS

TrEMBL top hitse value%identityAlignment
A0A5A7TZD0 Gag/pol protein1.2e-2860.38Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MSSS+IALLK ++LT EN+ TWKS LN ILV+ DL FVL EECPP P   A+Q+++DA+  WTKAN+KAR++ILAS+ ++L+K++E + TAR+IM+SL+E
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        M G PS
Subjt:  MSGLPS

A0A5A7VA67 Gag/pol protein1.0e-3063.21Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MSSS+IALLK ++LT EN+ TWKS LN ILV+ DLRFVL EECPP P   A+Q++KDA+ HWTKAN+KA +Y+LASL ++L+K++E + TAR+IM+SL+E
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        M G PS
Subjt:  MSGLPS

A0A5D3D0D9 Gag/pol protein1.0e-3063.21Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MSSS+IALLK ++LT EN+ TWKS LN ILV+ DLRFVL EECPP P   A+Q++KDA+ HWTKAN+KA +Y+LASL ++L+K++E + TAR+IM+SL+E
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        M G PS
Subjt:  MSGLPS

A0A6J1DWG6 uncharacterized protein LOC1110250212.5e-2961.32Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MS+S I LL ++KL  +N+  WKSNLNTILV++DLRFVLTEECPP P   A + ++DA+  W KANEKARVYILAS+ EVL+K++E + T REIM+SLQ 
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        + G PS
Subjt:  MSGLPS

A0A6J1E205 uncharacterized protein LOC1110252587.2e-2961.32Show/hide
Query:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE
        MS+S I LL ++KL  +N+  WKSNLNTILV++DLRFVLTEECPP     + Q ++DAH  W KANEKARVYILAS+ +VL+K++E + TAREIM+SLQ 
Subjt:  MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQE

Query:  MSGLPS
        + G PS
Subjt:  MSGLPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCAGTTATTGCCTTGCTTAAAAACGAAAAATTAACGAGCGAGAATTTCCCAACATGGAAATCGAACTTAAACACAATACTTGTGGTAGAAGATCTGAGGTT
CGTCTTGACGGAGGAGTGTCCTCCAGTTCCTCCTTGCACTGCCACTCAGGCAATAAAGGATGCCCACGGACACTGGACGAAGGCCAATGAAAAGGCCCGAGTCTATATAT
TGGCCAGCTTGCCAGAAGTATTGGCCAAGCGTTATGAAAACGTGGAAACTGCCAGGGAGATTATGAATTCCCTGCAGGAGATGTCTGGACTTCCATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCAGTTATTGCCTTGCTTAAAAACGAAAAATTAACGAGCGAGAATTTCCCAACATGGAAATCGAACTTAAACACAATACTTGTGGTAGAAGATCTGAGGTT
CGTCTTGACGGAGGAGTGTCCTCCAGTTCCTCCTTGCACTGCCACTCAGGCAATAAAGGATGCCCACGGACACTGGACGAAGGCCAATGAAAAGGCCCGAGTCTATATAT
TGGCCAGCTTGCCAGAAGTATTGGCCAAGCGTTATGAAAACGTGGAAACTGCCAGGGAGATTATGAATTCCCTGCAGGAGATGTCTGGACTTCCATCCTAA
Protein sequenceShow/hide protein sequence
MSSSVIALLKNEKLTSENFPTWKSNLNTILVVEDLRFVLTEECPPVPPCTATQAIKDAHGHWTKANEKARVYILASLPEVLAKRYENVETAREIMNSLQEMSGLPS