; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036871 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036871
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr2:1791815..1792646
RNA-Seq ExpressionLag0036871
SyntenyLag0036871
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045111.1 putative glutathione S-transferase isoform X1 [Cucumis melo var. makuwa]8.5e-3439.9Show/hide
Query:  FRCQSVKKFIDPDYDVPTKFLTSTEE-------SSSA---KISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHE
        F    ++ ++D + + P K++  T         SSSA    I NP+Y+ W RQD LI++W L S+   I+ ++L C S +++W  L   +SS+++A+  +
Subjt:  FRCQSVKKFIDPDYDVPTKFLTSTEE-------SSSA---KISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHE

Query:  LKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSV
         K KL  MKKG + L+E+F KI+  VDALA+  KPIS DDH+ YI+ GLG EY   +SV++   + P +Q V S+LLTQES+++    + I S+ +LP+V
Subjt:  LKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSV

Query:  NLT
        N+T
Subjt:  NLT

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.5e-3542.33Show/hide
Query:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL
        ++ F++ + + P+K+L STE SS++     NP Y+ W RQD LI++W L S+S  I+ ++L CKS +E+W  L   FSS+++A+  + K KL  +KKG++
Subjt:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL

Query:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL
         L+E+F KI   VDALA+  KP+S DDH+ YI+ GLG +Y   +SV++   + P +Q V S+LLTQES+    + + + S+  LPSVN+
Subjt:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.5e-3542.33Show/hide
Query:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL
        ++ F++ + + P+K+L STE SS++     NP Y+ W RQD LI++W L S+S  I+ ++L CKS +E+W  L   FSS+++A+  + K KL  +KKG++
Subjt:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL

Query:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL
         L+E+F KI   VDALA+  KP+S DDH+ YI+ GLG +Y   +SV++   + P +Q V S+LLTQES+    + + + S+  LPSVN+
Subjt:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL

XP_022136882.1 dr1-associated corepressor homolog isoform X1 [Momordica charantia]2.5e-3349.08Show/hide
Query:  IRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLG
        ++QD LIT+W  +S+   I+ E++ C + REVW  L   ++S+++ARV +LK+KLE +KKGNL L+++F K+K LVD+LAAAGK ++ +DH+ +I+ GL 
Subjt:  IRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLG

Query:  PEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNSN
         E++ TVSV++   +   LQ VYS+LL+ E R +R+   SIN+DGTLPSVNLTQ   Q  NSN
Subjt:  PEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNSN

XP_022154487.1 uncharacterized protein LOC111021757 [Momordica charantia]1.2e-4349.25Show/hide
Query:  VKKFIDPDYDVPTKFLTSTEE--SSSAKISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL
        ++ +ID + D P +F+ +TE+  SSS+   NP Y  WI+QD LI+AW L S++  I++++LDCKS RE+W  L   F+S+ +ARV +LK KLE  KKGNL
Subjt:  VKKFIDPDYDVPTKFLTSTEE--SSSAKISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL

Query:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNS
         L+++F KIKNLVD+LA AGK +S +DH+ +I+ GLGPE+D  +SV+T  +    LQ V S+LL QE R +R+    INSDG+LPSVNLT +   + N+
Subjt:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNS

TrEMBL top hitse value%identityAlignment
A0A5A7TUB3 Putative glutathione S-transferase isoform X14.1e-3439.9Show/hide
Query:  FRCQSVKKFIDPDYDVPTKFLTSTEE-------SSSA---KISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHE
        F    ++ ++D + + P K++  T         SSSA    I NP+Y+ W RQD LI++W L S+   I+ ++L C S +++W  L   +SS+++A+  +
Subjt:  FRCQSVKKFIDPDYDVPTKFLTSTEE-------SSSA---KISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHE

Query:  LKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSV
         K KL  MKKG + L+E+F KI+  VDALA+  KPIS DDH+ YI+ GLG EY   +SV++   + P +Q V S+LLTQES+++    + I S+ +LP+V
Subjt:  LKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSV

Query:  NLT
        N+T
Subjt:  NLT

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-3542.33Show/hide
Query:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL
        ++ F++ + + P+K+L STE SS++     NP Y+ W RQD LI++W L S+S  I+ ++L CKS +E+W  L   FSS+++A+  + K KL  +KKG++
Subjt:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL

Query:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL
         L+E+F KI   VDALA+  KP+S DDH+ YI+ GLG +Y   +SV++   + P +Q V S+LLTQES+    + + + S+  LPSVN+
Subjt:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-3542.33Show/hide
Query:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL
        ++ F++ + + P+K+L STE SS++     NP Y+ W RQD LI++W L S+S  I+ ++L CKS +E+W  L   FSS+++A+  + K KL  +KKG++
Subjt:  VKKFIDPDYDVPTKFLTSTEESSSAKIS--NPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL

Query:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL
         L+E+F KI   VDALA+  KP+S DDH+ YI+ GLG +Y   +SV++   + P +Q V S+LLTQES+    + + + S+  LPSVN+
Subjt:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNL

A0A6J1C8R2 dr1-associated corepressor homolog isoform X21.2e-3349.08Show/hide
Query:  IRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLG
        ++QD LIT+W  +S+   I+ E++ C + REVW  L   ++S+++ARV +LK+KLE +KKGNL L+++F K+K LVD+LAAAGK ++ +DH+ +I+ GL 
Subjt:  IRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLG

Query:  PEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNSN
         E++ TVSV++   +   LQ VYS+LL+ E R +R+   SIN+DGTLPSVNLTQ   Q  NSN
Subjt:  PEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNSN

A0A6J1DLT9 uncharacterized protein LOC1110217575.8e-4449.25Show/hide
Query:  VKKFIDPDYDVPTKFLTSTEE--SSSAKISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL
        ++ +ID + D P +F+ +TE+  SSS+   NP Y  WI+QD LI+AW L S++  I++++LDCKS RE+W  L   F+S+ +ARV +LK KLE  KKGNL
Subjt:  VKKFIDPDYDVPTKFLTSTEE--SSSAKISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTKLELMKKGNL

Query:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNS
         L+++F KIKNLVD+LA AGK +S +DH+ +I+ GLGPE+D  +SV+T  +    LQ V S+LL QE R +R+    INSDG+LPSVNLT +   + N+
Subjt:  GLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)5.2e-1327.33Show/hide
Query:  WIRQDSLITAWFLASISPAIVAELLDCKST-REVWLHLSTRFSSKHVARVHELKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGG
        W  +D L+  W   +I+ +++  ++    T R++WL L   F     AR  + + +L      +L + E+  K+K+L D L     PIS    V +++ G
Subjt:  WIRQDSLITAWFLASISPAIVAELLDCKST-REVWLHLSTRFSSKHVARVHELKTKLELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGG

Query:  LGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQ
        L  +YD  ++V+      P      SMLL +ESRL   S +S++        N+  + P+Q
Subjt:  LGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAGAATTCATCTGCTGTGGTCACTGCTGTCTTTTTCATATTCTTTAGATGTCAGAGCGTTAAGAAATTCATAGATCCAGATTATGATGTTCCTACTAAATTTCT
GACATCTACTGAGGAATCATCTTCCGCGAAGATCTCCAATCCAGATTATGAGTACTGGATTCGACAGGACAGTTTAATCACGGCCTGGTTTCTTGCTTCGATCTCGCCTG
CAATTGTTGCTGAATTACTTGATTGTAAATCGACCAGAGAAGTGTGGCTACATCTCTCAACTCGTTTTTCTTCAAAACACGTTGCTAGGGTTCATGAATTGAAAACCAAG
CTTGAACTAATGAAGAAAGGGAATTTAGGGCTTCAAGAATTCTTCACCAAGATTAAAAATCTAGTTGATGCCTTAGCCGCTGCTGGGAAGCCTATATCGCACGATGATCA
TGTTCACTATATTATTGGAGGATTGGGGCCAGAATATGACCCTACTGTTTCTGTTCTTACCAGAAACGATGAAATGCCACCGTTGCAACGAGTGTACTCCATGCTTCTTA
CTCAAGAAAGTCGACTCCAGCGTCATTCTACTACTTCGATTAATAGTGATGGTACTCTGCCATCTGTTAATCTGACTCAATCCAAACCACAGCAGCCTAACTCTAATGTG
AATGATCCTACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAGAATTCATCTGCTGTGGTCACTGCTGTCTTTTTCATATTCTTTAGATGTCAGAGCGTTAAGAAATTCATAGATCCAGATTATGATGTTCCTACTAAATTTCT
GACATCTACTGAGGAATCATCTTCCGCGAAGATCTCCAATCCAGATTATGAGTACTGGATTCGACAGGACAGTTTAATCACGGCCTGGTTTCTTGCTTCGATCTCGCCTG
CAATTGTTGCTGAATTACTTGATTGTAAATCGACCAGAGAAGTGTGGCTACATCTCTCAACTCGTTTTTCTTCAAAACACGTTGCTAGGGTTCATGAATTGAAAACCAAG
CTTGAACTAATGAAGAAAGGGAATTTAGGGCTTCAAGAATTCTTCACCAAGATTAAAAATCTAGTTGATGCCTTAGCCGCTGCTGGGAAGCCTATATCGCACGATGATCA
TGTTCACTATATTATTGGAGGATTGGGGCCAGAATATGACCCTACTGTTTCTGTTCTTACCAGAAACGATGAAATGCCACCGTTGCAACGAGTGTACTCCATGCTTCTTA
CTCAAGAAAGTCGACTCCAGCGTCATTCTACTACTTCGATTAATAGTGATGGTACTCTGCCATCTGTTAATCTGACTCAATCCAAACCACAGCAGCCTAACTCTAATGTG
AATGATCCTACCTGA
Protein sequenceShow/hide protein sequence
MRKNSSAVVTAVFFIFFRCQSVKKFIDPDYDVPTKFLTSTEESSSAKISNPDYEYWIRQDSLITAWFLASISPAIVAELLDCKSTREVWLHLSTRFSSKHVARVHELKTK
LELMKKGNLGLQEFFTKIKNLVDALAAAGKPISHDDHVHYIIGGLGPEYDPTVSVLTRNDEMPPLQRVYSMLLTQESRLQRHSTTSINSDGTLPSVNLTQSKPQQPNSNV
NDPT