CuGenDBv2

Lag0020546 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0020546
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon 297
Genome location: chr7:252505..253608
RNA-Seq Expression: Lag0020546
Synteny: Lag0020546
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
InterPro domains:
IPR016197 - Chromo-like domain superfamily
IPR021109 - Aspartic peptidase domain superfamily
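
The genome location in this record is written as `chr:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), such a string could be parsed and its span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers for illustration and are not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"Unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr7:252505..253608")
print(chrom, start, end, span_length(start, end))  # chr7 252505 253608 1104
```

Under that assumption the gene spans 1104 bp on chr7.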


Homology
GenBank top hits | e value | % identity
RVW41094.1 hypothetical protein CK203_069772 [Vitis vinifera] | 7.3e-29 | 44.37
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        ++ VE+ L+S VD+ +P T+K++G+IG+ EVI+LVDSGATHNF++L+L+Q+L L L     YG+++G  + V+G  IC+ + +S++ +T+V+DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS
         T+VIL + WL TLGD   N + L +   +  + ++L GDPS
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS

RVW54082.1 Retrovirus-related Pol polyprotein from transposon 297 [Vitis vinifera] | 4.7e-28 | 43.66
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        +++VE+ L+S V + +P T+K++G IG+ EVI+LVDSGATHNF++L+L+Q+L L L     YG+++G  + V+G  IC+ + +S++ +T+V+DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS
         T+VIL + WL TLGD   N + L +   +  + ++L GDPS
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS

TXG69438.1 hypothetical protein EZV62_004373 [Acer yangbiense] | 2.1e-36 | 34.45
Query:  EVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGT
        E VEV L+S V + +PKT+K++G++G  +V+ L+D GATHNFI+  L+Q+L+L +   + YG+ +G    VRG  ICK + L L+ I IV++FLP+ +G+
Subjt:  EVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGT

Query:  TNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPSFISSFPS--------------------------------------------------NF
        ++VIL + WL TLG   TN +   + F L    V L GDPS   +  S                                                   F
Subjt:  TNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPSFISSFPS--------------------------------------------------NF

Query:  EVIR---------CSPVCLGKNCSNDK------------VQPSKILGIR-QSPADPSKLEVLIQWTDMEVSEATWEDAVWIVNQFPYFHLEDKMVFWGG
         V++           P  LG   S               V P  +LG+R +S A P  +EVLI+W  +   EATWED + IVNQFP FHLEDK+  W G
Subjt:  EVIR---------CSPVCLGKNCSNDK------------VQPSKILGIR-QSPADPSKLEVLIQWTDMEVSEATWEDAVWIVNQFPYFHLEDKMVFWGG

XP_024026844.1 uncharacterized protein LOC112093161 [Morus notabilis] | 5.6e-29 | 46.1
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        EE+VE+ L+S V + +PKT+KL+G +G  EVI+L+DSGATHNFIA  L++ L L+++   GYG+++G  + V+G  IC+ + +S++ I +++DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDP
        ++++IL V WLETLG    N ++L + F+L G  V L GDP
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDP

XP_024030016.1 uncharacterized protein LOC112094120 [Morus notabilis] | 6.6e-30 | 43.83
Query:  VVVQTSKMASECPLGERP--PEEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICK
        +V++  ++A E   G  P   EEVVE+ L+S V + SPKT+KL+G++   EV VL+DSGAT+NFI++ L++++EL++++   YG+++G  + V+G  +C+
Subjt:  VVVQTSKMASECPLGERP--PEEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICK

Query:  DIVLSLRTITIVQDFLPINMGTTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDP
         IV+SL+ I IV+DFLP+ +G++++IL + WLETLG  T N +SL + F +   +V L GDP
Subjt:  DIVLSLRTITIVQDFLPINMGTTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDP

TrEMBL top hits | e value | % identity
A0A438E000 Retrotrans_gag domain-containing protein | 3.5e-29 | 44.37
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        ++ VE+ L+S VD+ +P T+K++G+IG+ EVI+LVDSGATHNF++L+L+Q+L L L     YG+++G  + V+G  IC+ + +S++ +T+V+DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS
         T+VIL + WL TLGD   N + L +   +  + ++L GDPS
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS

A0A438F372 Retrovirus-related Pol polyprotein from transposon 297 | 2.3e-28 | 43.66
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        +++VE+ L+S V + +P T+K++G IG+ EVI+LVDSGATHNF++L+L+Q+L L L     YG+++G  + V+G  IC+ + +S++ +T+V+DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS
         T+VIL + WL TLGD   N + L +   +  + ++L GDPS
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS

A0A438HFS6 Uncharacterized protein | 1.9e-27 | 43.26
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        ++ VE+ L+S V +  P T+K++G IG+ EVI+LVDSGATHNF++L+L+Q+L L L     YG+++G  + V+G  IC+ + +S++ +T+V+DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDP
         T+VIL + WL TLGD   N + L +   +  + ++L GDP
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDP

A0A5C7IJS7 Uncharacterized protein | 1.0e-36 | 34.45
Query:  EVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGT
        E VEV L+S V + +PKT+K++G++G  +V+ L+D GATHNFI+  L+Q+L+L +   + YG+ +G    VRG  ICK + L L+ I IV++FLP+ +G+
Subjt:  EVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGT

Query:  TNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPSFISSFPS--------------------------------------------------NF
        ++VIL + WL TLG   TN +   + F L    V L GDPS   +  S                                                   F
Subjt:  TNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPSFISSFPS--------------------------------------------------NF

Query:  EVIR---------CSPVCLGKNCSNDK------------VQPSKILGIR-QSPADPSKLEVLIQWTDMEVSEATWEDAVWIVNQFPYFHLEDKMVFWGG
         V++           P  LG   S               V P  +LG+R +S A P  +EVLI+W  +   EATWED + IVNQFP FHLEDK+  W G
Subjt:  EVIR---------CSPVCLGKNCSNDK------------VQPSKILGIR-QSPADPSKLEVLIQWTDMEVSEATWEDAVWIVNQFPYFHLEDKMVFWGG

A5B2I6 Reverse transcriptase domain-containing protein | 8.7e-28 | 43.66
Query:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG
        ++ VE+ L+S V + +P T+K++G IG+ EVI+LVDSGATHNF++L+L+Q+L L L     YG+++G  + V+G  IC+ + +S++ +T+V+DFLP+ +G
Subjt:  EEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMG

Query:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS
         T+VIL + WL TLGD   N + L +   +  + ++L GDPS
Subjt:  TTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPS

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT3G29750.1 Eukaryotic aspartyl protease family protein | 4.8e-10 | 30.47
Query:  VDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGTT--NVILEVH
        +D+   K ++  G I   +V+V +DSGAT NFI ++L   L+L  +  +   ++LG    ++    C  I L ++ + I ++FL +++  T  +VIL   
Subjt:  VDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGTT--NVILEVH

Query:  WLETLGDETTNHQSLHLSFTLNGSKVIL
        WL  LG+   N Q+   SF+ N   + L
Subjt:  WLETLGDETTNHQSLHLSFTLNGSKVIL

AT3G30770.1 Eukaryotic aspartyl protease family protein | 1.2e-05 | 28
Query:  EVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGTTNV
        +V   S  +    K ++  G I   +V+V++DSGAT+NFI+ +L   L+L  +  +   ++LG    ++    C  I L ++ + I ++FL +++  T+V
Subjt:  EVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICKDIVLSLRTITIVQDFLPINMGTTNV


Sequences
CDS sequence
ATGTTAGGAGTTGGAGTAGAAGAGCAAGTAGGAGAAGTGGTCGTGCAGACTTCAAAGATGGCCAGCGAATGTCCATTGGGAGAAAGACCGCCAGAAGAAGTAGTGGAGGT
TGGACTGCATTCCAAGGTCGATATTGGATCTCCGAAAACCATCAAGTTGGAGGGTTTAATTGGAGCCACAGAGGTTATTGTCTTGGTTGACAGTGGTGCTACACATAATT
TCATTGCTTTGAAGTTGCTTCAAGAGCTTGAATTGTCGTTAAATGAGAACGATGGATACGGTATTATTTTGGGGTACTGGGTCATTGTTCGCGGGACTGAGATTTGTAAG
GATATTGTGCTTTCTTTGAGGACCATAACAATTGTTCAAGATTTTCTGCCTATCAATATGGGTACCACTAATGTTATTTTGGAAGTTCATTGGCTTGAAACTCTGGGAGA
CGAAACTACTAATCATCAGTCACTACACCTAAGTTTCACCTTGAATGGTTCTAAAGTCATTCTTCATGGTGATCCTTCCTTTATTAGTTCGTTCCCAAGTAACTTTGAAG
TCATCCGGTGTTCACCAGTTTGTTTGGGGAAGAATTGTTCAAATGACAAGGTGCAACCCTCGAAGATTTTGGGCATTCGACAGTCACCTGCTGATCCTTCAAAATTAGAG
GTGTTGATCCAGTGGACTGATATGGAGGTTTCTGAAGCTACATGGGAAGATGCAGTGTGGATTGTAAATCAATTTCCTTATTTTCATCTTGAGGACAAGATGGTTTTTTG
GGGTGGGTAG
mRNA sequence
ATGTTAGGAGTTGGAGTAGAAGAGCAAGTAGGAGAAGTGGTCGTGCAGACTTCAAAGATGGCCAGCGAATGTCCATTGGGAGAAAGACCGCCAGAAGAAGTAGTGGAGGT
TGGACTGCATTCCAAGGTCGATATTGGATCTCCGAAAACCATCAAGTTGGAGGGTTTAATTGGAGCCACAGAGGTTATTGTCTTGGTTGACAGTGGTGCTACACATAATT
TCATTGCTTTGAAGTTGCTTCAAGAGCTTGAATTGTCGTTAAATGAGAACGATGGATACGGTATTATTTTGGGGTACTGGGTCATTGTTCGCGGGACTGAGATTTGTAAG
GATATTGTGCTTTCTTTGAGGACCATAACAATTGTTCAAGATTTTCTGCCTATCAATATGGGTACCACTAATGTTATTTTGGAAGTTCATTGGCTTGAAACTCTGGGAGA
CGAAACTACTAATCATCAGTCACTACACCTAAGTTTCACCTTGAATGGTTCTAAAGTCATTCTTCATGGTGATCCTTCCTTTATTAGTTCGTTCCCAAGTAACTTTGAAG
TCATCCGGTGTTCACCAGTTTGTTTGGGGAAGAATTGTTCAAATGACAAGGTGCAACCCTCGAAGATTTTGGGCATTCGACAGTCACCTGCTGATCCTTCAAAATTAGAG
GTGTTGATCCAGTGGACTGATATGGAGGTTTCTGAAGCTACATGGGAAGATGCAGTGTGGATTGTAAATCAATTTCCTTATTTTCATCTTGAGGACAAGATGGTTTTTTG
GGGTGGGTAG
Protein sequence
MLGVGVEEQVGEVVVQTSKMASECPLGERPPEEVVEVGLHSKVDIGSPKTIKLEGLIGATEVIVLVDSGATHNFIALKLLQELELSLNENDGYGIILGYWVIVRGTEICK
DIVLSLRTITIVQDFLPINMGTTNVILEVHWLETLGDETTNHQSLHLSFTLNGSKVILHGDPSFISSFPSNFEVIRCSPVCLGKNCSNDKVQPSKILGIRQSPADPSKLE
VLIQWTDMEVSEATWEDAVWIVNQFPYFHLEDKMVFWGG