; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008067 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008067
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon opus
Genome locationchr9:11292411..11294039
RNA-Seq ExpressionLag0008067
SyntenyLag0008067
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]2.0e-2843.52Show/hide
Query:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFA----------
        QWS  R  +S+KV  V+EVD ++ +   +A++ N LKN+ +  +VQ        A+  Q A  SCVYCG+ H +E CP+N ASV +VG +          
Subjt:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFA----------

Query:  ---PAKFGY----------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS
           P  F                  SLE +M+DYMA+ND +IQSQ ASL+ LE Q+GQLAN+LK RPQG +PSD E+P R+GK+  +AVTLRS
Subjt:  ---PAKFGY----------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]1.3e-2737.15Show/hide
Query:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG------------
        QWS  R  +S+KV  V+EVD ++ +   +A++ N LKN+ +  +VQ        A+  Q A  SCVYCG+ H +E CP+NLASV +VG            
Subjt:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG------------

Query:  --FAPA-----KFGY------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIR
          + PA      F +                              SLE +M+DYMA+ND +IQSQ ASLR LE Q+GQLAN+LK RPQG +PSD E+P R
Subjt:  --FAPA-----KFGY------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIR

Query:  EGKKQVQAVTLRS--------------ESGSGQYDGGSSKDVGAISSVPDVEPPPYVPPPP---------YDPPLPFPQRQKSKNQDG
        +GK+  +AVTLRS              E  S Q +G   K      +   VE PP V               PP PFPQR K +  DG
Subjt:  EGKKQVQAVTLRS--------------ESGSGQYDGGSSKDVGAISSVPDVEPPPYVPPPP---------YDPPLPFPQRQKSKNQDG

XP_030497888.1 uncharacterized protein LOC115713544 [Cannabis sativa]1.5e-2827.24Show/hide
Query:  QWS-NVRGSSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFA----------
        QWS N   +S+KV  V+EVD ++ +   +A++ N LKN+ +  ++Q        A+  Q A  SCVY G+ H +E CP+N  S     F+          
Subjt:  QWS-NVRGSSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFA----------

Query:  ---------PAKFGY----------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS-
                 P +F                  SLE +M+DY  +ND +IQSQ ASL+ LE Q+GQLAN+LK+RPQG +PSD ++P R+GK+  +AV LRS 
Subjt:  ---------PAKFGY----------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRS-

Query:  ------ESGSGQYDGGSSKDVGAISSVPDV---EPPPYV---------------PPPPYDPPLPFPQRQ-------------------------------
               + +G  +    +  G +   P +   E PP V               PPPP+  PL F ++Q                               
Subjt:  ------ESGSGQYDGGSSKDVGAISSVPDV---EPPPYV---------------PPPPYDPPLPFPQRQ-------------------------------

Query:  -----------------------------------------------------KSKNQD-----------------------------GSPFLATSRSLM
                                                             + K +D                             G PFLAT R+L+
Subjt:  -----------------------------------------------------KSKNQD-----------------------------GSPFLATSRSLM

Query:  DVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVLDEIAEDHFEKELMEYHTQKFGEIQIEDLEIGGLEHEHKVVGEISSFERNLESLEPIDKESK
        DVQ GE T RV+DQKV F++F+AM++P+++E+CS + V+D I  + F KE+ +    +     +EDLE    + E +V          +E ++P  K  +
Subjt:  DVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVLDEIAEDHFEKELMEYHTQKFGEIQIEDLEIGGLEHEHKVVGEISSFERNLESLEPIDKESK

Query:  PIE
        P E
Subjt:  PIE

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]2.4e-2638.91Show/hide
Query:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG------------
        QWS  R  +S+KV  V+EVD ++ +   +A++ N LKN+ +  +VQ        A+  Q A  SCVYCG+ H +E CP+NLASV +VG            
Subjt:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG------------

Query:  --FAPA-----KFGY--------------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIP
          + PA      F +                                      SLE +M+DYMA+ND +IQSQ ASLR LE Q+GQLAN+LK RPQG +P
Subjt:  --FAPA-----KFGY--------------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIP

Query:  SDIEHPIREGKKQVQAVTLRS
        SD E+P R+GK+  +A+TLRS
Subjt:  SDIEHPIREGKKQVQAVTLRS

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]2.7e-2538.39Show/hide
Query:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG------------
        QWS  R  +S+KV  V+EVD ++ +   +A++ N LKN+ +  +VQ        A+  Q A  SCVYCG+ H +E CP+N ASV +VG            
Subjt:  QWSNVRG-SSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG------------

Query:  --FAPA-----KFGY-----------------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQG
          + PA      F +                                         SLE +M+DYMA+ND +IQSQ ASLR LE Q+GQLAN+LK RPQG
Subjt:  --FAPA-----KFGY-----------------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQG

Query:  NIPSDIEHPIREGKKQVQAVTLRS
         +PSD E+P R+GK+  +AVTLRS
Subjt:  NIPSDIEHPIREGKKQVQAVTLRS

TrEMBL top hitse value%identityAlignment
A0A5B6VBU6 Reverse transcriptase-like protein4.2e-1626.39Show/hide
Query:  KKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFAPAKFGYSLELMMKDYMARND
        ++V    ++D ++++   +++LAN +K       +Q+P  V+ + +V      SCVYCGE+  +E CP+N ASV+ +G            ++K+ +  + 
Subjt:  KKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFAPAKFGYSLELMMKDYMARND

Query:  VIIQSQQASLRVLEFQVGQLANELKARP---QGNIPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPYVPPPPYDPPLPFPQ
          + S+  + + LE Q  +     K  P   + N  S  +H +  G+ +   VTL+  + S  Y                                    
Subjt:  VIIQSQQASLRVLEFQVGQLANELKARP---QGNIPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPYVPPPPYDPPLPFPQ

Query:  RQKSKNQDGSPFLATSRSLMDVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVLDEIAEDHFEKELMEYHTQKFGEIQIEDLE
               +  PFL T R+++DVQ+GE T RV+DQ++ F++F A+KY +D+++C  + +LD I E+ FEK   EYH +K  E    D++
Subjt:  RQKSKNQDGSPFLATSRSLMDVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVLDEIAEDHFEKELMEYHTQKFGEIQIEDLE

A0A5B6VWJ0 Retroelement pol polyprotein-like2.2e-1727.61Show/hide
Query:  QWSNVRGSS-KKVKSVIEVDGVSTIRVDIATLANTLKNITV---VSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG---------
        QW   R +S ++V  + EVD ++++   ++++++  KN+T     S   QPP        NQ    + VYCGE H  E CP+N  SV+++G         
Subjt:  QWSNVRGSS-KKVKSVIEVDGVSTIRVDIATLANTLKNITV---VSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG---------

Query:  -----------------------------------------------FAPAKFGYSLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGN
                                                          AK   SLE ++K YMA+ND +IQSQ A+L+ LE QVGQLA EL+ R QG 
Subjt:  -----------------------------------------------FAPAKFGYSLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGN

Query:  IPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPYVP-------------------------------PPPYDPPLPFPQRQK
        +PSD E+P   GK+  +A+TLRSE           K+        +V+P    P                               PP  + P   P +  
Subjt:  IPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPYVP-------------------------------PPPYDPPLPFPQRQK

Query:  SKNQD---------------------------GSPFLATSRSLMDVQQGEFTTRV
         K +D                           G PFLAT R+++DVQ+GE T RV
Subjt:  SKNQD---------------------------GSPFLATSRSLMDVQQGEFTTRV

A0A6J1EQ90 uncharacterized protein LOC1114364118.5e-1728.47Show/hide
Query:  CQWSNVRGS-SKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG-----------
        CQW++VR +  +K + V+EVD +S+I   +A++ N L+N+ +  +      V + A++NQ AAESCVYCGEEH ++ CP+N AS+F+VG           
Subjt:  CQWSNVRGS-SKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG-----------

Query:  --------------------------FAPAKFGY---------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQ
                                    P K  Y                                 S+E ++K+YMA+ND +IQSQQASLR LE Q+G 
Subjt:  --------------------------FAPAKFGY---------------------------------SLELMMKDYMARNDVIIQSQQASLRVLEFQVGQ

Query:  LANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPYVPPP--PYDPPLPFPQRQKSKNQD
          N  +        +D +               R+E  + Q +   SKD   +   P ++           Y P  PFPQR K K ++
Subjt:  LANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPYVPPP--PYDPPLPFPQRQKSKNQD

A0A6J1EQ90 uncharacterized protein LOC1114364111.2e-0247.06Show/hide
Query:  GSPFLATSRSLMDVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVL
        G PFL   R+L+DV +G  T R+  QKV+F++ D+MKYP  +E+CS +  L
Subjt:  GSPFLATSRSLMDVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVL

A0A6J1EQ90 uncharacterized protein LOC1114364111.4e-1631.47Show/hide
Query:  CQWSNVRGS-SKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG-----------
        CQW++VR +  +K + V+EVD +S+I   +A++ N L+N+ +  +      V ++A +NQ AAESCVYCGEEH ++ CP+N AS+F+VG           
Subjt:  CQWSNVRGS-SKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVG-----------

Query:  -----------------------------------------------------------FAPAKFGYSLELMMKDYMARNDVIIQSQQASLRVLEFQ
                                                                    A    G S+E ++K+YMA+NDV+IQ+QQASLR LE Q
Subjt:  -----------------------------------------------------------FAPAKFGYSLELMMKDYMARNDVIIQSQQASLRVLEFQ

A0A6J1G7Q6 uncharacterized protein LOC1114515981.0e-2234.38Show/hide
Query:  CQWSNVRGS-SKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFA---------
        CQW +VR +  KK + V+EVD +S+I   +A++ N L+N+             +   + Q A ESCVYCGE+H ++ CP+N AS+F+VG           
Subjt:  CQWSNVRGS-SKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLASVFFVGFA---------

Query:  ----------------------------------PAKFGYS---------------------------LELMMKDYMARNDVIIQSQQASLRVLEFQVGQ
                                          P  FG                             LE ++K+YMARND +IQSQQ SLR LE QVGQ
Subjt:  ----------------------------------PAKFGYS---------------------------LELMMKDYMARNDVIIQSQQASLRVLEFQVGQ

Query:  LANELKARPQGNIPSDIEHPIREG
        LANEL+ RP G +P+D E P REG
Subjt:  LANELKARPQGNIPSDIEHPIREG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCCCACTGTTTGCCATATTTTGTCAATGGTCAAACGTTAGAGGCTCTAG
TAAGAAAGTTAAGAGTGTGATAGAAGTGGATGGTGTGTCTACCATTAGGGTCGATATTGCAACATTAGCTAACACTCTTAAAAACATAACTGTGGTAAGCAATGTTCAGC
AGCCACCAGTGGTGGAATCTATTGCATCTGTGAATCAAGTGGCAGCTGAATCTTGTGTCTATTGTGGTGAAGAGCATAATTATGAGTTTTGTCCTAACAATCTAGCTTCT
GTGTTTTTTGTAGGCTTTGCCCCAGCAAAATTCGGGTATTCTCTGGAGTTAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCAAAGTCAACAGGCTTCATT
GAGAGTCCTAGAGTTTCAGGTGGGCCAGCTAGCTAATGAGTTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGG
TGCAGGCAGTGACTTTAAGGAGTGAATCGGGGTCTGGACAATATGATGGAGGCAGCAGCAAAGATGTTGGAGCAATTAGTTCTGTTCCAGATGTAGAACCCCCACCTTAT
GTACCACCCCCACCCTATGACCCACCCTTACCTTTTCCACAAAGGCAGAAGTCTAAGAACCAAGATGGTAGTCCATTTTTGGCAACTAGTAGATCATTGATGGATGTCCA
ACAAGGGGAGTTTACAACGAGGGTGCATGACCAAAAGGTGAAGTTTCATATGTTTGATGCAATGAAATATCCTAATGATCTTGAGGATTGCTCGTTCATTCAAGTGTTGG
ATGAGATTGCTGAGGACCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGCAT
AAAGTTGTAGGTGAGATTTCTAGTTTTGAGAGGAATTTGGAATCCTTAGAGCCAATAGATAAGGAATCTAAGCCTATTGAACCTTATAATTCATTGACATTGCTCCAGCA
ACCTGAGATTAGGAAATCCTTCATTGGTGAACGGTTACTTACTGTAGCTCATATTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTCAATTACCTTGATTTTGGGAA
TTTGCCTCCTGGTTAATCAAAAAGACAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCCCACTGTTTGCCATATTTTGTCAATGGTCAAACGTTAGAGGCTCTAG
TAAGAAAGTTAAGAGTGTGATAGAAGTGGATGGTGTGTCTACCATTAGGGTCGATATTGCAACATTAGCTAACACTCTTAAAAACATAACTGTGGTAAGCAATGTTCAGC
AGCCACCAGTGGTGGAATCTATTGCATCTGTGAATCAAGTGGCAGCTGAATCTTGTGTCTATTGTGGTGAAGAGCATAATTATGAGTTTTGTCCTAACAATCTAGCTTCT
GTGTTTTTTGTAGGCTTTGCCCCAGCAAAATTCGGGTATTCTCTGGAGTTAATGATGAAGGATTATATGGCTCGTAATGATGTCATAATCCAAAGTCAACAGGCTTCATT
GAGAGTCCTAGAGTTTCAGGTGGGCCAGCTAGCTAATGAGTTGAAGGCACGACCTCAAGGGAACATTCCTTCAGATATTGAACACCCTATAAGGGAAGGTAAGAAGCAGG
TGCAGGCAGTGACTTTAAGGAGTGAATCGGGGTCTGGACAATATGATGGAGGCAGCAGCAAAGATGTTGGAGCAATTAGTTCTGTTCCAGATGTAGAACCCCCACCTTAT
GTACCACCCCCACCCTATGACCCACCCTTACCTTTTCCACAAAGGCAGAAGTCTAAGAACCAAGATGGTAGTCCATTTTTGGCAACTAGTAGATCATTGATGGATGTCCA
ACAAGGGGAGTTTACAACGAGGGTGCATGACCAAAAGGTGAAGTTTCATATGTTTGATGCAATGAAATATCCTAATGATCTTGAGGATTGCTCGTTCATTCAAGTGTTGG
ATGAGATTGCTGAGGACCACTTTGAGAAGGAATTGATGGAGTACCATACCCAAAAATTTGGAGAAATCCAAATTGAGGATTTGGAAATAGGTGGATTGGAGCATGAGCAT
AAAGTTGTAGGTGAGATTTCTAGTTTTGAGAGGAATTTGGAATCCTTAGAGCCAATAGATAAGGAATCTAAGCCTATTGAACCTTATAATTCATTGACATTGCTCCAGCA
ACCTGAGATTAGGAAATCCTTCATTGGTGAACGGTTACTTACTGTAGCTCATATTAAGGCAGTGAAAACACCTTGGTATGATGACTTTTCAATTACCTTGATTTTGGGAA
TTTGCCTCCTGGTTAATCAAAAAGACAGATGA
Protein sequenceShow/hide protein sequence
MRKKLLVRLGKGLRSFCESVPPLFAIFCQWSNVRGSSKKVKSVIEVDGVSTIRVDIATLANTLKNITVVSNVQQPPVVESIASVNQVAAESCVYCGEEHNYEFCPNNLAS
VFFVGFAPAKFGYSLELMMKDYMARNDVIIQSQQASLRVLEFQVGQLANELKARPQGNIPSDIEHPIREGKKQVQAVTLRSESGSGQYDGGSSKDVGAISSVPDVEPPPY
VPPPPYDPPLPFPQRQKSKNQDGSPFLATSRSLMDVQQGEFTTRVHDQKVKFHMFDAMKYPNDLEDCSFIQVLDEIAEDHFEKELMEYHTQKFGEIQIEDLEIGGLEHEH
KVVGEISSFERNLESLEPIDKESKPIEPYNSLTLLQQPEIRKSFIGERLLTVAHIKAVKTPWYDDFSITLILGICLLVNQKDR