; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007871 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007871
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:6867865..6869306
RNA-Seq ExpressionLag0007871
SyntenyLag0007871
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3485417.1 reverse transcriptase [Gossypium australe]3.8e-3938.35Show/hide
Query:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------
        M+ +CWN +G+G+P+A+R LR+L K QNP I+FL ET  +  + E ++    +   F + ++GS GGL L WKD                          
Subjt:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------

Query:  -LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHF
          F+GFYG+P   +R   WSLL+RLSQ ++ PW++ GDFNEI+   EK GG ++ Q ++  F D ++ C L D+GYSG ++TW RG +    I+ERLD  
Subjt:  -LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHF

Query:  LANKSF
        + N+ +
Subjt:  LANKSF

XP_023921342.1 uncharacterized protein LOC112032803 [Quercus suber]1.5e-4039.25Show/hide
Query:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------
        M ILCWN +G+GNPQ ++ L  LI+ ++P ++FL+ETW D ++ E+IK++ K++ +  V   G  GG+ ++WK                           
Subjt:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------

Query:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL
         F+GFY  P    R +SW+ L RL     LPW+  GDFNEI+  DEK+GG  R  SQ+  F +V+D C   DLG+ GGK+TW RG  G N I ERLD  +
Subjt:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL

Query:  ANKSFTDSFKDIKI
        A   + D F   K+
Subjt:  ANKSFTDSFKDIKI

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata]5.9e-4039.25Show/hide
Query:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMW--------------------------KD
        MR L WN +G+GNPQ++R LR++++  +P  +FLSET       E+ K+ + + +  V+PS G SGGL L+W                          K 
Subjt:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMW--------------------------KD

Query:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL
          +GFYGNP +H+RK+SW LL+ LS+   LPW+  GDFNEIVS  EK+GG  R Q Q++ F + ID C  +DLG+ G +FTW   + G + +  RLD  L
Subjt:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL

Query:  ANKSFTDSFKDIKI
          + + D +KD+++
Subjt:  ANKSFTDSFKDIKI

XP_030958962.1 uncharacterized protein LOC115980904 [Quercus lobata]7.7e-4036.92Show/hide
Query:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------
        M I+ WN  G+GNP+  + L  +I+ ++P ++F++ETW+D ++ ++IK  + ++++F V      GGL L W++                          
Subjt:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------

Query:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL
         F+GFYG P+ HKR DSW  L  L    NLPW+  GDFNEI+   EK+GG+NRGQ+Q+  F DV+D C  +DLG+SG  +TW +     + I ERLD  L
Subjt:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL

Query:  ANKSFTDSFKDIKI
        A   +   F   K+
Subjt:  ANKSFTDSFKDIKI

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]1.0e-3939.25Show/hide
Query:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMW--------------------------KD
        MR L WN +G+GNPQ++R LR++++  +P  +FLSET       E+ K+ + + +  V+PS G SGGL L+W                          K 
Subjt:  MRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMW--------------------------KD

Query:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL
          +GFYGNP +H+RK+SW LL+ LS+   LPW+  GDFNEIVS  EK+GG  R Q Q++ F + ID C  +DLG+ G +FTW   + G + +  RLD  L
Subjt:  LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFL

Query:  ANKSFTDSFKDIKI
          + + D +KD+++
Subjt:  ANKSFTDSFKDIKI

TrEMBL top hitse value%identityAlignment
A0A2N9FD73 Uncharacterized protein9.5e-4440.27Show/hide
Query:  LHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD---------
        + G    GGW  APP AMR+L WN +G+GNP A+RAL HL+K Q P I+FL ET  D    E+I++ + Y+ +F VPS G SGGL L+WK+         
Subjt:  LHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD---------

Query:  -----------------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWA
                           +GFYG P  H+R +SW+LL+ L++    PW+  GDFNEI+  +EK+G + +   ++  F DV+  C+L+D+GY G +FTW 
Subjt:  -----------------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWA

Query:  RGEVGPNEIKERLDHFLANKSFTDSF
           VG   ++ERLD  LA+ +++  F
Subjt:  RGEVGPNEIKERLDHFLANKSFTDSF

A0A2N9G1F9 Uncharacterized protein3.0e-4239.17Show/hide
Query:  APPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD---------------------
        APP  M +L WN QG+GNP+A+RAL H++K + P+++FL ET  D+ + E I++++ +D+ F VPS G SGGL L+WK                      
Subjt:  APPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD---------------------

Query:  -----LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKER
               +GFYG P  H+R++SW+LL+ LS    LPW   GDFNEI++ +EK GG  R   QI  F + ++ C+ VDLG+ G  +TW         I+ R
Subjt:  -----LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKER

Query:  LDHFLANKSFTDSFKDI
        LD  L + S   SF  +
Subjt:  LDHFLANKSFTDSFKDI

A0A2N9GKW3 Reverse transcriptase domain-containing protein4.0e-4236.36Show/hide
Query:  DNNISSDSKKVNRG--PAYFELHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSK
        DN+ +    + N G  P + +LH   IDGG   APP+AM  L WN +G+GNP+ ++ +  L++ Q+P ++FL ETW D    E+++ ++++ + F+  S+
Subjt:  DNNISSDSKKVNRG--PAYFELHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSK

Query:  GSSGGLMLMWKD--------------------------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFI
           GGL L WK                            F+GFYG P  HKR++SW+LL RL+    LPW   GDFNE+V  +EK G +NR + Q+  F 
Subjt:  GSSGGLMLMWKD--------------------------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFI

Query:  DVIDACSLVDLGYSGGKFTWARGEVGPNEIK-ERLDHFLANKSFTDSFKDIKI
        DV+D C  VDLG++G KFTW      P ++  ERLD  +A   +   F   ++
Subjt:  DVIDACSLVDLGYSGGKFTWARGEVGPNEIK-ERLDHFLANKSFTDSFKDIKI

A0A2N9HDH5 Uncharacterized protein5.7e-4132.51Show/hide
Query:  EGGTSMTMEVAEEENLEISS--KGVLEEEEEDLAEAINNQINRLSLAEQKGIRIMAIEDDDIEDTTKDLSETLWPTKFSPKNSSSGRRELEGKKPKQNIV
        E    +T E+ E  N E  +    V+++E EDL        NR+ L +     I+   D D               K   +  +SG+  +   + + N+V
Subjt:  EGGTSMTMEVAEEENLEISS--KGVLEEEEEDLAEAINNQINRLSLAEQKGIRIMAIEDDDIEDTTKDLSETLWPTKFSPKNSSSGRRELEGKKPKQNIV

Query:  ---TDSNMDCGKRREYEERDNNISSDSKKVNRGPAY--FELHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSK
                  G +R   E+++  S   ++  +G +       G  + GG   APP+ M I+ WN +G+GN +A+ AL +L+K Q P+I+FL ET  D  K
Subjt:  ---TDSNMDCGKRREYEERDNNISSDSKKVNRGPAY--FELHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSK

Query:  CEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVS
         E I++++++   F VPS G SGGL L+W D                           F+GFYGNPI H+R+ SW+LL++L    +LPW++ GDFNEI+S
Subjt:  CEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVS

Query:  DDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFLANKSFTDSF
         DE+ G     Q  +  F +VI+ C LVDLGY G  FTW  G      +++RLD  LA+ S+   F
Subjt:  DDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKERLDHFLANKSFTDSF

A0A2N9HWG1 Reverse transcriptase domain-containing protein1.1e-4434.92Show/hide
Query:  IEDDDIEDTTKDLSETLWPTKFSPKNSSSGRRELEGKK--PKQNIVTDSNMDCGKRREYEERDNNI-------SSDSKKVNRGPAYFELHGG-DIDGGWV
        +E D ++   +DL   +        N   G +    KK   K+NI  +    C       + ++           DS+K   G +Y     G  + GG  
Subjt:  IEDDDIEDTTKDLSETLWPTKFSPKNSSSGRRELEGKK--PKQNIVTDSNMDCGKRREYEERDNNI-------SSDSKKVNRGPAYFELHGG-DIDGGWV

Query:  PAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------
         APP+ M I+ WN +G+GN +A+ AL +L+K Q P+I+FL ET  D  K E I++++++   F VPS G SGGL L+W D                    
Subjt:  PAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSDSSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKD--------------------

Query:  ------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKE
               F+GFYGNPI+H+R++SW+LL++L    +LPW++ GDFNEI+S DE+ G ++  Q  +  F +VI+ C LVDLGY G  FTW  G      I++
Subjt:  ------LFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYSGGKFTWARGEVGPNEIKE

Query:  RLDHFLANKSFTDSF
        RLD  LA+ S+   F
Subjt:  RLDHFLANKSFTDSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G40390.1 DNAse I-like superfamily protein6.5e-0531.43Show/hide
Query:  KRKDSWSLLERLSQCS---NLPWIIGGDFNEIVSDDEKVGGNNRGQSQIN-RFIDVIDAC----SLVDLGYSGGKFTWARGEVGPNEIKERLDHFLANKS
        +R+  W  + RLS  S   N PW++ GDFN+I S  E     +   S I+ + ++ + AC     LVDL   G  +TW+  +   N I  +LD  + N  
Subjt:  KRKDSWSLLERLSQCS---NLPWIIGGDFNEIVSDDEKVGGNNRGQSQIN-RFIDVIDAC----SLVDLGYSGGKFTWARGEVGPNEIKERLDHFLANKS

Query:  FTDSF
        +  +F
Subjt:  FTDSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAGGAGAGGCAGCTGCGAAGCGGCCTAAGGGGATCCGATAGTGATCAAGTTGGGGTCGAAAGAGCAGAGTCCTATCAGTATTGGGAGGGAGGCACGTCCATGAC
CATGGAAGTTGCTGAGGAGGAAAACCTAGAGATCAGTTCGAAAGGGGTATTAGAAGAAGAAGAAGAAGATCTTGCTGAAGCCATAAACAACCAGATCAATAGGCTAAGCC
TTGCTGAACAAAAAGGCATAAGAATTATGGCTATAGAAGATGATGACATAGAAGATACAACAAAAGACTTAAGTGAGACTCTGTGGCCTACAAAATTCTCACCCAAAAAC
TCATCCTCTGGGAGAAGGGAGTTGGAAGGCAAAAAACCAAAACAAAACATAGTTACTGATAGTAACATGGATTGTGGCAAGAGAAGGGAATATGAAGAAAGGGACAACAA
TATTTCTAGTGATTCTAAGAAAGTGAACCGAGGACCTGCTTACTTCGAGTTGCACGGAGGGGATATCGACGGAGGCTGGGTACCAGCCCCACCGAACGCCATGAGGATCC
TATGTTGGAACGCTCAAGGGATGGGGAACCCTCAAGCAATCCGTGCTCTGAGACACCTGATCAAGGGCCAAAACCCCCAGATTATTTTTTTGTCGGAGACATGGAGTGAT
TCTAGTAAATGTGAGAAGATTAAGTTGGAGATGAAATATGATGATATGTTCGTTGTCCCTAGCAAAGGGTCAAGCGGGGGTTTGATGCTCATGTGGAAGGATTTGTTCTC
GGGGTTTTATGGGAACCCGATCATGCACAAGAGAAAGGATTCTTGGAGTTTATTGGAGAGACTATCTCAATGTTCGAATCTCCCCTGGATCATTGGAGGAGATTTCAATG
AGATCGTCTCTGATGATGAGAAAGTGGGTGGCAACAATAGAGGGCAATCCCAAATCAATAGATTCATAGATGTTATCGATGCCTGCAGCTTGGTCGACCTGGGATATAGT
GGTGGTAAATTTACTTGGGCTAGAGGAGAAGTGGGGCCAAATGAGATTAAGGAGAGGCTGGACCATTTTTTAGCCAACAAAAGCTTCACAGACTCCTTTAAGGATATTAA
GATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGACGAGGAGAGGCAGCTGCGAAGCGGCCTAAGGGGATCCGATAGTGATCAAGTTGGGGTCGAAAGAGCAGAGTCCTATCAGTATTGGGAGGGAGGCACGTCCATGAC
CATGGAAGTTGCTGAGGAGGAAAACCTAGAGATCAGTTCGAAAGGGGTATTAGAAGAAGAAGAAGAAGATCTTGCTGAAGCCATAAACAACCAGATCAATAGGCTAAGCC
TTGCTGAACAAAAAGGCATAAGAATTATGGCTATAGAAGATGATGACATAGAAGATACAACAAAAGACTTAAGTGAGACTCTGTGGCCTACAAAATTCTCACCCAAAAAC
TCATCCTCTGGGAGAAGGGAGTTGGAAGGCAAAAAACCAAAACAAAACATAGTTACTGATAGTAACATGGATTGTGGCAAGAGAAGGGAATATGAAGAAAGGGACAACAA
TATTTCTAGTGATTCTAAGAAAGTGAACCGAGGACCTGCTTACTTCGAGTTGCACGGAGGGGATATCGACGGAGGCTGGGTACCAGCCCCACCGAACGCCATGAGGATCC
TATGTTGGAACGCTCAAGGGATGGGGAACCCTCAAGCAATCCGTGCTCTGAGACACCTGATCAAGGGCCAAAACCCCCAGATTATTTTTTTGTCGGAGACATGGAGTGAT
TCTAGTAAATGTGAGAAGATTAAGTTGGAGATGAAATATGATGATATGTTCGTTGTCCCTAGCAAAGGGTCAAGCGGGGGTTTGATGCTCATGTGGAAGGATTTGTTCTC
GGGGTTTTATGGGAACCCGATCATGCACAAGAGAAAGGATTCTTGGAGTTTATTGGAGAGACTATCTCAATGTTCGAATCTCCCCTGGATCATTGGAGGAGATTTCAATG
AGATCGTCTCTGATGATGAGAAAGTGGGTGGCAACAATAGAGGGCAATCCCAAATCAATAGATTCATAGATGTTATCGATGCCTGCAGCTTGGTCGACCTGGGATATAGT
GGTGGTAAATTTACTTGGGCTAGAGGAGAAGTGGGGCCAAATGAGATTAAGGAGAGGCTGGACCATTTTTTAGCCAACAAAAGCTTCACAGACTCCTTTAAGGATATTAA
GATTTAG
Protein sequenceShow/hide protein sequence
MDEERQLRSGLRGSDSDQVGVERAESYQYWEGGTSMTMEVAEEENLEISSKGVLEEEEEDLAEAINNQINRLSLAEQKGIRIMAIEDDDIEDTTKDLSETLWPTKFSPKN
SSSGRRELEGKKPKQNIVTDSNMDCGKRREYEERDNNISSDSKKVNRGPAYFELHGGDIDGGWVPAPPNAMRILCWNAQGMGNPQAIRALRHLIKGQNPQIIFLSETWSD
SSKCEKIKLEMKYDDMFVVPSKGSSGGLMLMWKDLFSGFYGNPIMHKRKDSWSLLERLSQCSNLPWIIGGDFNEIVSDDEKVGGNNRGQSQINRFIDVIDACSLVDLGYS
GGKFTWARGEVGPNEIKERLDHFLANKSFTDSFKDIKI