; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006387 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006387
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:41947161..41947964
RNA-Seq ExpressionLag0006387
SyntenyLag0006387
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]1.2e-4744.64Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DWW
        MKLLCWN RG+GNP+ +R LR  + N +P +VFL ETK K       K +L    CF V   G SGGL +LWK D++V ++SFS  HIDA ++  D   W
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DWW

Query:  HFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFM
         FTG YGNPE+  R  +W L++RL++  D PW++GGDFNE+L  +EK GGR + +  M+ FR  I  C L DLG++G K+TW         I ERL RF+
Subjt:  HFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFM

Query:  ATHRLIDKAKNIEVLHLNYHQSDH
          ++         V H     SDH
Subjt:  ATHRLIDKAKNIEVLHLNYHQSDH

XP_024021734.1 uncharacterized protein LOC112091706 [Morus notabilis]8.3e-5242.73Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLDD-WW
        M L+ WNVRG+GNPR    LR+ +R+ +P + FL ET+ +      +K +  F   F V  +G SGGLM++WK +++V+I+S+S+ HID  ++     WW
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLDD-WW

Query:  HFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFM
         FTGFYGNP    R  SW L+ RL ++S+LPW++ GDFNE+LF S+KEGG  +   +M++FR  ++ C LVDLG+ G KFTW    + G  I+ERL R +
Subjt:  HFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFM

Query:  ATHRLIDKAKNIEVLHLNYHQSDHRFL
             I+      V +++++ SDHR L
Subjt:  ATHRLIDKAKNIEVLHLNYHQSDHRFL

XP_028075737.1 uncharacterized protein LOC114277953 [Camellia sinensis]1.1e-4844.93Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHID----AEMKVLD
        MK+LCWN RG+GNPRT+R L+L ++   P +VFL ETK        ++ KL    CF V   GLSGGL +LW  ++Q+ IKSFS+GH+D    +E  V  
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHID----AEMKVLD

Query:  DWWHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLY
          W FTGFYGNP    R DSWEL++RL     LPW+  GDFNE+L+  EK G   + Q+ MD FR  +  C L DLG++G  FTW         I+ERL 
Subjt:  DWWHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLY

Query:  RFMATHRLIDKAKNIEVLHLNYHQSDH
        R +   R ++      V HL    SDH
Subjt:  RFMATHRLIDKAKNIEVLHLNYHQSDH

XP_030969682.1 uncharacterized protein LOC115989960 [Quercus lobata]5.0e-4944.73Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DWW
        M LL WN RG+GN RT+ +L   V    P IVFL ETK K      +KEK K   CF VPS G SGGL++LWK +++V++++FS+ HIDA +   +  WW
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DWW

Query:  HFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFM
        H TGFYG+P   KR +SW  +K L + + LPW++ GDFNE++  SEKEGG  + +K M +F + ID C L DLG+ G KFTW  +   G  I+ERL R +
Subjt:  HFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFM

Query:  ATHRLIDKAKNIEVLHLNYHQSDHRFLRILSMKLGKK
        AT   +      ++ HL+   SDH  L +   +  KK
Subjt:  ATHRLIDKAKNIEVLHLNYHQSDHRFLRILSMKLGKK

XP_042972796.1 uncharacterized protein LOC122304603 [Carya illinoinensis]2.1e-4745.32Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLDDW--
        MK + WN RG+GNPR +R+L   VR   P ++FL ETK        ++ ++ F+CCF V S G  GG+ +LW+N+++++IKSFS  HIDA++   D    
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLDDW--

Query:  WHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRF
        W FTG YG+ EIEKR+++W L++ L    ++PW++ GDFNEVL   EK GGR + +  M  FR+ +D C+L+DLG+KG K+TW  R  +   + ERL RF
Subjt:  WHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRF

Query:  MAT
        +AT
Subjt:  MAT

TrEMBL top hitse value%identityAlignment
A0A1U8HV94 uncharacterized protein LOC1078899123.3e-4641.48Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCF----EVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD
        MK+L WNVRG+GNPRT+  LR  ++ +NP IVF  ETK    + +   E+++  C F    +V S G  GGL + W++D+ ++++SFSK HID  ++  D
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCF----EVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD

Query:  DW--WHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKER
        +   W FTGFYG+  ++ R +SW+L+K L    +LPW + GDFNE+++  EK GG  + ++ MD FRT +  CHLVD+GY G+ FTWKR +     I+ER
Subjt:  DW--WHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKER

Query:  LYRFMATHRLIDKAKNIEVLHLNYHQSDH
        L R +     +    +  + HL +  SDH
Subjt:  LYRFMATHRLIDKAKNIEVLHLNYHQSDH

A0A5C7HJN1 Uncharacterized protein3.3e-4642.33Show/hide
Query:  VGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEM-KVLDDWWHFTGFYGNPE
        +GN RT+ +L+  ++  +P++VFL+ETK KG++ +  K++L F+  F V  +G SGGL++LW +D +V++ S SKGHID  + +  +  W F+GFYG   
Subjt:  VGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEM-KVLDDWWHFTGFYGNPE

Query:  IEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFMATHRLIDKAK
           ++DSWEL++RL ++ D  W+ GGDFNE+L   EK GG  K    + +FR  ID C+LVDLG++G K TW  R      ++ER+ R +A    ID   
Subjt:  IEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFMATHRLIDKAK

Query:  NIEVLHLNYHQSDHR
           V HL Y+ SDHR
Subjt:  NIEVLHLNYHQSDHR

A0A6P4PK91 uncharacterized protein LOC1084782163.0e-4741.05Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCF----EVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMK--V
        MK++CWN+RG+GNPR +R L   ++  NP +VF  ETK    +N    +K++  C F    +V + G  GG+ + WK D+QV++K FS  HID  +K   
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCF----EVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMK--V

Query:  LDDWWHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKER
        + D W F GFYG+P I+ +  SW L++ L   SD PW++GGDFNE+++  E  GG+++ +K M+ FR  ++ CHL D+GY G  FTW+R +     I+ER
Subjt:  LDDWWHFTGFYGNPEIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKER

Query:  LYRFMATHRLIDKAKNIEVLHLNYHQSDH
        L R +A  + I       V HL    SDH
Subjt:  LYRFMATHRLIDKAKNIEVLHLNYHQSDH

A0A7J6DZ24 CCHC-type domain-containing protein1.0e-4743.52Show/hide
Query:  GVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DWWHFTGFYGNP
        G+GNP  L +LR  VR ++P +VFL+ETK  G    G++ ++ F   F V   G SGGL++LW +D +V++KSFS GHIDA +K    + W FTGFYGNP
Subjt:  GVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DWWHFTGFYGNP

Query:  EIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFMATHRLIDKA
        +   R DSW+L+ RL  + DLPWI GGDFNE+L  +EK+GG  +   A+ +F+ A+D C LVD+G++G  FTW  + +    ++ERL R+       +  
Subjt:  EIEKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFMATHRLIDKA

Query:  KNIEVLHLNYHQSDHR
         +++V++ ++  SDHR
Subjt:  KNIEVLHLNYHQSDHR

A0A803NZC3 Uncharacterized protein2.4e-4941.92Show/hide
Query:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDK-GLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DW
        MK++CWN RG+ NPR  R LRL + +H PD+VFL E+K + G + +  +  LKF    EVP  GLSGGL+ LWK ++ V I ++    +D  M  +D   
Subjt:  MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDK-GLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLD-DW

Query:  WHFTGFYGNPEIEKRKDSWELVKRL-HAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYR
        WHF+GFYG P + +R+ +WEL+K+L  +    PW++ GDFNEVL  ++K GG  +C+  +D FR A+D C L +L ++GD++TW  +  +   +KERL  
Subjt:  WHFTGFYGNPEIEKRKDSWELVKRL-HAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYR

Query:  FMATHRLIDKAKNIEVLHLNYHQSDHRFL
            H  +D      V HL++  SDHR L
Subjt:  FMATHRLIDKAKNIEVLHLNYHQSDHRFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTCTTATGTTGGAACGTTCGGGGGGTGGGGAACCCCCGAACGCTCCGCTCGCTCCGGTTAGAGGTGCGCAATCACAACCCCGACATTGTTTTTCTTGCAGAAAC
AAAAGATAAAGGTTTAGTTAACAGAGGATTGAAAGAGAAGCTGAAGTTTGATTGTTGTTTTGAAGTTCCTAGTGCTGGCCTTAGCGGGGGGCTTATGATTTTGTGGAAAA
ATGATATGCAAGTTAACATAAAGTCCTTTTCTAAGGGTCATATAGATGCTGAGATGAAAGTTCTAGATGACTGGTGGCATTTCACAGGTTTTTACGGTAATCCAGAAATA
GAGAAGCGTAAGGATTCGTGGGAGCTGGTCAAAAGATTGCACGCTATGTCAGACCTCCCTTGGATTATAGGGGGAGATTTCAATGAAGTTTTGTTCGATTCAGAGAAGGA
AGGAGGTCGTAGAAAATGCCAAAAAGCGATGGATGAATTCAGGACTGCTATTGACTTATGCCATCTAGTGGATCTCGGTTACAAGGGAGATAAATTTACCTGGAAAAGAA
GGGATAAAAAGGGGGAAACCATAAAGGAACGACTATATAGGTTCATGGCAACTCACAGGCTGATTGATAAAGCGAAAAATATAGAGGTCCTCCACTTGAACTATCATCAA
TCGGATCATAGATTCTTGCGCATATTAAGCATGAAGTTAGGAAAAAAGGTGATTGGCAGAGAAGAAAAAGGACACCCAAATTTGAGGAAAGTTGGCTTCAGTTTGAAGAC
TGTAAAAATATTGTCAAAGAGGCCTGGATCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACTCTTATGTTGGAACGTTCGGGGGGTGGGGAACCCCCGAACGCTCCGCTCGCTCCGGTTAGAGGTGCGCAATCACAACCCCGACATTGTTTTTCTTGCAGAAAC
AAAAGATAAAGGTTTAGTTAACAGAGGATTGAAAGAGAAGCTGAAGTTTGATTGTTGTTTTGAAGTTCCTAGTGCTGGCCTTAGCGGGGGGCTTATGATTTTGTGGAAAA
ATGATATGCAAGTTAACATAAAGTCCTTTTCTAAGGGTCATATAGATGCTGAGATGAAAGTTCTAGATGACTGGTGGCATTTCACAGGTTTTTACGGTAATCCAGAAATA
GAGAAGCGTAAGGATTCGTGGGAGCTGGTCAAAAGATTGCACGCTATGTCAGACCTCCCTTGGATTATAGGGGGAGATTTCAATGAAGTTTTGTTCGATTCAGAGAAGGA
AGGAGGTCGTAGAAAATGCCAAAAAGCGATGGATGAATTCAGGACTGCTATTGACTTATGCCATCTAGTGGATCTCGGTTACAAGGGAGATAAATTTACCTGGAAAAGAA
GGGATAAAAAGGGGGAAACCATAAAGGAACGACTATATAGGTTCATGGCAACTCACAGGCTGATTGATAAAGCGAAAAATATAGAGGTCCTCCACTTGAACTATCATCAA
TCGGATCATAGATTCTTGCGCATATTAAGCATGAAGTTAGGAAAAAAGGTGATTGGCAGAGAAGAAAAAGGACACCCAAATTTGAGGAAAGTTGGCTTCAGTTTGAAGAC
TGTAAAAATATTGTCAAAGAGGCCTGGATCTTAG
Protein sequenceShow/hide protein sequence
MKLLCWNVRGVGNPRTLRSLRLEVRNHNPDIVFLAETKDKGLVNRGLKEKLKFDCCFEVPSAGLSGGLMILWKNDMQVNIKSFSKGHIDAEMKVLDDWWHFTGFYGNPEI
EKRKDSWELVKRLHAMSDLPWIIGGDFNEVLFDSEKEGGRRKCQKAMDEFRTAIDLCHLVDLGYKGDKFTWKRRDKKGETIKERLYRFMATHRLIDKAKNIEVLHLNYHQ
SDHRFLRILSMKLGKKVIGREEKGHPNLRKVGFSLKTVKILSKRPGS