CuGenDBv2

Lag0004631 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0004631
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr6:5597938..5599091
RNA-Seq Expression: Lag0004631
Synteny: Lag0004631
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
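
The genome location above is given in a `seqid:start..end` form. As a minimal sketch, the snippet below parses such a string and reports the span length. The names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the coordinates are assumed to be 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome interval (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr6:5597938..5599091'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match.group(1), start, end)


loc = parse_location("chr6:5597938..5599091")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the interval for Lag0004631 spans 1154 bp, which is longer than the 855 nt CDS listed further down the page.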


Homology
GenBank top hits | e value | % identity
OMO59710.1 reverse transcriptase [Corchorus capsularis] | 3.7e-26 | 28.83
Query:  TGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNK--GWWRFSGFY------------------------------------------------------
        TGRSGGL + W+  +D+Q+ SFS+ HID  + +N+    WR +GFY                                                      
Subjt:  TGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNK--GWWRFSGFY------------------------------------------------------

Query:  -------------GFSGNKFTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKS-NEKLRRFEEGWLKYKET
                     G+ GN FTW+RG  +   IHERLD+ +             + HL+   SDH PI+ +    +  +K +S + K   FE GW K  + 
Subjt:  -------------GFSGNKFTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKS-NEKLRRFEEGWLKYKET

Query:  KVIVQKNWENQPGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAE--EDLSALLEEEEAYWRCRSREVWLKNG
        + +V   WE   GLG  +   ++ +S+ K +         S++  ID+  ++++K+S     GH+ N ++ E  E+++ LLEEEE++W   SR  WL  G
Subjt:  KVIVQKNWENQPGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAE--EDLSALLEEEEAYWRCRSREVWLKNG

Query:  DKNTKWFHAKASQRRRRNHIEGIPSQ
        D+NT +FHA+AS+RR++N IE +  +
Subjt:  DKNTKWFHAKASQRRRRNHIEGIPSQ

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia] | 5.2e-28 | 36.36
Query:  TGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGWWRFSGFYGFSGNKFTWRRG--------------------------------------KNSRMR
        TG+SGGL+LLWN + +V+I+S S GHID+ I    G WRF+GFY   GN  T++R                                         S+MR
Subjt:  TGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGWWRFSGFYGFSGNKFTWRRG--------------------------------------KNSRMR

Query:  ---IHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSIIK
           I ERLD++LIN +M     +LKV HL  LSSDHRPI+ASW F          ++  RFEE WL+    + I+   W + PG+G      KI + + +
Subjt:  ---IHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSIIK

Query:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNED
        L++W++ RL  S+K  I  KE+E+ +L   D
Subjt:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNED

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina] | 1.3e-26 | 32.73
Query:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKG-WWRFSGFYG---------------------FSGNKFTWRRGKNSRMRIHERLDKYLINHAMCIN
        G+ GGL LLWN E  VQIKSF++ HIDA I+   G   R +G YG                     + G  +TW  G+     + ERLD+++ N+A    
Subjt:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKG-WWRFSGFYG---------------------FSGNKFTWRRGKNSRMRIHERLDKYLINHAMCIN

Query:  ARDLKVCHLNFLSSDHRPIVASWRF-AEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANN---LCKKID-NSIIKLHQWSRHRLKGSIKA
          D    +++  +SDH P+V   +     +   +    L  +E+ W  Y   K I++K W  Q    A N   + +K+  NS+ +L  WS+   +G  K 
Subjt:  ARDLKVCHLNFLSSDHRPIVASWRF-AEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANN---LCKKID-NSIIKLHQWSRHRLKGSIKA

Query:  VIDKKEEEIHKLSNEDFEGHLDN-IDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGI
         ++K   ++  L     +    N I + E  +  +L ++E YW+ RSR  WLK GDKNTK+FH KAS R+++N I GI
Subjt:  VIDKKEEEIHKLSNEDFEGHLDN-IDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGI

XP_027124356.1 uncharacterized protein LOC113741069 [Coffea arabica] | 2.7e-32 | 35.1
Query:  GEYKTPTGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW--WRFSGFY-----------------------------GFSGNKFTWRRGKNSRMRI
        G Y     R GGL LLW  E+D+ IKSFSEGHID+ +K+ +G   WRF+GFY                             GF+G++FTW RGK+   RI
Subjt:  GEYKTPTGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW--WRFSGFY-----------------------------GFSGNKFTWRRGKNSRMRI

Query:  HERLDKYLINHAMCINARDLKVCHLNFLSSDHRPI---VASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQ-PGLGANNLCKKIDNSIIK
         ERLD+ +     C       V HL    SDH PI   V S      IQK K      +FE+ WL  +E + I++ NW          N+      ++  
Subjt:  HERLDKYLINHAMCINARDLKVCHLNFLSSDHRPI---VASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQ-PGLGANNLCKKIDNSIIK

Query:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGH--LDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGIPSQMME
        L+ W R +  GS++  I + +    KL +  +  H  +        +L  L+E+EE YW+ RSR  WLK GD+NT +FH+KA+QR+ +N I+   S  +E
Subjt:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGH--LDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGIPSQMME

Query:  SG
        SG
Subjt:  SG

XP_030495126.1 uncharacterized protein LOC115710915 [Cannabis sativa] | 9.8e-27 | 28.75
Query:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW-WRFSGFY--------------------------------------------------------
        G+SGGL LLW +   V + SF++ HIDA I K +   WRF+GFY                                                        
Subjt:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW-WRFSGFY--------------------------------------------------------

Query:  -----------GFSGNKFTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLR-RFEEGWLKYKETKV
                    + G++FTW  G+ + M I+ERLD+ ++N+         KV HL+   SDH P++ ++       + K     R  +E+ W   +E + 
Subjt:  -----------GFSGNKFTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLR-RFEEGWLKYKETKV

Query:  IVQKNW-ENQPGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKN
        I+Q NW E      A +L + I+N    L QW++ + K ++  + D K +E+ K SN+        +   E+DL+  L +EE +W+ RSR +WL +GD+N
Subjt:  IVQKNW-ENQPGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKN

Query:  TKWFHAKASQRRRRNHIEGI
        T++FH KA+ RR++N I G+
Subjt:  TKWFHAKASQRRRRNHIEGI

TrEMBL top hits | e value | % identity
A0A2N9FDP4 Reverse transcriptase domain-containing protein | 1.3e-27 | 31.06
Query:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGI-KKNKGWWRFSGFY---------------------------------------GFSGNKFTWRRGKNSRM
        GRSGGL LLW  E DV I++FS+ HIDA +  K    WR +GFY                                       GF G K+TW   ++   
Subjt:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGI-KKNKGWWRFSGFY---------------------------------------GFSGNKFTWRRGKNSRM

Query:  RIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGAN--NLCKKIDNSIIK
         I  RLD+ L             V H +   SDH  +V S        + K  + +RRFEE W    + + ++Q++W     +G+    LC+KI      
Subjt:  RIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGAN--NLCKKIDNSIIK

Query:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEG-HLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGI
        L  WSR   + +    ++ K E +  L  ++  G H   I +  ++++ LL ++E +WR RSRE+WL  GDKNT++FH KA QR+ +N ++G+
Subjt:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEG-HLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGI

A0A6J1DUG8 uncharacterized protein LOC111024135 | 2.5e-28 | 36.36
Query:  TGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGWWRFSGFYGFSGNKFTWRRG--------------------------------------KNSRMR
        TG+SGGL+LLWN + +V+I+S S GHID+ I    G WRF+GFY   GN  T++R                                         S+MR
Subjt:  TGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGWWRFSGFYGFSGNKFTWRRG--------------------------------------KNSRMR

Query:  ---IHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSIIK
           I ERLD++LIN +M     +LKV HL  LSSDHRPI+ASW F          ++  RFEE WL+    + I+   W + PG+G      KI + + +
Subjt:  ---IHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSIIK

Query:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNED
        L++W++ RL  S+K  I  KE+E+ +L   D
Subjt:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNED

A0A6P6XBX4 uncharacterized protein LOC113741069 | 1.3e-32 | 35.1
Query:  GEYKTPTGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW--WRFSGFY-----------------------------GFSGNKFTWRRGKNSRMRI
        G Y     R GGL LLW  E+D+ IKSFSEGHID+ +K+ +G   WRF+GFY                             GF+G++FTW RGK+   RI
Subjt:  GEYKTPTGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW--WRFSGFY-----------------------------GFSGNKFTWRRGKNSRMRI

Query:  HERLDKYLINHAMCINARDLKVCHLNFLSSDHRPI---VASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQ-PGLGANNLCKKIDNSIIK
         ERLD+ +     C       V HL    SDH PI   V S      IQK K      +FE+ WL  +E + I++ NW          N+      ++  
Subjt:  HERLDKYLINHAMCINARDLKVCHLNFLSSDHRPI---VASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQ-PGLGANNLCKKIDNSIIK

Query:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGH--LDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGIPSQMME
        L+ W R +  GS++  I + +    KL +  +  H  +        +L  L+E+EE YW+ RSR  WLK GD+NT +FH+KA+QR+ +N I+   S  +E
Subjt:  LHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGH--LDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGIPSQMME

Query:  SG
        SG
Subjt:  SG

A0A803P4U9 Uncharacterized protein | 1.1e-28 | 31.25
Query:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW-WRFSGFYGF----------------------------------------SGNK----------
        G+SGGL LLW   + VQ+KSF+  HIDA ++ + G+ WRF+GFYG                                          GNK          
Subjt:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGW-WRFSGFYGF----------------------------------------SGNK----------

Query:  -----------------FTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLR-RFEEGWLKYKETKV
                         FTW  G+   + I E+LD+ L N     N +   V  L++ +SDHRP+  +       ++ +     R  FE+ W + +E + 
Subjt:  -----------------FTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLR-RFEEGWLKYKETKV

Query:  IVQKNWENQ-PGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKN
        I+QK W N+  G   +NL   +     KLH+W++ R K  +   I + +++I  LS    +     + K E DL+ + E+ E YW+ RSR +WLK+GD+N
Subjt:  IVQKNWENQ-PGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKN

Query:  TKWFHAKASQRRRRNHIEGI
        TK+FH KASQR+R+N IEG+
Subjt:  TKWFHAKASQRRRRNHIEGI

A0A803PUH4 Uncharacterized protein | 1.3e-27 | 31.53
Query:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKG-WWRFSGFYG---------------------------------------FSGNKFTWRRGKNSRM
        G+SGGL+LLW+  +D  I SFS  HID+ I+K +G WWRF+GFYG                                       + G+++TW  G+ + +
Subjt:  GRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKG-WWRFSGFYG---------------------------------------FSGNKFTWRRGKNSRM

Query:  RIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLR-RFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSII--
         I ERLD+   N          KV HL+ +SSDH PI+    F     +  +    R  FE  W   ++   IV ++W+     G+ N    + + +   
Subjt:  RIHERLDKYLINHAMCINARDLKVCHLNFLSSDHRPIVASWRFAEDIQKPKSNEKLR-RFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSII--

Query:  --KLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGI
           L +W++ R K  +K  + + E++I  LS          +   E+  + LL++EE +WR RSR +WLK GD+NTK+FH KA+ R+R+N I G+
Subjt:  --KLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEEDLSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGI

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGAGAGCTCTTCGCCACCTTGTGGAGACTGGAGAGTACAAAACCCCAACTGGCAGAAGCGGTGGCTTACTCCTGTTATGGAATAAAGAGATGGATGTTCAAATCAAATC
CTTCTCGGAAGGCCACATAGATGCTGGGATAAAAAAGAACAAAGGGTGGTGGAGATTTTCGGGTTTTTATGGGTTTTCAGGTAACAAATTCACATGGAGGAGGGGTAAAA
ACAGTAGAATGCGAATCCATGAAAGGCTGGACAAATATCTCATAAACCATGCAATGTGTATTAATGCTCGGGACTTGAAAGTTTGTCACCTAAATTTTCTAAGCTCCGAT
CATAGACCCATTGTTGCTAGCTGGAGGTTTGCAGAGGATATCCAGAAGCCTAAGAGTAATGAGAAGCTGAGAAGATTTGAAGAAGGTTGGCTGAAATATAAAGAAACAAA
AGTCATTGTTCAGAAGAATTGGGAGAACCAGCCGGGTCTGGGAGCTAATAACCTGTGCAAGAAGATTGACAACAGTATTATCAAGCTTCACCAATGGAGTAGACACCGTC
TGAAAGGGAGTATTAAGGCTGTTATTGACAAAAAGGAAGAAGAAATTCATAAACTCAGCAACGAGGACTTTGAGGGGCACTTGGATAACATCGATAAGGCTGAAGAGGAC
TTAAGTGCCTTGCTCGAAGAAGAAGAGGCATACTGGAGATGCAGATCTCGGGAAGTTTGGCTAAAGAATGGGGATAAGAACACTAAATGGTTCCACGCGAAAGCCTCCCA
ACGGAGAAGAAGGAACCATATTGAGGGCATCCCCAGTCAGATGATGGAATCTGGAAAGAGGATGAAAAGGATATTGGTAGCATAG
mRNA sequence
ATGAGAGCTCTTCGCCACCTTGTGGAGACTGGAGAGTACAAAACCCCAACTGGCAGAAGCGGTGGCTTACTCCTGTTATGGAATAAAGAGATGGATGTTCAAATCAAATC
CTTCTCGGAAGGCCACATAGATGCTGGGATAAAAAAGAACAAAGGGTGGTGGAGATTTTCGGGTTTTTATGGGTTTTCAGGTAACAAATTCACATGGAGGAGGGGTAAAA
ACAGTAGAATGCGAATCCATGAAAGGCTGGACAAATATCTCATAAACCATGCAATGTGTATTAATGCTCGGGACTTGAAAGTTTGTCACCTAAATTTTCTAAGCTCCGAT
CATAGACCCATTGTTGCTAGCTGGAGGTTTGCAGAGGATATCCAGAAGCCTAAGAGTAATGAGAAGCTGAGAAGATTTGAAGAAGGTTGGCTGAAATATAAAGAAACAAA
AGTCATTGTTCAGAAGAATTGGGAGAACCAGCCGGGTCTGGGAGCTAATAACCTGTGCAAGAAGATTGACAACAGTATTATCAAGCTTCACCAATGGAGTAGACACCGTC
TGAAAGGGAGTATTAAGGCTGTTATTGACAAAAAGGAAGAAGAAATTCATAAACTCAGCAACGAGGACTTTGAGGGGCACTTGGATAACATCGATAAGGCTGAAGAGGAC
TTAAGTGCCTTGCTCGAAGAAGAAGAGGCATACTGGAGATGCAGATCTCGGGAAGTTTGGCTAAAGAATGGGGATAAGAACACTAAATGGTTCCACGCGAAAGCCTCCCA
ACGGAGAAGAAGGAACCATATTGAGGGCATCCCCAGTCAGATGATGGAATCTGGAAAGAGGATGAAAAGGATATTGGTAGCATAG
Protein sequence
MRALRHLVETGEYKTPTGRSGGLLLLWNKEMDVQIKSFSEGHIDAGIKKNKGWWRFSGFYGFSGNKFTWRRGKNSRMRIHERLDKYLINHAMCINARDLKVCHLNFLSSD
HRPIVASWRFAEDIQKPKSNEKLRRFEEGWLKYKETKVIVQKNWENQPGLGANNLCKKIDNSIIKLHQWSRHRLKGSIKAVIDKKEEEIHKLSNEDFEGHLDNIDKAEED
LSALLEEEEAYWRCRSREVWLKNGDKNTKWFHAKASQRRRRNHIEGIPSQMMESGKRMKRILVA
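
The CDS and protein sequences above can be cross-checked by translating one into the other. The following is a minimal sketch that assumes the standard nuclear genetic code. `CODON_TABLE` and `translate` are illustrative names, not a CuGenDB API, and line breaks in the pasted sequence are simply stripped.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop whitespace and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate("ATGAGAGCTCTTCGC"))
```

The first five codons of the CDS (ATGAGAGCTCTTCGC) translate to MRALR, matching the start of the listed protein. Passing the full CDS text to `translate` should reproduce the rest of the protein if the page is internally consistent.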