CuGenDBv2

Lag0026686 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026686
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr10:40552638..40554673
RNA-Seq Expression: Lag0026686
Synteny: Lag0026686
Gene Ontology terms:
GO:0009987 - cellular process (biological process)
GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
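
The genome location in the record above (chr10:40552638..40554673) can be split into a chromosome, a start, and an end. The sketch below is one way to do that in Python; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1. The record does not confirm this.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, span = parse_location("chr10:40552638..40554673")
print(chrom, start, end, span)  # chr10 40552638 40554673 2036
```

Under that assumption the gene spans 2036 bp on chr10.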


Homology
GenBank top hits
KAA3468965.1 reverse transcriptase [Gossypium australe] | e-value: 2.3e-22 | identity: 39.1%
Query:  EGHEMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----EGLSALLDSARRRNM-TWISIARSCPKISHLFFADDSFVFLKAAACDFGVF
        +G   L + RKKR G+ G+ A+K+DMSK YDR++C +++++M K+    EGLS+LL S ++ ++      +R  P+ISHL FADD  +F +A      + 
Subjt:  EGHEMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----EGLSALLDSARRRNM-TWISIARSCPKISHLFFADDSFVFLKAAACDFGVF

Query:  KSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPS
        K +L ++E  SGQC++ +KS+IF+S N  S+ +E +S++L ++ S     +LGLP+
Subjt:  KSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPS

XP_019158562.1 PREDICTED: uncharacterized protein LOC109155332 [Ipomoea nil] | e-value: 2.0e-21 | identity: 35.35%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL
        EM H++++K +G +G AALKLD+SK YDR++   LR IM +L                                                    EG SA+
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL

Query:  L-DSARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP
        + D   R  +   ++AR+ P ISHLFFADDSF+F KA+  +    K LL  YEKASGQ ++++KS + FSKN P+  ++ LSNIL +  S + G +LGLP
Subjt:  L-DSARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP

Query:  SSFQRGLEDPLVLSK
            R   + L   K
Subjt:  SSFQRGLEDPLVLSK

XP_019195637.1 PREDICTED: uncharacterized protein LOC109189300 [Ipomoea nil] | e-value: 1.0e-22 | identity: 44.23%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL---EGLSALLDSARRR-NMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLL
        E+ HFL +K+ G +G+ ALKLDM+K YDR++ S+LR ++  L   EGLS LL  A+ +  +    +AR  P ISHLFFADDS +F KA   +  V K  L
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL---EGLSALLDSARRR-NMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLL

Query:  YDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQR
          YE+ SGQ V+  KS I +SKN   D +  +++IL +  + + G +LGLPS   R
Subjt:  YDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQR

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] | e-value: 2.6e-21 | identity: 33.33%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL
        E++H+L  K  GK GF A+KLDMSK +DRV+  ++ ++M ++                                                    EGLSAL
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL

Query:  LD-SARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP
        ++ +AR + +T ISI R CPK++HLFFADDS +F KAA  +  + +S+L  YE+ASGQ ++  KS IFFS N   ++++ + NIL    +     +LGLP
Subjt:  LD-SARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP

Query:  SSFQRGLEDPLVLSKGERG
        S   R       + K + G
Subjt:  SSFQRGLEDPLVLSKGERG

XP_030493541.1 uncharacterized protein LOC115709553 [Cannabis sativa] | e-value: 3.0e-22 | identity: 34.63%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL
        E+LH+L++KRKGK GF ALKLDMSK YDR++  +L  ++ KL                                                    EGLSAL
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL

Query:  LDSARRRNMT-WISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP
        L    RR +     ++   P+ISH+ FADDS+++ KA   +    K +L+ +E ASGQ V+ SKS IF+S N  SD+++ +S +L M ++     +LGLP
Subjt:  LDSARRRNMT-WISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP

Query:  SSFQR
        S+  R
Subjt:  SSFQR

TrEMBL top hits
A0A2N9HE46 Uncharacterized protein | e-value: 4.3e-22 | identity: 40.51%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----EGLSALLD--SARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKS
        E LH+++ ++ GK+G  ALKLDMSK YDRV+  YL+ +M ++    + +S +++  S    ++  +SI+R  PKI+HLFFADDS +F KA   D    ++
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----EGLSALLD--SARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKS

Query:  LLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQR
        +L +YE+ASGQ ++  K+ IFFSK+ P   KE + ++L +        +LGLPS   R
Subjt:  LLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQR

A0A5B6VIS3 Reverse transcriptase | e-value: 1.1e-22 | identity: 39.1%
Query:  EGHEMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----EGLSALLDSARRRNM-TWISIARSCPKISHLFFADDSFVFLKAAACDFGVF
        +G   L + RKKR G+ G+ A+K+DMSK YDR++C +++++M K+    EGLS+LL S ++ ++      +R  P+ISHL FADD  +F +A      + 
Subjt:  EGHEMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----EGLSALLDSARRRNM-TWISIARSCPKISHLFFADDSFVFLKAAACDFGVF

Query:  KSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPS
        K +L ++E  SGQC++ +KS+IF+S N  S+ +E +S++L ++ S     +LGLP+
Subjt:  KSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPS

A0A803PW73 Uncharacterized protein | e-value: 7.3e-22 | identity: 38.01%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKLEGL----------------SALLDSARRRNMTWI---SIARSCPKISHLFFADDSFVF
        E+LH+L++KRKGK+GF ALKLDMSK YDR++  +L  ++ K+  L                + + D+       W+    +A   P+I+H+ FADDS+++
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKLEGL----------------SALLDSARRRNMTWI---SIARSCPKISHLFFADDSFVF

Query:  LKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQR
         KA   +    + LL  +E ASGQ V+ +KS IFFS N  S+ +  + N L M V+D    +LGLPS+  R
Subjt:  LKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQR

A0A803PYQ3 Uncharacterized protein | e-value: 1.2e-21 | identity: 38.04%
Query:  GHEMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL---------------------EGLSALL-----DSARRRNMTWISIARSCPKISHL
        G E++H L  +R G+ G+ ALKLDM+K +DRV+  +LR+++  +                     EGLSAL+      +  R     I IAR  P ISHL
Subjt:  GHEMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL---------------------EGLSALL-----DSARRRNMTWISIARSCPKISHL

Query:  FFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQRGLED
        FFADDS +F  A+       KS+L +Y  ASGQ V+ SKS +FFS N   + +  ++ IL + VSD+   +LGLP +F R  ++
Subjt:  FFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLPSSFQRGLED

A0A803QNR5 Uncharacterized protein | e-value: 1.1e-22 | identity: 34.63%
Query:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL
        E+LH+L++KRKGK GF ALKLDMSK YDR++  +L  ++ KL                                                    EGLSAL
Subjt:  EMLHFLRKKRKGKSGFAALKLDMSKTYDRVKCSYLRQIMAKL----------------------------------------------------EGLSAL

Query:  LDSARRRNMT-WISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP
        L    RR +     ++   P+ISH+ FADDS+++ KA   +    K +L+ +E ASGQ V+ SKS IF+S N  SD+++ +S++L M ++     +LGLP
Subjt:  LDSARRRNMT-WISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDSKEYLSNILSMKVSDSLGAHLGLP

Query:  SSFQR
        S+  R
Subjt:  SSFQR

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGATCATAATGATGCGGTCTGGGGACCTGGGTTTCTCTGGGATATGGTTATGTGGTGCAATAGAAGGAATCTGGGGCGCAAAATGGTGGACTAGACATGAAGACTGTGG
TAACATCATTCGTCGGACTGGAGTTTGGGACGGAGTTAGAGCTCTGGCCCAACCTCTTCATTTGGCATTAAAGAAGTGTGCAGCAGACTTAAGAGGATGTGGAGCTCGCC
AGAACCAACGATTGAAGGTTGAGATTGAAATGGTGAGAGGAAAAAATAAAAAGGCTTATGAAGGCCATGAAATGTTACATTTTCTAAGAAAGAAGAGAAAAGGAAAATCT
GGTTTTGCTGCTTTAAAACTTGACATGAGTAAAACTTACGATAGGGTGAAATGTTCTTACCTGAGGCAGATTATGGCTAAACTAGAGGGTCTTTCTGCTCTATTGGATTC
GGCTAGAAGGAGGAATATGACATGGATTTCAATTGCGAGATCGTGTCCGAAAATTTCTCATCTATTCTTTGCAGATGATAGCTTCGTGTTTCTCAAAGCAGCGGCATGTG
ATTTTGGAGTTTTTAAATCCTTATTGTATGACTATGAGAAAGCGTCTGGCCAGTGTGTTCATGTTAGCAAATCGATGATCTTTTTCTCGAAAAATGTTCCTTCAGACTCT
AAGGAGTATCTCAGCAATATACTATCTATGAAGGTGTCTGATTCATTGGGCGCACATCTCGGTCTTCCATCATCATTTCAACGCGGGTTGGAAGATCCACTTGTTCTCTC
AAAGGGGGAAAGAGGTGTTGGTTAA
mRNA sequence
ATGATCATAATGATGCGGTCTGGGGACCTGGGTTTCTCTGGGATATGGTTATGTGGTGCAATAGAAGGAATCTGGGGCGCAAAATGGTGGACTAGACATGAAGACTGTGG
TAACATCATTCGTCGGACTGGAGTTTGGGACGGAGTTAGAGCTCTGGCCCAACCTCTTCATTTGGCATTAAAGAAGTGTGCAGCAGACTTAAGAGGATGTGGAGCTCGCC
AGAACCAACGATTGAAGGTTGAGATTGAAATGGTGAGAGGAAAAAATAAAAAGGCTTATGAAGGCCATGAAATGTTACATTTTCTAAGAAAGAAGAGAAAAGGAAAATCT
GGTTTTGCTGCTTTAAAACTTGACATGAGTAAAACTTACGATAGGGTGAAATGTTCTTACCTGAGGCAGATTATGGCTAAACTAGAGGGTCTTTCTGCTCTATTGGATTC
GGCTAGAAGGAGGAATATGACATGGATTTCAATTGCGAGATCGTGTCCGAAAATTTCTCATCTATTCTTTGCAGATGATAGCTTCGTGTTTCTCAAAGCAGCGGCATGTG
ATTTTGGAGTTTTTAAATCCTTATTGTATGACTATGAGAAAGCGTCTGGCCAGTGTGTTCATGTTAGCAAATCGATGATCTTTTTCTCGAAAAATGTTCCTTCAGACTCT
AAGGAGTATCTCAGCAATATACTATCTATGAAGGTGTCTGATTCATTGGGCGCACATCTCGGTCTTCCATCATCATTTCAACGCGGGTTGGAAGATCCACTTGTTCTCTC
AAAGGGGGAAAGAGGTGTTGGTTAA
Protein sequence
MIIMMRSGDLGFSGIWLCGAIEGIWGAKWWTRHEDCGNIIRRTGVWDGVRALAQPLHLALKKCAADLRGCGARQNQRLKVEIEMVRGKNKKAYEGHEMLHFLRKKRKGKS
GFAALKLDMSKTYDRVKCSYLRQIMAKLEGLSALLDSARRRNMTWISIARSCPKISHLFFADDSFVFLKAAACDFGVFKSLLYDYEKASGQCVHVSKSMIFFSKNVPSDS
KEYLSNILSMKVSDSLGAHLGLPSSFQRGLEDPLVLSKGERGVG
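
The protein sequence above should be the translation of the listed CDS. A minimal Python sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB. Only the first 30 bases of the CDS are used here; passing the full CDS with its line breaks removed would work the same way.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed in this record.
cds_prefix = "ATGATCATAATGATGCGGTCTGGGGACCTG"
print(translate(cds_prefix))  # MIIMMRSGDL
```

The output matches the first ten residues of the protein sequence in this record (MIIMMRSGDL).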