; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032956 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032956
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:39180747..39183741
RNA-Seq ExpressionLag0032956
SyntenyLag0032956
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]4.2e-3154.48Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGDWRFTSFYGNPETEK
        NP   R LR+LV   +PQLVFL ETK +     R +  + F+ C+ V+S G+SGGLML+WNS   V I S S GHID+ I D  G WRFT FYGNP T K
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGDWRFTSFYGNPETEK

Query:  RRFSWELLDRLYEENERPWLIGGDFNEILSAEEK
        R  SW+LL+RL    + PW+IGGDFNEI+S  EK
Subjt:  RRFSWELLDRLYEENERPWLIGGDFNEILSAEEK

XP_030970584.1 uncharacterized protein LOC115990959 [Quercus lobata]6.2e-2751.47Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGD--WRFTSFYGNPET
        NPR++RAL +LV  + P+LVFL ETK   KR ERI+  + F   L V  +GRSGGL L+W     + I SFS  HIDA I +   +  WRFT FYG+P+T
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGD--WRFTSFYGNPET

Query:  EKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK
          R+ SW+LLD L  +++ PW   GDFNEILS EEK
Subjt:  EKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata]1.4e-2648.15Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRG-DWRFTSFYGNPETE
        NP+++R LR++V  + P  VFL ETK   +  ER ++S+ F   L + S+GRSGGL L+W   + V++ SFS+ HIDA + ++RG  WR T FYGNPE  
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRG-DWRFTSFYGNPETE

Query:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK
        +R+ SWELL  L  + + PWL  GDFNEI+S  EK
Subjt:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK

XP_042958101.1 uncharacterized protein LOC122293641 [Carya illinoinensis]1.4e-2628.28Show/hide
Query:  LSNKGINVE----HLPKIWGVEENVQLKKTGKNMYICKFKNRKTKRRV-------------------GDRGIKALELRYVNFWVDFHNLPVVCLNRKYAI
        L+++ +N E     + K+W +E  +  K+ G N ++ +++    KR+V                   GD     +  +   FWV  HN+P   +N +   
Subjt:  LSNKGINVE----HLPKIWGVEENVQLKKTGKNMYICKFKNRKTKRRV-------------------GDRGIKALELRYVNFWVDFHNLPVVCLNRKYAI

Query:  ALANSIGAFVKMNEEEEEGRVWGETLRVKVKLEANKPLRRG--TKVMNK----------------CKEDNSD-------EEDEDENAAYNMEYRGKKAKE
         L + IG   K+ E +++G  WG  LR++V ++ NKPL RG   KV +K                  +DNS        E  +D +   ++   G+   +
Subjt:  ALANSIGAFVKMNEEEEEGRVWGETLRVKVKLEANKPLRRG--TKVMNK----------------CKEDNSD-------EEDEDENAAYNMEYRGKKAKE

Query:  KEAMEEKK-----------DRTKEIEEKTKELATKMAERDIGGDRRTALSDAMNPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCV
           + ++K           D+T    E       K  E   G  ++  +          + R L  +  P LVFL+ETKC S++ E IRV M F+ C  V
Subjt:  KEAMEEKK-----------DRTKEIEEKTKELATKMAERDIGGDRRTALSDAMNPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCV

Query:  SSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGD--WRFTSFYGNPETEKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK
         S GRSGGL L+WNS ++V +++++  HI   ++    D  W  T FYG+  T KR  +W++L  L    +  WL  GDFNEI    EK
Subjt:  SSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGD--WRFTSFYGNPETEKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK

XP_042972796.1 uncharacterized protein LOC122304603 [Carya illinoinensis]4.3e-2851.47Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRG--DWRFTSFYGNPET
        NPR +RAL  LV    P ++FLMETK  S++ ERIRV + FE C  V+S+GR GG+ L+W + +K++I SFS  HIDA I    G  +W+FT  YG+ E 
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRG--DWRFTSFYGNPET

Query:  EKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK
        EKR  +W LL  L EE   PWL+ GDFNE+LS +EK
Subjt:  EKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEK

TrEMBL top hitse value%identityAlignment
A0A2N9ESV7 Uncharacterized protein4.2e-2948.53Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE
        NP+AIR LR+L     P ++FL+ETK D KR E++R S+ F++   V S GRSGGL L+W   ++V + +FS+ H+D  +  D    WR T FYG+PE  
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE

Query:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK
        KR  +W+LL+ L   N+ PWL  GDFNEILS EEK+
Subjt:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK

A0A2N9EUS3 Reverse transcriptase domain-containing protein4.2e-2948.53Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE
        NP+AIR LR+L     P ++FL+ETK D KR E++R S+ F++   V S GRSGGL L+W   ++V + +FS+ H+D  +  D    WR T FYG+PE  
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE

Query:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK
        KR  +W+LL+ L   N+ PWL  GDFNEILS EEK+
Subjt:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK

A0A2N9F4F2 Reverse transcriptase domain-containing protein4.2e-2948.53Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE
        NP+AIR LR+L     P ++FL+ETK D KR E++R S+ F++   V S GRSGGL L+W   ++V + +FS+ H+D  +  D    WR T FYG+PE  
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE

Query:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK
        KR  +W+LL+ L   N+ PWL  GDFNEILS EEK+
Subjt:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK

A0A2N9HE28 Reverse transcriptase domain-containing protein4.2e-2948.53Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE
        NP+AIR LR+L     P ++FL+ETK D KR E++R S+ F++   V S GRSGGL L+W   ++V + +FS+ H+D  +  D    WR T FYG+PE  
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASI-QDNRGDWRFTSFYGNPETE

Query:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK
        KR  +W+LL+ L   N+ PWL  GDFNEILS EEK+
Subjt:  KRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKK

A0A6J1DUG8 uncharacterized protein LOC1110241352.0e-3154.48Show/hide
Query:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGDWRFTSFYGNPETEK
        NP   R LR+LV   +PQLVFL ETK +     R +  + F+ C+ V+S G+SGGLML+WNS   V I S S GHID+ I D  G WRFT FYGNP T K
Subjt:  NPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNINSFSEGHIDASIQDNRGDWRFTSFYGNPETEK

Query:  RRFSWELLDRLYEENERPWLIGGDFNEILSAEEK
        R  SW+LL+RL    + PW+IGGDFNEI+S  EK
Subjt:  RRFSWELLDRLYEENERPWLIGGDFNEILSAEEK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAGTGTGGGGTGAAGCGCCCGAATAATACTTGGTTGCAGGTTGCATACCGTCGCCATGGAGGAAGAAAAGGTGATGAAAGAGATGGGGAAGATGAAACTAACAG
AGGCAGAAAAAGAAGGGCTCATGGAGGTGTGGAAGACAATGATATGGACTTAACAGACAAGGACATAGAAAATACAGTGGCTTGCAAAATCCTATCAAACAAAGGAATAA
ATGTTGAGCACTTACCAAAGATATGGGGCGTAGAAGAGAATGTGCAGTTGAAAAAGACAGGGAAGAATATGTACATCTGCAAGTTTAAGAACAGAAAAACAAAAAGAAGA
GTGGGGGATAGGGGGATCAAGGCCTTAGAGTTGAGGTACGTAAATTTCTGGGTTGATTTTCATAACTTACCCGTGGTTTGCCTAAACAGGAAATATGCAATAGCCCTGGC
CAACTCTATTGGTGCTTTCGTGAAGATGAATGAAGAAGAGGAGGAAGGTAGAGTTTGGGGGGAGACACTTCGCGTGAAAGTTAAGCTGGAAGCAAACAAGCCTCTGCGTA
GAGGAACCAAGGTTATGAACAAATGCAAAGAAGACAACAGTGACGAAGAGGATGAGGATGAAAATGCAGCTTACAATATGGAGTATAGAGGTAAAAAAGCAAAAGAGAAA
GAAGCCATGGAGGAAAAGAAAGATAGAACCAAAGAAATAGAAGAGAAAACAAAAGAGCTGGCGACTAAGATGGCCGAGAGGGACATTGGTGGAGACAGGAGAACAGCCCT
GTCGGATGCCATGAACCCTCGAGCGATCCGAGCGCTTCGCCACTTAGTGGGGAGCTTCAAACCCCAGCTAGTCTTTTTAATGGAAACCAAATGCGATTCAAAAAGGAGTG
AAAGAATTAGAGTTAGCATGATGTTTGAGTTTTGTTTGTGTGTTTCGAGTAACGGTAGAAGTGGAGGCCTAATGCTCATTTGGAACTCTTTTATGAAGGTCAATATAAAT
TCTTTCTCTGAAGGGCACATTGATGCATCTATTCAAGACAACCGAGGGGATTGGAGATTTACTAGCTTCTATGGCAACCCAGAAACAGAGAAAAGACGGTTTTCTTGGGA
GCTCCTTGACAGGTTGTATGAGGAAAATGAGAGGCCTTGGTTGATTGGTGGTGATTTCAATGAGATCTTGTCGGCTGAAGAGAAAAAAAGGAGGAATGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCAAGTGTGGGGTGAAGCGCCCGAATAATACTTGGTTGCAGGTTGCATACCGTCGCCATGGAGGAAGAAAAGGTGATGAAAGAGATGGGGAAGATGAAACTAACAG
AGGCAGAAAAAGAAGGGCTCATGGAGGTGTGGAAGACAATGATATGGACTTAACAGACAAGGACATAGAAAATACAGTGGCTTGCAAAATCCTATCAAACAAAGGAATAA
ATGTTGAGCACTTACCAAAGATATGGGGCGTAGAAGAGAATGTGCAGTTGAAAAAGACAGGGAAGAATATGTACATCTGCAAGTTTAAGAACAGAAAAACAAAAAGAAGA
GTGGGGGATAGGGGGATCAAGGCCTTAGAGTTGAGGTACGTAAATTTCTGGGTTGATTTTCATAACTTACCCGTGGTTTGCCTAAACAGGAAATATGCAATAGCCCTGGC
CAACTCTATTGGTGCTTTCGTGAAGATGAATGAAGAAGAGGAGGAAGGTAGAGTTTGGGGGGAGACACTTCGCGTGAAAGTTAAGCTGGAAGCAAACAAGCCTCTGCGTA
GAGGAACCAAGGTTATGAACAAATGCAAAGAAGACAACAGTGACGAAGAGGATGAGGATGAAAATGCAGCTTACAATATGGAGTATAGAGGTAAAAAAGCAAAAGAGAAA
GAAGCCATGGAGGAAAAGAAAGATAGAACCAAAGAAATAGAAGAGAAAACAAAAGAGCTGGCGACTAAGATGGCCGAGAGGGACATTGGTGGAGACAGGAGAACAGCCCT
GTCGGATGCCATGAACCCTCGAGCGATCCGAGCGCTTCGCCACTTAGTGGGGAGCTTCAAACCCCAGCTAGTCTTTTTAATGGAAACCAAATGCGATTCAAAAAGGAGTG
AAAGAATTAGAGTTAGCATGATGTTTGAGTTTTGTTTGTGTGTTTCGAGTAACGGTAGAAGTGGAGGCCTAATGCTCATTTGGAACTCTTTTATGAAGGTCAATATAAAT
TCTTTCTCTGAAGGGCACATTGATGCATCTATTCAAGACAACCGAGGGGATTGGAGATTTACTAGCTTCTATGGCAACCCAGAAACAGAGAAAAGACGGTTTTCTTGGGA
GCTCCTTGACAGGTTGTATGAGGAAAATGAGAGGCCTTGGTTGATTGGTGGTGATTTCAATGAGATCTTGTCGGCTGAAGAGAAAAAAAGGAGGAATGGCTAG
Protein sequenceShow/hide protein sequence
MTKCGVKRPNNTWLQVAYRRHGGRKGDERDGEDETNRGRKRRAHGGVEDNDMDLTDKDIENTVACKILSNKGINVEHLPKIWGVEENVQLKKTGKNMYICKFKNRKTKRR
VGDRGIKALELRYVNFWVDFHNLPVVCLNRKYAIALANSIGAFVKMNEEEEEGRVWGETLRVKVKLEANKPLRRGTKVMNKCKEDNSDEEDEDENAAYNMEYRGKKAKEK
EAMEEKKDRTKEIEEKTKELATKMAERDIGGDRRTALSDAMNPRAIRALRHLVGSFKPQLVFLMETKCDSKRSERIRVSMMFEFCLCVSSNGRSGGLMLIWNSFMKVNIN
SFSEGHIDASIQDNRGDWRFTSFYGNPETEKRRFSWELLDRLYEENERPWLIGGDFNEILSAEEKKRRNG