; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018183 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018183
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:18207023..18209581
RNA-Seq ExpressionLag0018183
SyntenyLag0018183
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO99000.1 reverse transcriptase [Corchorus capsularis]7.9e-1730.15Show/hide
Query:  EASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKEPTSPIHQS-LGQYLGIPSQNARNKKEIFNGIKDR------------------
        +A++A+   +RD L IY + SGQ IN DKS    S NT +     I+E      QS + +YLG+P+   RNKK  FN IK+R                  
Subjt:  EASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKEPTSPIHQS-LGQYLGIPSQNARNKKEIFNGIKDR------------------

Query:  ---------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVA------GTNFATTRIRR----NGLSRAILVQPSHVGKTKLEVNQISK--
                 +IP YAM  F FP TLCNE++ + ARFWW QQI +R    VA        +F     R     + L   +   PS   ++ L+  ++ +  
Subjt:  ---------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVA------GTNFATTRIRR----NGLSRAILVQPSHVGKTKLEVNQISK--

Query:  QLARKGSAKPILVRPD---------IRH----------LTVAHLKDSNGLWKENIIRECFLDTYAEAILNMP
           R G  + + V  D         I H          L    +++ +  W  ++IR  F +  AEAIL +P
Subjt:  QLARKGSAKPILVRPD---------IRH----------LTVAHLKDSNGLWKENIIRECFLDTYAEAILNMP

XP_023902041.1 uncharacterized protein LOC112013897 [Quercus suber]4.6e-1729.71Show/hide
Query:  LICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKEPTSPIHQSL-GQYLGIPSQNARNKKEIFNGIKDR--------------
        L+  +A++ +C+T+ DIL +Y  ASGQ IN+DKS+   S NT +E    +      +  +   +YLG+PS   ++K EIF  +K+R              
Subjt:  LICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKEPTSPIHQSL-GQYLGIPSQNARNKKEIFNGIKDR--------------

Query:  -------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRK------FIGVAGTNFATTRIRRNGLSRAILVQPSHVGKTKLEVNQISKQL
                     AIP Y MSCF+ P TLC E+ A+  RFWWGQ+ +  K      F G+      T    R G    IL+       T +    IS   
Subjt:  -------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRK------FIGVAGTNFATTRIRRNGLSRAILVQPSHVGKTKLEVNQISKQL

Query:  ARKGSAKPILVRPDIRHLTVAHLKDSNGLWKENIIRECFLDTYAEAILNMPSCSTMARTRSFGITMLKGSFG-KSA
              KP    P +  L     K     W+++++R  FL   A  IL++P        +   +   KG F  KSA
Subjt:  ARKGSAKPILVRPDIRHLTVAHLKDSNGLWKENIIRECFLDTYAEAILNMPSCSTMARTRSFGITMLKGSFG-KSA

XP_030477990.1 uncharacterized protein LOC115695032 [Cannabis sativa]1.6e-1737.25Show/hide
Query:  LICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESV----TIIKEPTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR-----------
        L+  EA +  C  I+ +LD Y KASGQ +N DKS    SPNT E S      I+  P    H+S   YLG+P+ + R+KK++FN IK+R           
Subjt:  LICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESV----TIIKEPTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR-----------

Query:  ----------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRK
                        +IP YAMSCFK P+  C+E+ +L + FWWG    ++K
Subjt:  ----------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRK

XP_030479476.1 uncharacterized protein LOC115696730 [Cannabis sativa]4.2e-1835.98Show/hide
Query:  EWIHPNRDLDRETH--FPILILICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKEPTS-PIHQSLGQYLGIPSQNARNKKEI
        E++ P R L + +H  F    L+  EA+N   R I+ +LDIY KASGQ +N  KS    SPNT + +     +    PI +    YLG+P+ + R+KKE+
Subjt:  EWIHPNRDLDRETH--FPILILICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKEPTS-PIHQSLGQYLGIPSQNARNKKEI

Query:  FNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWG
        F+ +K+R                           +IP YAMSCF+ P T CN+L ++ A FWWG
Subjt:  FNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWG

XP_034219069.1 uncharacterized protein LOC117630466 [Prunus dulcis]4.6e-1733.48Show/hide
Query:  YLAKPIGFCVWRLITDNAILGSPQWYSEEWIHPNRDLDRETHFPILILICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNT----KEESVTI
        YL   +   +  +I+ N+ LG  Q     ++H    +     F    +    A+N +C  + +IL  Y  ASGQ INLDKSN   SPNT    +++   I
Subjt:  YLAKPIGFCVWRLITDNAILGSPQWYSEEWIHPNRDLDRETHFPILILICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNT----KEESVTI

Query:  --IKEPTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRR
          IKE  +P     G YLG+P+   R+KKE  N IK+R                           AIP+Y MSCFK P+TLC E+++L A FWWG     
Subjt:  --IKEPTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRR

Query:  RKFIGVAGTNFATTRIRRNGLSRA
             V G      R    GLS+A
Subjt:  RKFIGVAGTNFATTRIRRNGLSRA

TrEMBL top hitse value%identityAlignment
A0A2N9EMD0 Uncharacterized protein5.9e-1834.48Show/hide
Query:  CVWRLITDNAILGSPQWYSEEWIHPNRDLDR-ETHFPILILICAE--------ASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKE
        C+  +     + G P  Y    I P+R L + +   P L L+CAE        A+   C  I+ IL  Y  ASGQ +N DK+    S +T E S  +IKE
Subjt:  CVWRLITDNAILGSPQWYSEEWIHPNRDLDR-ETHFPILILICAE--------ASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKE

Query:  PTS-PIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFI
            PI +   +YLG+PS   RN+ E F  IK+R                           AIP Y+MSCF+ P  LC+EL A+  RFWW     +RK  
Subjt:  PTS-PIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFI

Query:  GVA
         VA
Subjt:  GVA

A0A2N9GJ35 Uncharacterized protein3.5e-1840.4Show/hide
Query:  LICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR-------------
        ++   A+NADC T++++L  YA ASGQ +N DK+    SPNT ++S   I     TSP  Q   +YLG+P    R K+  FN IKDR             
Subjt:  LICAEASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR-------------

Query:  --------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRK
                      AIPNYAMSCFK P   C+EL ++  RFWWGQ+   RK
Subjt:  --------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRK

A0A2N9I6M1 CCHC-type domain-containing protein2.7e-1835.53Show/hide
Query:  WIHPNRDLDR-ETHFPILILICAE--------ASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQN
        +I P R L + +   P L L+CAE        A+  +C  +  IL+IY  ASGQ IN  K+    S NT+ E   +I +   T+P  Q   +YLG+P   
Subjt:  WIHPNRDLDR-ETHFPILILICAE--------ASNADCRTIRDILDIYAKASGQTINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQN

Query:  ARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVAGTNFATTRIRRNGL
         ++KK  FNG+KDR                           AIP YAMSCFKFP  LC+E++++  RFWWGQ+   RK +   G N  + R    G+
Subjt:  ARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVAGTNFATTRIRRNGL

A0A2N9J5D9 CCHC-type domain-containing protein9.1e-1934.67Show/hide
Query:  RARQDYLAK---PIGF----------CVWRLITDNAILGSPQWYSEEWIHPNRDLDR-ETHFPILILICAE--------ASNADCRTIRDILDIYAKASG
        R   DYL K    +GF          CV  +     + G P+ Y    + P+R L + +   P L LICAE        A+N +C+ ++DIL +Y  ASG
Subjt:  RARQDYLAK---PIGF----------CVWRLITDNAILGSPQWYSEEWIHPNRDLDR-ETHFPILILICAE--------ASNADCRTIRDILDIYAKASG

Query:  QTINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKF
        Q IN  K+    S N        I     TSP  Q   +YLG+P    R+KK+ F  IKDR                           AIP YAMSCFKF
Subjt:  QTINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKF

Query:  PLTLCNELNALCARFWWGQQIRRRK
        P  LC E++++  RFWWGQ+   RK
Subjt:  PLTLCNELNALCARFWWGQQIRRRK

A0A2N9J6K4 Reverse transcriptase domain-containing protein9.1e-1934.38Show/hide
Query:  RARQDYLAK---PIGF----------CVWRLITDNAILGSPQWYSEEWIHPNRDLDR-ETHFPILILICAE-------ASNADCRTIRDILDIYAKASGQ
        R   DYL K    +GF          CV  +     + G P+ Y    + P+R L + +   P L LICAE       A++ +C+ ++DIL +Y  ASGQ
Subjt:  RARQDYLAK---PIGF----------CVWRLITDNAILGSPQWYSEEWIHPNRDLDR-ETHFPILILICAE-------ASNADCRTIRDILDIYAKASGQ

Query:  TINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFP
         IN  K+    S N        I     TSP  Q   +YLG+P    R+KK+ F+ IKDR                           AIP YAMSCFKFP
Subjt:  TINLDKSNFMVSPNTKEESVTIIKE--PTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDR---------------------------AIPNYAMSCFKFP

Query:  LTLCNELNALCARFWWGQQIRRRK
          LC E++++  RFWWGQ+   RK
Subjt:  LTLCNELNALCARFWWGQQIRRRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.7e-0435.71Show/hide
Query:  AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVAGTNFATTRIRRNGL
        A+P YAMSCF+    LC +L +    FWW     +RK   VA      ++    GL
Subjt:  AIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVAGTNFATTRIRRNGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAACTCATTGCATAGGATACCCCCACTCGCATGTCTCCTACACGAACGCGTTGGATCGCAGCGTTTGTATCAAATACAAAGTGGGCCGTGCTCGACAAGATTATCT
CGCCAAGCCAATCGGCTTTTGTGTCTGGCGCCTTATCACGGACAATGCCATCTTGGGTTCTCCTCAATGGTACTCCGAGGAGTGGATTCACCCAAATCGGGACTTAGACA
GGGAGACCCACTTTCCCATACTTATTCTCATTTGTGCTGAAGCGTCTAATGCAGATTGCAGGACTATTCGGGACATCCTTGATATATATGCGAAAGCTTCTGGCCAAACT
ATTAATTTGGACAAATCTAACTTTATGGTGAGCCCTAACACAAAAGAAGAAAGCGTGACGATCATCAAGGAGCCTACAAGTCCAATTCACCAAAGCTTAGGCCAATATTT
AGGCATTCCCTCGCAGAATGCTAGAAACAAGAAGGAGATCTTCAATGGCATAAAAGATCGAGCTATTCCTAATTATGCAATGAGCTGCTTTAAGTTTCCTCTAACTCTAT
GTAATGAATTAAATGCCTTATGTGCTAGGTTCTGGTGGGGACAACAGATTCGGAGAAGAAAATTCATTGGTGTAGCTGGGACAAACTTTGCCACAACAAGAATTCGGAGG
AATGGGCTTTCGAGAGCTATCCTTGTTCAACCAAGCCATGTTGGCAAAACAAAGCTGGAGGTTAATCAAATTTCCAAACAGCTTGCTCGCAAAGGTTCCGCTAAGCCAAT
TCTAGTTCGTCCTGATATACGTCACCTCACTGTGGCGCATCTCAAAGACTCGAACGGGTTGTGGAAGGAGAATATTATTAGAGAGTGCTTCCTGGACACATATGCAGAGG
CAATTTTAAATATGCCATCTTGTTCTACCATGGCGAGGACGAGATCATTTGGAATTACGATGCTAAAGGGAAGTTTCGGTAAAAGTGCGAGAAGCCTGAATGGTTGCAAC
ATCTCTTTGGAATGCAAGTTCATGAAAGAGGAACTCCATCAGCCACAACAAACAAATTCCCAAGCAGAGAGTTTACAACAGCAAATTGACAAAGCAATATCTGAGCTAAT
CGGAGAACAAGAGGAGTACCTACCTAATGCGACTGTGAACGAGCAACCCTTCAGTTTCAACAACAAGTCGGCACCCATGTCTGGATGGCTTGCGATTCCCGAAGGCCTAT
TGATGTGGACTTTTCCTACCGCAAGCGCACGGGTCAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAACTCATTGCATAGGATACCCCCACTCGCATGTCTCCTACACGAACGCGTTGGATCGCAGCGTTTGTATCAAATACAAAGTGGGCCGTGCTCGACAAGATTATCT
CGCCAAGCCAATCGGCTTTTGTGTCTGGCGCCTTATCACGGACAATGCCATCTTGGGTTCTCCTCAATGGTACTCCGAGGAGTGGATTCACCCAAATCGGGACTTAGACA
GGGAGACCCACTTTCCCATACTTATTCTCATTTGTGCTGAAGCGTCTAATGCAGATTGCAGGACTATTCGGGACATCCTTGATATATATGCGAAAGCTTCTGGCCAAACT
ATTAATTTGGACAAATCTAACTTTATGGTGAGCCCTAACACAAAAGAAGAAAGCGTGACGATCATCAAGGAGCCTACAAGTCCAATTCACCAAAGCTTAGGCCAATATTT
AGGCATTCCCTCGCAGAATGCTAGAAACAAGAAGGAGATCTTCAATGGCATAAAAGATCGAGCTATTCCTAATTATGCAATGAGCTGCTTTAAGTTTCCTCTAACTCTAT
GTAATGAATTAAATGCCTTATGTGCTAGGTTCTGGTGGGGACAACAGATTCGGAGAAGAAAATTCATTGGTGTAGCTGGGACAAACTTTGCCACAACAAGAATTCGGAGG
AATGGGCTTTCGAGAGCTATCCTTGTTCAACCAAGCCATGTTGGCAAAACAAAGCTGGAGGTTAATCAAATTTCCAAACAGCTTGCTCGCAAAGGTTCCGCTAAGCCAAT
TCTAGTTCGTCCTGATATACGTCACCTCACTGTGGCGCATCTCAAAGACTCGAACGGGTTGTGGAAGGAGAATATTATTAGAGAGTGCTTCCTGGACACATATGCAGAGG
CAATTTTAAATATGCCATCTTGTTCTACCATGGCGAGGACGAGATCATTTGGAATTACGATGCTAAAGGGAAGTTTCGGTAAAAGTGCGAGAAGCCTGAATGGTTGCAAC
ATCTCTTTGGAATGCAAGTTCATGAAAGAGGAACTCCATCAGCCACAACAAACAAATTCCCAAGCAGAGAGTTTACAACAGCAAATTGACAAAGCAATATCTGAGCTAAT
CGGAGAACAAGAGGAGTACCTACCTAATGCGACTGTGAACGAGCAACCCTTCAGTTTCAACAACAAGTCGGCACCCATGTCTGGATGGCTTGCGATTCCCGAAGGCCTAT
TGATGTGGACTTTTCCTACCGCAAGCGCACGGGTCAAGTAA
Protein sequenceShow/hide protein sequence
MQTHCIGYPHSHVSYTNALDRSVCIKYKVGRARQDYLAKPIGFCVWRLITDNAILGSPQWYSEEWIHPNRDLDRETHFPILILICAEASNADCRTIRDILDIYAKASGQT
INLDKSNFMVSPNTKEESVTIIKEPTSPIHQSLGQYLGIPSQNARNKKEIFNGIKDRAIPNYAMSCFKFPLTLCNELNALCARFWWGQQIRRRKFIGVAGTNFATTRIRR
NGLSRAILVQPSHVGKTKLEVNQISKQLARKGSAKPILVRPDIRHLTVAHLKDSNGLWKENIIRECFLDTYAEAILNMPSCSTMARTRSFGITMLKGSFGKSARSLNGCN
ISLECKFMKEELHQPQQTNSQAESLQQQIDKAISELIGEQEEYLPNATVNEQPFSFNNKSAPMSGWLAIPEGLLMWTFPTASARVK