; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039409 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039409
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:43125479..43126991
RNA-Seq ExpressionLag0039409
SyntenyLag0039409
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EOY08849.1 Uncharacterized protein TCM_024087 [Theobroma cacao]2.4e-1434.39Show/hide
Query:  NRFINNLARAKY-VELLKRDFLFERGFSGDLPHFLRVG--ITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRGAQWRFSKTEKRTFQSAYLKK
        ++FI+  A  +Y   L+ +  + ERG    +  +  +   I +  W  FC +P++    VV +FYAN+ E       VRGAQW+ S  E  +F+ + +KK
Subjt:  NRFINNLARAKY-VELLKRDFLFERGFSGDLPHFLRVG--ITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRGAQWRFSKTEKRTFQSAYLKK

Query:  EANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKKR
        E   W+ F+  RLL +TH S V+++R +L +AI+   SI VGK+I+  I    + KR
Subjt:  EANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKKR

KAE8712804.1 hypothetical protein F3Y22_tig00110223pilonHSYRG00028 [Hibiscus syriacus]3.5e-1328.28Show/hide
Query:  ARAKYVELLKRDFLFERGF------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRG---------------AQWRFSKTE
        A  +Y  +  R   FE GF        DL   +   +T H W+ F   P +VN  +V +FY+NI E     ++V G                QW   +  
Subjt:  ARAKYVELLKRDFLFERGF------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRG---------------AQWRFSKTE

Query:  KRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASE-----------------ISGCWQKKRTQKARQGELVYGI
        ++T     L      W  F+K +LLPT+H++TVS +R++L  +I+  L+I +GKII  +                 I+  W+KK+ Q+    E++ GI
Subjt:  KRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASE-----------------ISGCWQKKRTQKARQGELVYGI

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.5e-1130.1Show/hide
Query:  RFINNLARAKYV-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR-----------------
        +F    A  +Y   +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V +FYAN+ + E   V VR                 
Subjt:  RFINNLARAKYV-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR-----------------

Query:  -----------------------------GAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEIS
                                     GA+W  S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI VG++I SEI 
Subjt:  -----------------------------GAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEIS

Query:  GCWQKK
         C  +K
Subjt:  GCWQKK

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.3e-1128.8Show/hide
Query:  LKRDFLFERGFSGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYAN--------------------------------IDEEEGF------------
        ++++F+++     + P F+   I  H W+LFC+ PE     +V +FY N                                IDE   F            
Subjt:  LKRDFLFERGFSGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYAN--------------------------------IDEEEGF------------

Query:  --QVIVRGAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK
           V + GA+W  S     T   + L   A  W  F+K RLLPTTH  TVS+E V L +++L   SI VG++I  EI  C  +K
Subjt:  --QVIVRGAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK

TYG52543.1 hypothetical protein ES288_D09G036700v1 [Gossypium darwinii]8.6e-1234.39Show/hide
Query:  ERGF---SGDL---PHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR--------------------GAQWRFSKTEKRTFQSAYLK
        E+GF   S DL   P  +R  I    WE FC      + ++V +FYA++  ++  +VIVR                    G+QW        + Q  YLK
Subjt:  ERGF---SGDL---PHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR--------------------GAQWRFSKTEKRTFQSAYLK

Query:  KEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK
          A  W  F++   +P +H ST+S E +LL +AIL   SI VGKII  EI  C +KK
Subjt:  KEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK

TrEMBL top hitse value%identityAlignment
A0A061F2U9 Uncharacterized protein1.2e-1434.39Show/hide
Query:  NRFINNLARAKY-VELLKRDFLFERGFSGDLPHFLRVG--ITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRGAQWRFSKTEKRTFQSAYLKK
        ++FI+  A  +Y   L+ +  + ERG    +  +  +   I +  W  FC +P++    VV +FYAN+ E       VRGAQW+ S  E  +F+ + +KK
Subjt:  NRFINNLARAKY-VELLKRDFLFERGFSGDLPHFLRVG--ITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRGAQWRFSKTEKRTFQSAYLKK

Query:  EANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKKR
        E   W+ F+  RLL +TH S V+++R +L +AI+   SI VGK+I+  I    + KR
Subjt:  EANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKKR

A0A2P5BCG4 Uncharacterized protein (Fragment)1.2e-1130.1Show/hide
Query:  RFINNLARAKYV-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR-----------------
        +F    A  +Y   +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V +FYAN+ + E   V VR                 
Subjt:  RFINNLARAKYV-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR-----------------

Query:  -----------------------------GAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEIS
                                     GA+W  S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI VG++I SEI 
Subjt:  -----------------------------GAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEIS

Query:  GCWQKK
         C  +K
Subjt:  GCWQKK

A0A2P5DAQ2 Uncharacterized protein2.1e-1128.8Show/hide
Query:  LKRDFLFERGFSGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYAN--------------------------------IDEEEGF------------
        ++++F+++     + P F+   I  H W+LFC+ PE     +V +FY N                                IDE   F            
Subjt:  LKRDFLFERGFSGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYAN--------------------------------IDEEEGF------------

Query:  --QVIVRGAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK
           V + GA+W  S     T   + L   A  W  F+K RLLPTTH  TVS+E V L +++L   SI VG++I  EI  C  +K
Subjt:  --QVIVRGAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK

A0A5D2B8V0 Uncharacterized protein4.2e-1234.39Show/hide
Query:  ERGF---SGDL---PHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR--------------------GAQWRFSKTEKRTFQSAYLK
        E+GF   S DL   P  +R  I    WE FC      + ++V +FYA++  ++  +VIVR                    G+QW        + Q  YLK
Subjt:  ERGF---SGDL---PHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVR--------------------GAQWRFSKTEKRTFQSAYLK

Query:  KEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK
          A  W  F++   +P +H ST+S E +LL +AIL   SI VGKII  EI  C +KK
Subjt:  KEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKK

A0A6A3B6J9 RT_RNaseH_2 domain-containing protein1.7e-1328.28Show/hide
Query:  ARAKYVELLKRDFLFERGF------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRG---------------AQWRFSKTE
        A  +Y  +  R   FE GF        DL   +   +T H W+ F   P +VN  +V +FY+NI E     ++V G                QW   +  
Subjt:  ARAKYVELLKRDFLFERGF------SGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEEGFQVIVRG---------------AQWRFSKTE

Query:  KRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASE-----------------ISGCWQKKRTQKARQGELVYGI
        ++T     L      W  F+K +LLPT+H++TVS +R++L  +I+  L+I +GKII  +                 I+  W+KK+ Q+    E++ GI
Subjt:  KRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASE-----------------ISGCWQKKRTQKARQGELVYGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACAAGAGCAAGAAAAGAGAGAGATAATAAGGAAGATGAGGTACCTGTTACCCCCGAAGTACCAAAACGAAGGGAAGAAAAAGAAAACCCAGAAGAAAAAGA
AGCTAAGAGGAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAATGTGTAGAAGTTGCGGATACGGAGGAAGTTCAAGAAGAAACACAGAGGAAGTTCAAGAAAAACAGA
CTGAGAATGTGCAAGAAGGACGAACAGAGGTTGCGCCTGAAAAGGGTAATGAACAAGAGCACGAGGCTCGAGTGGTGGTTATCATGCCGGAAGTGCCAAGACGTCGCCGC
CGGCAGCAAAAAGCCAGAGAAAAAAAGGCCAAAGCAAGAAAGGAAGCAGAGAAAAAGGCTGAGGAAGAAATCTTGCTCAAACAAAGGGAAGATAACGAAACAGAGGAACC
AAGGTTGTCGTACAATCGCTTCATCAACAATCTTGCCAGAGCAAAGTATGTTGAGCTGCTGAAGAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTTCCACACT
TTCTTCGGGTCGGCATTACGAACCACGGTTGGGAGTTATTCTGTTCCAAGCCTGAATCTGTGAACACGCAGGTAGTGTGCAAGTTTTATGCAAACATTGACGAGGAAGAA
GGTTTCCAAGTTATTGTTCGAGGGGCGCAGTGGAGATTTTCGAAAACAGAGAAAAGGACATTTCAGTCAGCCTATCTGAAGAAGGAAGCAAATACATGGATGGGATTCAT
CAAACAAAGATTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTTCTGGCTTTTGCGATTTTAAGGTCTCTCAGTATTTATGTGGGCAAGATTATTG
CGAGTGAGATATCTGGATGCTGGCAGAAGAAGCGTACGCAAAAAGCACGTCAGGGTGAGCTTGTTTATGGCATTCACAAAATTTTAGAACAACTTGCTCTGTCGGCCAGT
AGGCAAGAGTTTGCCGAGAGGCAATCTCAAACCTTCTGGAACTATGTTAAACGTCGTGATGCCAACTTGAAGAAGGCGCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACAAGAGCAAGAAAAGAGAGAGATAATAAGGAAGATGAGGTACCTGTTACCCCCGAAGTACCAAAACGAAGGGAAGAAAAAGAAAACCCAGAAGAAAAAGA
AGCTAAGAGGAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAATGTGTAGAAGTTGCGGATACGGAGGAAGTTCAAGAAGAAACACAGAGGAAGTTCAAGAAAAACAGA
CTGAGAATGTGCAAGAAGGACGAACAGAGGTTGCGCCTGAAAAGGGTAATGAACAAGAGCACGAGGCTCGAGTGGTGGTTATCATGCCGGAAGTGCCAAGACGTCGCCGC
CGGCAGCAAAAAGCCAGAGAAAAAAAGGCCAAAGCAAGAAAGGAAGCAGAGAAAAAGGCTGAGGAAGAAATCTTGCTCAAACAAAGGGAAGATAACGAAACAGAGGAACC
AAGGTTGTCGTACAATCGCTTCATCAACAATCTTGCCAGAGCAAAGTATGTTGAGCTGCTGAAGAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTTCCACACT
TTCTTCGGGTCGGCATTACGAACCACGGTTGGGAGTTATTCTGTTCCAAGCCTGAATCTGTGAACACGCAGGTAGTGTGCAAGTTTTATGCAAACATTGACGAGGAAGAA
GGTTTCCAAGTTATTGTTCGAGGGGCGCAGTGGAGATTTTCGAAAACAGAGAAAAGGACATTTCAGTCAGCCTATCTGAAGAAGGAAGCAAATACATGGATGGGATTCAT
CAAACAAAGATTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTTCTGGCTTTTGCGATTTTAAGGTCTCTCAGTATTTATGTGGGCAAGATTATTG
CGAGTGAGATATCTGGATGCTGGCAGAAGAAGCGTACGCAAAAAGCACGTCAGGGTGAGCTTGTTTATGGCATTCACAAAATTTTAGAACAACTTGCTCTGTCGGCCAGT
AGGCAAGAGTTTGCCGAGAGGCAATCTCAAACCTTCTGGAACTATGTTAAACGTCGTGATGCCAACTTGAAGAAGGCGCTATAG
Protein sequenceShow/hide protein sequence
MAKTRARKERDNKEDEVPVTPEVPKRREEKENPEEKEAKRRRRQQRAEDQEMCRSCGYGGSSRRNTEEVQEKQTENVQEGRTEVAPEKGNEQEHEARVVVIMPEVPRRRR
RQQKAREKKAKARKEAEKKAEEEILLKQREDNETEEPRLSYNRFINNLARAKYVELLKRDFLFERGFSGDLPHFLRVGITNHGWELFCSKPESVNTQVVCKFYANIDEEE
GFQVIVRGAQWRFSKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIYVGKIIASEISGCWQKKRTQKARQGELVYGIHKILEQLALSAS
RQEFAERQSQTFWNYVKRRDANLKKAL