; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016010 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016010
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:31388736..31389386
RNA-Seq ExpressionLag0016010
SyntenyLag0016010
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5443558.1 hypothetical protein F2P56_036105, partial [Juglans regia]4.2e-2333.33Show/hide
Query:  SEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPSGLF
        +EK+GG  R+ + M +FR+ + DC LRDLGFRG  +TWCN R+ +  I E LDRF+GN  FC LFP F+V +   A SDH PV    E L +      LF
Subjt:  SEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPSGLF

Query:  ---------------------------------LNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIK---DAYSVTPVDFSIIHSLEAELDKLLEEEEIYW
                                          ++  C  +L  W + +   +  ++   KH +K   D  S+ P D  ++     E+   LE EE+ W
Subjt:  ---------------------------------LNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIK---DAYSVTPVDFSIIHSLEAELDKLLEEEEIYW

Query:  HQRPRENWLKWGD
         QR R  WL+ GD
Subjt:  HQRPRENWLKWGD

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]5.3e-2639.44Show/hide
Query:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS
        +W+ E    ++     + +FR+ +D C L D+GF+G +FTWCN R    ++++ LDRF+ N+ F  +FP    ++  W+ + H            N S S
Subjt:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS

Query:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYS-VTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWG
            ++ + SS LRHWGR     L  +I+ +K AI DAY+   P+DF+IIH+LE +L  LLE EEI+W QR RE+WLKWG
Subjt:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYS-VTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWG

XP_022158772.1 uncharacterized protein LOC111025237 [Momordica charantia]3.8e-2435.56Show/hide
Query:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS
        +W+ E    ++     + +FR+ +D C L D+GF G  FTWCN R    ++++ LDRF+ N++F  LFP   + ++ W+ SDHR + L +  LP      
Subjt:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS

Query:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWGD
                 SS+++   R        E   +    +   S  P+DF+IIH +E +L  LLE EEI+W QR RE+WLKWGD
Subjt:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWGD

XP_042952138.1 uncharacterized protein LOC122289227 [Carya illinoinensis]1.8e-2134.91Show/hide
Query:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPV--ELSVEPLPR-NGS---
        EK G A R F +M  FR+A++DC LR++ F+GD FTW N+R       E LDR +GN+A+   FP   +S +   CSDH P+   LS E L R +GS   
Subjt:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPV--ELSVEPLPR-NGS---

Query:  --PSGLFL---------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVD-FSIIHSLEAELDKLLEEEEIYWH
           +  FL                            L  CS  L  W +  ++     IQ K++ IK   S    D  S I+ L+ E+D L+E+E++ W 
Subjt:  --PSGLFL---------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVD-FSIIHSLEAELDKLLEEEEIYWH

Query:  QRPRENWLKWGD
        QR ++ WLK GD
Subjt:  QRPRENWLKWGD

XP_042972741.1 uncharacterized protein LOC122304536 [Carya illinoinensis]6.0e-2234.91Show/hide
Query:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPV--ELSVEPLPR-NGS---
        EK G A R F +M  FR+A++DC LR++ F+GD FTW N+R       E LDR +GN+A+   FP   +S +   CSDH P+   LS E L R +GS   
Subjt:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPV--ELSVEPLPR-NGS---

Query:  --PSGLFL---------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSI-IHSLEAELDKLLEEEEIYWH
           +  FL                            L  CS  L  W +  ++     IQ K++ IK   S    D +  I+ L+ E+D L+EEE++ W 
Subjt:  --PSGLFL---------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSI-IHSLEAELDKLLEEEEIYWH

Query:  QRPRENWLKWGD
        QR ++ WLK GD
Subjt:  QRPRENWLKWGD

TrEMBL top hitse value%identityAlignment
A0A2N9I921 Reverse transcriptase domain-containing protein1.1e-2133.81Show/hide
Query:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELS---------VEPL--
        EKQG  ++    M SFR ALDDCG  DLG+ G  FTWCN R     ++E LDR V + A+   FP   V ++D+  SDH+P+ LS          +P   
Subjt:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELS---------VEPL--

Query:  --------------------PRNG-SPSGLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIK--DAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQR
                            P NG S   +   L+ C ++L++W +     +  ++Q+K+  +K  +  S+      +I SL AE+  LL +EE  W QR
Subjt:  --------------------PRNG-SPSGLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIK--DAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQR

Query:  PRENWLKWGD
         R  WLK GD
Subjt:  PRENWLKWGD

A0A2N9IWN7 Uncharacterized protein1.1e-2131.13Show/hide
Query:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSV-EPLPRNGSPSGLF
        EK+GG  R+   M  FR A+D CG  DLG+ G  FTWCN R     ++E LDR +   ++  LFP   V ++    SDH P+ +    PL     P+ +F
Subjt:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSV-EPLPRNGSPSGLF

Query:  L---------------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDA--YSVTPVDFSIIHSLEAELDKLLEEEEIYWH
                                           L SC + LR W RD+   +  E+++K   +++A   S+        H+L+ E+  LL  EE  W 
Subjt:  L---------------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDA--YSVTPVDFSIIHSLEAELDKLLEEEEIYWH

Query:  QRPRENWLKWGD
        QR R+ WL+WGD
Subjt:  QRPRENWLKWGD

A0A2N9IXK4 RNase H domain-containing protein3.4e-2331.13Show/hide
Query:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPSGLFL
        EKQGG  R+   M  FR A+D CG  DLGF G  FTWCN R  +  ++E LDR +   ++  LFP   V ++    SDH P+     P P +   S    
Subjt:  EKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPSGLFL

Query:  ----------------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDA--YSVTPVDFSIIHSLEAELDKLLEEEEIYWH
                                           L +C + LR W RD+   + +E+++K   +++A   S+     +  H+L+ E++ LL  EE  W 
Subjt:  ----------------------------------NLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDA--YSVTPVDFSIIHSLEAELDKLLEEEEIYWH

Query:  QRPRENWLKWGD
        QR R+ WL+WGD
Subjt:  QRPRENWLKWGD

A0A6J1DX30 uncharacterized protein LOC1110248742.6e-2639.44Show/hide
Query:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS
        +W+ E    ++     + +FR+ +D C L D+GF+G +FTWCN R    ++++ LDRF+ N+ F  +FP    ++  W+ + H            N S S
Subjt:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS

Query:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYS-VTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWG
            ++ + SS LRHWGR     L  +I+ +K AI DAY+   P+DF+IIH+LE +L  LLE EEI+W QR RE+WLKWG
Subjt:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYS-VTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWG

A0A6J1DY29 uncharacterized protein LOC1110252371.8e-2435.56Show/hide
Query:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS
        +W+ E    ++     + +FR+ +D C L D+GF G  FTWCN R    ++++ LDRF+ N++F  LFP   + ++ W+ SDHR + L +  LP      
Subjt:  MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPS

Query:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWGD
                 SS+++   R        E   +    +   S  P+DF+IIH +E +L  LLE EEI+W QR RE+WLKWGD
Subjt:  GLFLNLSSCSSRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWGD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGATTCTGAGAAGCAAGGTGGTGCGACACGAGCATTTGATTTAATGTCTAGCTTTCGCTCTGCCTTGGATGATTGTGGTCTTCGTGATCTTGGATTTCGTGGTGA
TGTGTTTACGTGGTGTAATCGTCGTTCGGTGGCTGTTCGTATTTTTGAACTTTTGGATAGATTTGTTGGGAATGAGGCGTTCTGTCAATTGTTCCCTCATTTTCTTGTTT
CAAATGTAGACTGGGCGTGTTCTGATCACCGGCCAGTGGAACTTTCGGTGGAACCACTTCCTCGAAATGGGTCTCCGAGTGGTTTGTTCCTTAATTTGTCATCTTGCTCT
TCTCGGTTGAGGCATTGGGGTAGGGACGCGAACAGGTTTCTTATGACAGAGATTCAGCAAAAGAAACATGCCATTAAAGATGCATATTCGGTGACCCCTGTGGATTTTTC
GATTATTCACTCTTTAGAGGCAGAGTTGGACAAGCTTTTGGAGGAAGAAGAAATCTATTGGCATCAGCGTCCTAGGGAAAACTGGCTCAAATGGGGTGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGATTCTGAGAAGCAAGGTGGTGCGACACGAGCATTTGATTTAATGTCTAGCTTTCGCTCTGCCTTGGATGATTGTGGTCTTCGTGATCTTGGATTTCGTGGTGA
TGTGTTTACGTGGTGTAATCGTCGTTCGGTGGCTGTTCGTATTTTTGAACTTTTGGATAGATTTGTTGGGAATGAGGCGTTCTGTCAATTGTTCCCTCATTTTCTTGTTT
CAAATGTAGACTGGGCGTGTTCTGATCACCGGCCAGTGGAACTTTCGGTGGAACCACTTCCTCGAAATGGGTCTCCGAGTGGTTTGTTCCTTAATTTGTCATCTTGCTCT
TCTCGGTTGAGGCATTGGGGTAGGGACGCGAACAGGTTTCTTATGACAGAGATTCAGCAAAAGAAACATGCCATTAAAGATGCATATTCGGTGACCCCTGTGGATTTTTC
GATTATTCACTCTTTAGAGGCAGAGTTGGACAAGCTTTTGGAGGAAGAAGAAATCTATTGGCATCAGCGTCCTAGGGAAAACTGGCTCAAATGGGGTGATTGA
Protein sequenceShow/hide protein sequence
MWDSEKQGGATRAFDLMSSFRSALDDCGLRDLGFRGDVFTWCNRRSVAVRIFELLDRFVGNEAFCQLFPHFLVSNVDWACSDHRPVELSVEPLPRNGSPSGLFLNLSSCS
SRLRHWGRDANRFLMTEIQQKKHAIKDAYSVTPVDFSIIHSLEAELDKLLEEEEIYWHQRPRENWLKWGD