CuGenDBv2

Lag0009033 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009033
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:34316401..34316883
RNA-Seq Expression: Lag0009033
Synteny: Lag0009033
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] | e value: 8.5e-36 | % identity: 52.2
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHSL-TGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M CI++ S+SILING   G I PSRGLRQGD LSP +FLLC EGLSA++  A  + L TG++I+R CP +THLFFADDS++F +A  EE  + ++I+  Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHSL-TGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
        E+AS Q IN +KS ++FS N   +TRD + N+LG         YLGLPS   R KS+ F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

XP_023892052.1 uncharacterized protein LOC112004058 [Quercus suber] | e value: 2.5e-35 | % identity: 48.45
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M C+++ SFSI I+G+  G I+PSRGLRQGD LSPY+FLLC EG S+MLA A     L GV I R  PSI+HL FADDSL+F +A+ EE ++   +++ Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF
          +S QCIN  KS VYFS N  ++ R+ +   +G+R ++ F  YL LP+   R K + F +
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber] | e value: 2.1e-34 | % identity: 50.93
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M C+ST SFS+ ING   G I PSRGLRQGD LSPY+FLLC EG +++L+ A S   L GV I R  P I++L FADDSLIF RA  EE ++  N ++ Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF
         +AS QCIN  KS  YFS N     R  +   LG+R +D F  YLGLP+   R K + F +
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF

XP_023915663.1 uncharacterized protein LOC112027222 [Quercus suber] | e value: 7.2e-35 | % identity: 50.93
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLA-YACSHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M C+ST SF++ ING+  G IKP+RG+RQGD  SPY+FLLC EG +A+LA       L GV+I R  P I+HL FADDSL+F RATMEE     + ++ Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLA-YACSHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF
          AS QCIN  KS VYFS N     R+ + N LG++ +D F  YLGLP+   R K + F F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF

XP_030939512.1 uncharacterized protein LOC115964316 [Quercus lobata] | e value: 9.4e-35 | % identity: 52.83
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACS-HSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M C++T SFS+L+NG   G + PSRG+RQGD LSPY+FLLCTEG +++L  A S  SL GV+I RS P IT+L FADDSLIF RAT  EF    +I++ Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACS-HSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
         KAS QCINL KS VYFS N  V  +     +LG++ +  F  YLGLP+   + K   F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FT83 Reverse transcriptase domain-containing protein | e value: 2.7e-35 | % identity: 50.94
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSH-SLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M+CI+T S+S+LINGE  G I PSRGLRQGD +SPY+FL+C EGL+ +L  A S   + GV+ISR  P +THLFFADDSL+F RAT  E +  Q+I+  Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSH-SLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
        EKAS Q +N +K+ ++FS+N    T++ + N+LG+  I  +  YLGLPS   + K + F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

A0A2N9G656 Reverse transcriptase domain-containing protein | e value: 1.2e-35 | % identity: 50.94
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYAC-SHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M+C+ T S+S+L+NGE  G+IKPSRGLRQGD LSPY+FL+C EGL A++A A  +  + GV++ R  P ITHLFFADDSL+F +AT EE    QNI+ +Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYAC-SHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
        E+AS Q +N  K+ ++FS N   + ++ L N+LG+  I  +  YLGLPS   R K   F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

A0A2N9GZI9 Reverse transcriptase domain-containing protein | e value: 3.5e-35 | % identity: 52.83
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSH-SLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M+CIST S+SILINGE  G IKPSRGLRQGD LSPY+FL C EGL ++L  A +  ++ GV+ISR  P +THLFFADDSL+F +AT  E    Q+I+  Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSH-SLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
        E AS Q IN  K+ ++FSK+     ++H+  MLG+ +I  +  YLGLPS   R K   F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

A0A2N9HV89 Reverse transcriptase domain-containing protein | e value: 1.2e-35 | % identity: 50.94
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYAC-SHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M+C+ T S+S+L+NGE  G+IKPSRGLRQGD LSPY+FL+C EGL A++A A  +  + GV++ R  P ITHLFFADDSL+F +AT EE    QNI+ +Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYAC-SHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
        E+AS Q +N  K+ ++FS N   + ++ L N+LG+  I  +  YLGLPS   R K   F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

A0A2N9J7Z5 Reverse transcriptase domain-containing protein | e value: 1.2e-35 | % identity: 50.94
Query:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYAC-SHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY
        M+C+ T S+S+L+NGE  G+IKPSRGLRQGD LSPY+FL+C EGL A++A A  +  + GV++ R  P ITHLFFADDSL+F +AT EE    QNI+ +Y
Subjt:  MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYAC-SHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENY

Query:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF
        E+AS Q +N  K+ ++FS N   + ++ L N+LG+  I  +  YLGLPS   R K   F
Subjt:  EKASSQCINLNKSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDF

SwissProt top hits (e value, % identity, alignment)
P92555 Uncharacterized mitochondrial protein AtMg01250 | e value: 5.2e-12 | % identity: 52.94
Query:  LINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDS
        +ING   G++ PSRGLRQGD LSPY+F+LCTE LS +   A     L G+ +S + P I HL FADD+
Subjt:  LINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDS

Arabidopsis top hits (e value, % identity, alignment)
ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.7e-13 | % identity: 52.94
Query:  LINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDS
        +ING   G++ PSRGLRQGD LSPY+F+LCTE LS +   A     L G+ +S + P I HL FADD+
Subjt:  LINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHS-LTGVAISRSCPSITHLFFADDS


Sequences
CDS sequence
ATGGATTGTATCTCAACTGCATCATTCTCTATCCTGATTAATGGGGAGGTGGTTGGTATTATCAAGCCTTCCCGAGGACTGCGACAAGGGGACTTCTTGTCTCCATATAT
GTTTTTATTATGCACGGAAGGCCTCTCAGCTATGCTAGCTTATGCCTGCTCCCATTCGCTAACCGGGGTAGCTATTTCGAGATCCTGCCCAAGCATTACTCATTTATTCT
TCGCAGACGATAGTTTGATATTCCTTAGAGCTACTATGGAGGAGTTTGAGATTTTTCAGAATATTATGGAAAACTATGAGAAAGCTTCTAGTCAGTGTATCAATCTTAAT
AAATCAATGGTATATTTCTCTAAAAACATGGTCGTGGATACGAGAGATCACCTCAGTAACATGCTGGGTATGAGAATGATTGATTCCTTTGGTCCTTACCTTGGATTGCC
TTCAACTTTCCACAGGGGCAAAAGCAAAGACTTCCAGTTTTAA
mRNA sequence
ATGGATTGTATCTCAACTGCATCATTCTCTATCCTGATTAATGGGGAGGTGGTTGGTATTATCAAGCCTTCCCGAGGACTGCGACAAGGGGACTTCTTGTCTCCATATAT
GTTTTTATTATGCACGGAAGGCCTCTCAGCTATGCTAGCTTATGCCTGCTCCCATTCGCTAACCGGGGTAGCTATTTCGAGATCCTGCCCAAGCATTACTCATTTATTCT
TCGCAGACGATAGTTTGATATTCCTTAGAGCTACTATGGAGGAGTTTGAGATTTTTCAGAATATTATGGAAAACTATGAGAAAGCTTCTAGTCAGTGTATCAATCTTAAT
AAATCAATGGTATATTTCTCTAAAAACATGGTCGTGGATACGAGAGATCACCTCAGTAACATGCTGGGTATGAGAATGATTGATTCCTTTGGTCCTTACCTTGGATTGCC
TTCAACTTTCCACAGGGGCAAAAGCAAAGACTTCCAGTTTTAA
Protein sequence
MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHSLTGVAISRSCPSITHLFFADDSLIFLRATMEEFEIFQNIMENYEKASSQCINLN
KSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF
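
The fields of this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself: it assumes the genome location uses inclusive 1-based coordinates and that the gene is a single exon with no UTRs (which would be consistent with the mRNA and CDS sequences listed here being identical). The helper name `span_length` is hypothetical.

```python
import re

def span_length(location: str) -> int:
    """Return the inclusive length of a 'chrom:start..end' location string."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return end - start + 1

# Protein sequence copied from the record above (two wrapped lines joined).
protein = (
    "MDCISTASFSILINGEVVGIIKPSRGLRQGDFLSPYMFLLCTEGLSAMLAYACSHSLTGVAISRSCPSIT"
    "HLFFADDSLIFLRATMEEFEIFQNIMENYEKASSQCINLN"
    "KSMVYFSKNMVVDTRDHLSNMLGMRMIDSFGPYLGLPSTFHRGKSKDFQF"
)

# Assumption: one codon per residue plus a single stop codon, no introns or UTRs.
expected_cds_length = 3 * (len(protein) + 1)

location_length = span_length("chr9:34316401..34316883")
print(location_length, expected_cds_length)
```

Under these assumptions the two lengths agree, which suggests the annotated locus covers exactly the coding sequence.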