; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036498 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036498
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:47430286..47432308
RNA-Seq ExpressionLag0036498
SyntenyLag0036498
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG63781.1 hypothetical protein EZV62_010775 [Acer yangbiense]1.0e-1623.71Show/hide
Query:  VAQLASLKVMAKEKASIFHL-QENEDKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQ-EHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSL
        +A+L     +AKE  ++  + +E      K +   ++ K+L+++++N   F+G++ +IW    H  ++ V  N+F+  FKN   +  + + GPW +  SL
Subjt:  VAQLASLKVMAKEKASIFHL-QENEDKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQ-EHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSL

Query:  LLLEEPRGDINVEDMDFKFISFWQ----------------------GSGRGRMGRGWR----------------NNIYVDE----EDETDKQHEIGPAKD
        ++LE+   + NV  + F    FW                       G     +G  W                  N Y  E           +++   +D
Subjt:  LLLEEPRGDINVEDMDFKFISFWQ----------------------GSGRGRMGRGWR----------------NNIYVDE----EDETDKQHEIGPAKD

Query:  NPQRFTGIYGNPHHE-------------KHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF
            + G +G                  K + +W L++R RD   LPW+ GGDFNE+ S  +K+GG  +    M +F+ +++ C L+DLGF
Subjt:  NPQRFTGIYGNPHHE-------------KHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia]2.1e-1739.84Show/hide
Query:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE
        E+++QL   LK+  +EK  IF + E E         +  +K+    ++CK+LT + I   VF+ MMPRIW   +  I+ VG N+FLC FK  R K  I  
Subjt:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE

Query:  VGPWFYDKSLLLLEEPRGDINVEDMDFK
         GPWF+DKS+++LEEPR + N  +++F+
Subjt:  VGPWFYDKSLLLLEEPRGDINVEDMDFK

XP_022154991.1 uncharacterized protein LOC111022134 isoform X2 [Momordica charantia]2.1e-1739.84Show/hide
Query:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE
        E+++QL   LK+  +EK  IF + E E         +  +K+    ++CK+LT + I   VF+ MMPRIW   +  I+ VG N+FLC FK  R K  I  
Subjt:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE

Query:  VGPWFYDKSLLLLEEPRGDINVEDMDFK
         GPWF+DKS+++LEEPR + N  +++F+
Subjt:  VGPWFYDKSLLLLEEPRGDINVEDMDFK

XP_023886153.1 uncharacterized protein LOC111998282 [Quercus suber]3.6e-1738.24Show/hide
Query:  LEEPRGDINVEDMDFKFISFWQGSGRGRMGRGWRNNI--YVDEEDETDKQHEIGPAKDNPQRFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFN
        L+E + D++ E++ F      + +  G +   W+NNI  +V+   +      +G  K+   RFTG YG P   K  E+W L++       LPW+  GDFN
Subjt:  LEEPRGDINVEDMDFKFISFWQGSGRGRMGRGWRNNI--YVDEEDETDKQHEIGPAKDNPQRFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFN

Query:  EITSNTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF
        EIT  +EK+GG VR    MQ FRD+I+ CG  DLGF
Subjt:  EITSNTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF

XP_024190127.1 uncharacterized protein LOC112194102 [Rosa chinensis]1.4e-1634.85Show/hide
Query:  NVEDMDFKFISFWQGSGRGRMG---RGWRNNIYVDEEDETDKQHEIGPAKDNPQ---RFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFNEITS
        N   +D +F+   +G G  R G     W++++ V+ +  +D   ++   ++N Q   +FTG+YG P  E  H+TW L+++      LPW++GGDFNEI+S
Subjt:  NVEDMDFKFISFWQGSGRGRMG---RGWRNNIYVDEEDETDKQHEIGPAKDNPQ---RFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFNEITS

Query:  NTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF
          +KMGG++R  R M  F++++  C L D+ F
Subjt:  NTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF

TrEMBL top hitse value%identityAlignment
A0A2N9EFF7 Uncharacterized protein1.6e-1540.68Show/hide
Query:  GSGRGRMGRG----WRNNIYVDEEDETDK--QHEIGPAKDNPQRFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGGLVRPKRD
        G  R R G G    W N + V  +  +      EI P      RFTG YGNP H +  E+W L++R      LPW++ GDFNEI  N EK+G   RP+R 
Subjt:  GSGRGRMGRG----WRNNIYVDEEDETDK--QHEIGPAKDNPQRFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGGLVRPKRD

Query:  MQEFRDSINLCGLSDLGF
        M+ FR++++ C L DLG+
Subjt:  MQEFRDSINLCGLSDLGF

A0A5C7I3G0 DUF4283 domain-containing protein5.1e-1723.71Show/hide
Query:  VAQLASLKVMAKEKASIFHL-QENEDKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQ-EHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSL
        +A+L     +AKE  ++  + +E      K +   ++ K+L+++++N   F+G++ +IW    H  ++ V  N+F+  FKN   +  + + GPW +  SL
Subjt:  VAQLASLKVMAKEKASIFHL-QENEDKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQ-EHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSL

Query:  LLLEEPRGDINVEDMDFKFISFWQ----------------------GSGRGRMGRGWR----------------NNIYVDE----EDETDKQHEIGPAKD
        ++LE+   + NV  + F    FW                       G     +G  W                  N Y  E           +++   +D
Subjt:  LLLEEPRGDINVEDMDFKFISFWQ----------------------GSGRGRMGRGWR----------------NNIYVDE----EDETDKQHEIGPAKD

Query:  NPQRFTGIYGNPHHE-------------KHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF
            + G +G                  K + +W L++R RD   LPW+ GGDFNE+ S  +K+GG  +    M +F+ +++ C L+DLGF
Subjt:  NPQRFTGIYGNPHHE-------------KHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGGLVRPKRDMQEFRDSINLCGLSDLGF

A0A6J1CQJ5 uncharacterized protein LOC1110134133.3e-1641.23Show/hide
Query:  QLASLKVMAKEKASIFHLQENE-DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSLLLL
        Q+  LK+  +EK  IF + E+E +  +K+    ++CK+LT + I   VF+ MMPRIW   +  I+ VG N+FLC FK  R K  I   GP F+DKS+++L
Subjt:  QLASLKVMAKEKASIFHLQENE-DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSLLLL

Query:  EEPRGDINVEDMDF
        EEPR + N  ++++
Subjt:  EEPRGDINVEDMDF

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X11.0e-1739.84Show/hide
Query:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE
        E+++QL   LK+  +EK  IF + E E         +  +K+    ++CK+LT + I   VF+ MMPRIW   +  I+ VG N+FLC FK  R K  I  
Subjt:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE

Query:  VGPWFYDKSLLLLEEPRGDINVEDMDFK
         GPWF+DKS+++LEEPR + N  +++F+
Subjt:  VGPWFYDKSLLLLEEPRGDINVEDMDFK

A0A6J1DQC9 uncharacterized protein LOC111022134 isoform X21.0e-1739.84Show/hide
Query:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE
        E+++QL   LK+  +EK  IF + E E         +  +K+    ++CK+LT + I   VF+ MMPRIW   +  I+ VG N+FLC FK  R K  I  
Subjt:  EIVAQLAS-LKVMAKEKASIFHLQENE---------DKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQE

Query:  VGPWFYDKSLLLLEEPRGDINVEDMDFK
         GPWF+DKS+++LEEPR + N  +++F+
Subjt:  VGPWFYDKSLLLLEEPRGDINVEDMDFK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGATTCGAATGAAGTTGAGAAAGAAATAGTGGCTCAATTAGCAAGTCTGAAGGTCATGGCTAAAGAAAAAGCAAGCATCTTCCATTTGCAAGAAAACGAAGACAA
ATCAGAGAAGAAATTAATCAATGTGATGCTTTGCAAGATTTTAACTCACAGGGAGATAAACCTGAATGTGTTTAGAGGGATGATGCCTCGCATATGGGGACAGGAACATA
CAATCATTGATCACGTGGGTTCTAATGTATTTCTTTGCATGTTCAAGAATGCAAGGATAAAGGGATACATTCAAGAAGTAGGACCTTGGTTTTATGACAAATCCCTTCTT
TTGCTAGAAGAACCAAGAGGAGATATCAACGTGGAGGACATGGATTTCAAGTTTATATCTTTTTGGCAGGGCAGTGGAAGGGGTAGAATGGGGAGAGGATGGAGAAACAA
CATATATGTTGATGAAGAGGATGAAACAGACAAACAACATGAGATTGGTCCAGCAAAAGACAACCCTCAGAGGTTTACTGGAATTTATGGCAACCCTCATCATGAGAAGC
ATCATGAGACATGGACCCTCATGAAAAGATCGAGGGATACTCCGGGATTGCCATGGGTCGTGGGTGGTGATTTCAATGAGATTACTAGCAACACTGAAAAAATGGGAGGG
TTGGTTCGGCCAAAAAGAGATATGCAAGAATTTAGAGACAGTATAAATCTTTGTGGCCTCAGTGATTTGGGGTTCGACTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGGATTCGAATGAAGTTGAGAAAGAAATAGTGGCTCAATTAGCAAGTCTGAAGGTCATGGCTAAAGAAAAAGCAAGCATCTTCCATTTGCAAGAAAACGAAGACAA
ATCAGAGAAGAAATTAATCAATGTGATGCTTTGCAAGATTTTAACTCACAGGGAGATAAACCTGAATGTGTTTAGAGGGATGATGCCTCGCATATGGGGACAGGAACATA
CAATCATTGATCACGTGGGTTCTAATGTATTTCTTTGCATGTTCAAGAATGCAAGGATAAAGGGATACATTCAAGAAGTAGGACCTTGGTTTTATGACAAATCCCTTCTT
TTGCTAGAAGAACCAAGAGGAGATATCAACGTGGAGGACATGGATTTCAAGTTTATATCTTTTTGGCAGGGCAGTGGAAGGGGTAGAATGGGGAGAGGATGGAGAAACAA
CATATATGTTGATGAAGAGGATGAAACAGACAAACAACATGAGATTGGTCCAGCAAAAGACAACCCTCAGAGGTTTACTGGAATTTATGGCAACCCTCATCATGAGAAGC
ATCATGAGACATGGACCCTCATGAAAAGATCGAGGGATACTCCGGGATTGCCATGGGTCGTGGGTGGTGATTTCAATGAGATTACTAGCAACACTGAAAAAATGGGAGGG
TTGGTTCGGCCAAAAAGAGATATGCAAGAATTTAGAGACAGTATAAATCTTTGTGGCCTCAGTGATTTGGGGTTCGACTCTTAA
Protein sequenceShow/hide protein sequence
MMDSNEVEKEIVAQLASLKVMAKEKASIFHLQENEDKSEKKLINVMLCKILTHREINLNVFRGMMPRIWGQEHTIIDHVGSNVFLCMFKNARIKGYIQEVGPWFYDKSLL
LLEEPRGDINVEDMDFKFISFWQGSGRGRMGRGWRNNIYVDEEDETDKQHEIGPAKDNPQRFTGIYGNPHHEKHHETWTLMKRSRDTPGLPWVVGGDFNEITSNTEKMGG
LVRPKRDMQEFRDSINLCGLSDLGFDS