; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007741 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007741
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr9:4113897..4116625
RNA-Seq ExpressionLag0007741
SyntenyLag0007741
Gene Ontology termsGO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]5.3e-2940.27Show/hide
Query:  MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------
        + K LR   +P+  S   S  S   DE+WE++DLR ASAIRL LAKN+LANVHGISTAK+LWEKLE +YQ + ISNRLYLKEQF+ L             
Subjt:  MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------

Query:  -------------------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKGCWECRQSG
                                                   KETL + DVT KLLSEE+RL S G  S E + L+  N KKK    +   CW+C QSG
Subjt:  -------------------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKGCWECRQSG

Query:  RMKKDCPNREGCDVSSSTSLHTDIGL
         +K++CP   G D +S++    ++ +
Subjt:  RMKKDCPNREGCDVSSSTSLHTDIGL

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]9.1e-2940.27Show/hide
Query:  MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------
        + K LR   +P+  S   S  S   DE+WE++DLR ASAIRL LAKN+LANVHGISTAK+LWEKLE +YQ + I NRLYLKEQF+ L             
Subjt:  MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------

Query:  -------------------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKGCWECRQSG
                                                   KETL + DVT KLLSEE+RL S G  S E + L+  N KKK    +   CW+C QSG
Subjt:  -------------------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKGCWECRQSG

Query:  RMKKDCPNREGCDVSSSTSLHTDIGL
         +K++CP   G D +SS+    ++ +
Subjt:  RMKKDCPNREGCDVSSSTSLHTDIGL

KAG7577502.1 F-box associated domain type 1 [Arabidopsis thaliana x Arabidopsis arenosa]1.5e-2840.44Show/hide
Query:  MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------
        + K L+   +P+ G+  G  K  +SD DWE++DLR ASAIRL LAKNILANVHGISTAKELWEKLE +YQA+ +SNR+YLKE+F+ L             
Subjt:  MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------

Query:  -------------------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKG--CWECRQ
                                                   KE + F +VTSKL SEE+RL +       +S LVA N  +KKE+V KK   CW C Q
Subjt:  -------------------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKG--CWECRQ

Query:  SGRMKKDCPNREGCDVSSSTSLHTD
        SG +K++CPN  G     ++++  D
Subjt:  SGRMKKDCPNREGCDVSSSTSLHTD

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]2.1e-4659.09Show/hide
Query:  GGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------------------
        GGSSRGSKKS+MS EDWEEMDLR ASAIR +LAKNILANVH ISTAKELWEKLEA+YQA+ ISNRLYLKEQF+ L                         
Subjt:  GGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------------------

Query:  -------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKG-CWECRQSGRMKKDCPNR
                                       K+TL+F +VTSKLLSEERRLKSEGR S EDS LV SNWKKKK+SVQKK  CW C QSG MKKDCPNR
Subjt:  -------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKG-CWECRQSGRMKKDCPNR

XP_035543057.1 uncharacterized protein LOC118346128 [Juglans regia]2.6e-3151.85Show/hide
Query:  ELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERR
        E++S    + +   +S MSDEDWE++DLR ASAIRL LAKN+LAN+HGISTAKELWEKLE +YQ + +SNR+YLKEQF+ L  +    +VTSKL SEERR
Subjt:  ELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERR

Query:  LKSEGRASQED-STLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSSTSLHTD
        L         + + +VA N KKK     K  CW C QSG +KK+CP       S S S++ D
Subjt:  LKSEGRASQED-STLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSSTSLHTD

TrEMBL top hitse value%identityAlignment
A0A2G2VS38 CCHC-type domain-containing protein1.8e-2742.71Show/hide
Query:  SSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML---------------------------
        SS+ S+KS +SDE+WEE+D++  S IRL LAK +L NV G+ST KELWEKLE +YQ +SISNRLYLKEQF+ L                           
Subjt:  SSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML---------------------------

Query:  -------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSS
                                 KET+ F +VTSKL+SEE+RLK+    S EDS LV    K K+ S +K  CW C+Q G +K +CPN     V SS
Subjt:  -------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSS

A0A6A2WZ82 Mitogen-activated protein kinase 9-like4.5e-2640.74Show/hide
Query:  SSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML---------------------------
        SS    KS MS+E+WEE+D+R AS IRL LAKN+LANV   S+ KELWEKLE MYQA+S+SNRLYLKE+F+ L                           
Subjt:  SSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML---------------------------

Query:  -----------------------------KETLSFVDVTSKLLSEERRLKS-EGRASQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCD
                                     KE ++F +VTSKL+SEERRLK+ E ++S+  +  V  N KK K S +K  CW C Q G +KKDC N     
Subjt:  -----------------------------KETLSFVDVTSKLLSEERRLKS-EGRASQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCD

Query:  VSSSTSLHTDIGLAAM
         + S S  T++ L  M
Subjt:  VSSSTSLHTDIGLAAM

A0A6A2Y9V1 CCHC-type domain-containing protein2.8e-2856.83Show/hide
Query:  SSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERRLKS-EGRA
        SS    KS MS+E+WEE+D+R AS IRL LAKN+LANV   S+  ELWEKLE MYQA+S+SNRLYLKE+F+ L+      +VTSKL+SEERRLK+ E ++
Subjt:  SSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERRLKS-EGRA

Query:  SQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPN
        S+  +  V  N KK K S +K  CW C Q G +KKDC N
Subjt:  SQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPN

A0A6J1CG82 uncharacterized protein LOC1110105211.0e-4659.09Show/hide
Query:  GGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------------------
        GGSSRGSKKS+MS EDWEEMDLR ASAIR +LAKNILANVH ISTAKELWEKLEA+YQA+ ISNRLYLKEQF+ L                         
Subjt:  GGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML-------------------------

Query:  -------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKG-CWECRQSGRMKKDCPNR
                                       K+TL+F +VTSKLLSEERRLKSEGR S EDS LV SNWKKKK+SVQKK  CW C QSG MKKDCPNR
Subjt:  -------------------------------KETLSFVDVTSKLLSEERRLKSEGRASQEDSTLVASNWKKKKESVQKKG-CWECRQSGRMKKDCPNR

A0A6P9EJZ4 uncharacterized protein LOC1183461281.2e-3151.85Show/hide
Query:  ELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERR
        E++S    + +   +S MSDEDWE++DLR ASAIRL LAKN+LAN+HGISTAKELWEKLE +YQ + +SNR+YLKEQF+ L  +    +VTSKL SEERR
Subjt:  ELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERR

Query:  LKSEGRASQED-STLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSSTSLHTD
        L         + + +VA N KKK     K  CW C QSG +KK+CP       S S S++ D
Subjt:  LKSEGRASQED-STLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSSTSLHTD

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-1026.34Show/hide
Query:  KKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML--------------------------------
        K   M  EDW ++D R ASAIRL+L+ +++ N+    TA+ +W +LE++Y +++++N+LYLK+Q Y L                                
Subjt:  KKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYML--------------------------------

Query:  ------------------------KETLSFVDVTSKLLSEERRLK------------SEGRASQEDSTLV----ASNWKKKKESVQKKGCWECRQSGRMK
                                K T+   DVTS LL  E+  K              GR+ Q  S       A    K +   + + C+ C Q G  K
Subjt:  ------------------------KETLSFVDVTSKLLSEERRLK------------SEGRASQEDSTLV----ASNWKKKKESVQKKGCWECRQSGRMK

Query:  KDCPN
        +DCPN
Subjt:  KDCPN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAAACTCTTAGGGAATTGACTTCTCCAATTGGTGGTTCTAGTAGAGGTTCGAAGAAGTCCAACATGAGTGATGAAGATTGGGAAGAAATGGATTTGAGAGTTGC
AAGTGCAATCAGACTAAATTTGGCTAAGAACATTCTTGCAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGAAGCAATGTATCAGGCTAGGAGCA
TCTCGAATCGGTTGTACCTGAAAGAGCAGTTTTACATGCTGAAGGAAACTTTAAGTTTTGTTGATGTTACTAGTAAACTCTTATCAGAAGAAAGAAGACTGAAGAGTGAA
GGACGTGCTTCACAGGAAGATTCAACGCTAGTAGCTAGCAATTGGAAGAAGAAGAAGGAGTCCGTGCAGAAGAAAGGTTGTTGGGAATGCAGACAGTCTGGACGCATGAA
AAAGGATTGTCCTAACAGAGAAGGATGTGATGTTAGCAGTTCCACAAGTTTGCACACGGACATTGGCTTGGCAGCTATGCAAGGTGTGTGGTGGAAGTTATGTCGATGGT
TGAAGAACTTCCAGATGTCGCAAAAATCATCAGAAAAGCGTTGTGTTGATGAGGTCAATGAAACTGTAAGAGATTACAGGACACTACACATATCTCCACAATGGCTCACA
ACTTCTTTGTTTGTCCACTTTGGGCCTCATACCAATGGAGAATCCTCCCCTCTTCTTTTACCAAGAGTCCGTCAAATTCTCTGGGTTCCAACTCCAGGACAACTCCTTTA
TGACTACTTAGGCTCACAACTTCTTTGTTTGGTACCTGAGGATTTTATCCGACACGGCTATGTCTGGAGTATGGCTCTGATACCACATGTAAGGGATTACGGGACATTAC
ACATATCTTCACAATGGTATGATATTGTCCACTTTGGGCCTAAGCCCTCATGGTTTTGCTTTTGGTTCACTCCAAAAGGCCTCATACCAGTGGAGATAGTTGTCCTCCCT
TATAAACCCATGGTCATCCCCTTATCTAGCCGATGTGGGACTTTGGTCGCACTCCCAACAGAAACCGATGAGTTGAAAAGTGAGGGACCATGGCCTGACAAGAATGAGAC
TCTATTAGTTGACTTAATGGATGAATTGAAGAGGTTGCAAAGCGCAATCAACCGACCAACTACAACATTTGCAAGGACTAGTTGGAATTATATGAGAAACCAACTAAATA
CTAGTACTGGGTATGTTTATACTCATGAACAATTGAAAAATAACTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAAACTCTTAGGGAATTGACTTCTCCAATTGGTGGTTCTAGTAGAGGTTCGAAGAAGTCCAACATGAGTGATGAAGATTGGGAAGAAATGGATTTGAGAGTTGC
AAGTGCAATCAGACTAAATTTGGCTAAGAACATTCTTGCAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGAAGCAATGTATCAGGCTAGGAGCA
TCTCGAATCGGTTGTACCTGAAAGAGCAGTTTTACATGCTGAAGGAAACTTTAAGTTTTGTTGATGTTACTAGTAAACTCTTATCAGAAGAAAGAAGACTGAAGAGTGAA
GGACGTGCTTCACAGGAAGATTCAACGCTAGTAGCTAGCAATTGGAAGAAGAAGAAGGAGTCCGTGCAGAAGAAAGGTTGTTGGGAATGCAGACAGTCTGGACGCATGAA
AAAGGATTGTCCTAACAGAGAAGGATGTGATGTTAGCAGTTCCACAAGTTTGCACACGGACATTGGCTTGGCAGCTATGCAAGGTGTGTGGTGGAAGTTATGTCGATGGT
TGAAGAACTTCCAGATGTCGCAAAAATCATCAGAAAAGCGTTGTGTTGATGAGGTCAATGAAACTGTAAGAGATTACAGGACACTACACATATCTCCACAATGGCTCACA
ACTTCTTTGTTTGTCCACTTTGGGCCTCATACCAATGGAGAATCCTCCCCTCTTCTTTTACCAAGAGTCCGTCAAATTCTCTGGGTTCCAACTCCAGGACAACTCCTTTA
TGACTACTTAGGCTCACAACTTCTTTGTTTGGTACCTGAGGATTTTATCCGACACGGCTATGTCTGGAGTATGGCTCTGATACCACATGTAAGGGATTACGGGACATTAC
ACATATCTTCACAATGGTATGATATTGTCCACTTTGGGCCTAAGCCCTCATGGTTTTGCTTTTGGTTCACTCCAAAAGGCCTCATACCAGTGGAGATAGTTGTCCTCCCT
TATAAACCCATGGTCATCCCCTTATCTAGCCGATGTGGGACTTTGGTCGCACTCCCAACAGAAACCGATGAGTTGAAAAGTGAGGGACCATGGCCTGACAAGAATGAGAC
TCTATTAGTTGACTTAATGGATGAATTGAAGAGGTTGCAAAGCGCAATCAACCGACCAACTACAACATTTGCAAGGACTAGTTGGAATTATATGAGAAACCAACTAAATA
CTAGTACTGGGTATGTTTATACTCATGAACAATTGAAAAATAACTCATGA
Protein sequenceShow/hide protein sequence
MRKTLRELTSPIGGSSRGSKKSNMSDEDWEEMDLRVASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSISNRLYLKEQFYMLKETLSFVDVTSKLLSEERRLKSE
GRASQEDSTLVASNWKKKKESVQKKGCWECRQSGRMKKDCPNREGCDVSSSTSLHTDIGLAAMQGVWWKLCRWLKNFQMSQKSSEKRCVDEVNETVRDYRTLHISPQWLT
TSLFVHFGPHTNGESSPLLLPRVRQILWVPTPGQLLYDYLGSQLLCLVPEDFIRHGYVWSMALIPHVRDYGTLHISSQWYDIVHFGPKPSWFCFWFTPKGLIPVEIVVLP
YKPMVIPLSSRCGTLVALPTETDELKSEGPWPDKNETLLVDLMDELKRLQSAINRPTTTFARTSWNYMRNQLNTSTGYVYTHEQLKNNS