; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038621 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038621
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr2:21745313..21750165
RNA-Seq ExpressionLag0038621
SyntenyLag0038621
Gene Ontology termsGO:0006367 - transcription initiation from RNA polymerase II promoter (biological process)
GO:0006413 - translational initiation (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0005672 - transcription factor TFIIA complex (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003743 - translation initiation factor activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF5758504.1 putative RNA-directed DNA polymerase [Helianthus annuus]9.9e-4971.24Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +DLRAASAIRL LAKN+LANVHGISTAK+LWEKLE +YQ + I NRLYLKEQF+TLRM+  TKISDHLSVLN+I+S+LE I VK+EDEDK L  ILSL +
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK
        S EHMK ILMYGKETL + DVT KLLS+E+RL S G TS E + L+  N KKK
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK

KAF5765959.1 putative RNA-directed DNA polymerase [Helianthus annuus]1.3e-4871.24Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +DLRAASAIRL LAKN+LANVHGISTAK+LWEKLE +YQ + I NRLYLKEQF+TLRM+  TKISDHLSVLN+I+S+LE I VK+EDEDK L  ILSL +
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK
        S EHMK ILMYGKETL + DVT KLLS+E+RL S G TS E + L+  N KKK
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK

KAG7577502.1 F-box associated domain type 1 [Arabidopsis thaliana x Arabidopsis arenosa]1.4e-4259.66Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +DLRAASAIRL LAKNILANVHGISTAKELWEKLE +YQA+ + NR+YLKE+F+TLRM EGT +SDHLSVLN I+S+LE I VK++DED  L  I SLP+
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSC
        S EHMK IL++GKE + F +VTSKL S+E+RL +       +S LVA N +KK+  M+KK       +  H   +C
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSC

PON56879.1 hypothetical protein TorRG33x02_294980 [Trema orientale]1.4e-4265.36Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +D RAASAIR+ LAKN+LANV GI+TAK+LW KLE +YQA+ + NR+YLKEQF+TLRM EGTKISDHLSVLN I+S+LE I VKIEDEDK L FI S+P 
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK
        S EHMK IL++GKET+ F++VTSKLLS+ERRL   G       + +A N +KK
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia]1.9e-6074.03Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        MDLRAASAIR +LAKNILANVH ISTAKELWEKLEA+YQA+ I NRLYLKEQF+TL+MEEG KISDHLS LN II +LE IEVKI+DEDK L  ILSLP 
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSCYARCV
        S EHMK ILMYGK+TLNF +VTSKLLS+ERRLKSEGRTSHEDS LV SNWKKKK+S+QKK       +  H    C  R V
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSCYARCV

TrEMBL top hitse value%identityAlignment
A0A2P5C765 Uncharacterized protein6.7e-4365.36Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +D RAASAIR+ LAKN+LANV GI+TAK+LW KLE +YQA+ + NR+YLKEQF+TLRM EGTKISDHLSVLN I+S+LE I VKIEDEDK L FI S+P 
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK
        S EHMK IL++GKET+ F++VTSKLLS+ERRL   G       + +A N +KK
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKK

A0A6A3BK59 CCHC-type domain-containing protein2.6e-4263.12Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +D+RAAS IRL LAKN+LANV   S+ KELWEKLE MYQA+S+ NRLYLKE+F+ L+MEEGTKISDHLS LN I+S+LE I V+I+DEDK L  I SLP+
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKS-EGRTSHEDSTLVASNWKKKKESMQK
        S EHM+ +LMYGKE +NF++VTSKL+S+ERRLK+ E ++S   +  V  N KK K S +K
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKS-EGRTSHEDSTLVASNWKKKKESMQK

A0A6A3CWI3 CCHC-type domain-containing protein4.4e-4263.12Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +D+RAAS IRL LAKN+LANV   S+ KELWEKLE MYQA+S+ NRLYLKE+F+ L+MEEGTKISDHLS LN I+S+LE I V I+DEDK L  I SLP+
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKS-EGRTSHEDSTLVASNWKKKKESMQK
        S EHM+ +LMYGKE +NF++VTSKL+S+ERRLK+ E ++S   +  V  N KK K S +K
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKS-EGRTSHEDSTLVASNWKKKKESMQK

A0A6A3DA47 CCHC-type domain-containing protein4.4e-4263.12Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +D+RAAS IRL LAKN+LANV   S+ KELWEKLE MYQA+S+ NRLYLKE+F+ L+MEEGTKISDHLS LN I+S+LE I V+I+DEDK L  I SLP+
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKS-EGRTSHEDSTLVASNWKKKKESMQK
        S EHM+ +LMYGKE +NF++VTSKL+S+ERRLK+ E ++S   +  V  N KK K S +K
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKS-EGRTSHEDSTLVASNWKKKKESMQK

A0A6J1CG82 uncharacterized protein LOC1110105219.3e-6174.03Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        MDLRAASAIR +LAKNILANVH ISTAKELWEKLEA+YQA+ I NRLYLKEQF+TL+MEEG KISDHLS LN II +LE IEVKI+DEDK L  ILSLP 
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSCYARCV
        S EHMK ILMYGK+TLNF +VTSKLLS+ERRLKSEGRTSHEDS LV SNWKKKK+S+QKK       +  H    C  R V
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSCYARCV

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.8e-0625.6Show/hide
Query:  TAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPTSNEH-MKLILMYGKETLNFTDVTSK
        TA+++ E L+A+Y+ +S+ ++L L+++  +L++     +  H  + +++IS+L     KIE+ DK    +++LP+  +  +  I    +E L    V ++
Subjt:  TAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPTSNEH-MKLILMYGKETLNFTDVTSK

Query:  LLSKERRLKSEGRTSHEDSTLVASN
        LL +E ++K++    H D++    N
Subjt:  LLSKERRLKSEGRTSHEDSTLVASN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-2335.08Show/hide
Query:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT
        +D RAASAIRL+L+ +++ N+    TA+ +W +LE++Y ++++ N+LYLK+Q Y L M EGT    HL+V N +I++L  + VKIE+EDK ++ + SLP+
Subjt:  MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPT

Query:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLK----------SEGR-----TSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSC
        S +++   +++GK T+   DVTS LL  E+  K          +EGR      S  +     +  K K  S  +     N ++  H    C
Subjt:  SNEHMKLILMYGKETLNFTDVTSKLLSKERRLK----------SEGR-----TSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSC

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGAGAGCTGCAAGTGCAATCAGATTAAATTTGGCTAAGAACATTCTTGCAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGAAGCAAT
GTATCAGGCAAGGAGCATCTTGAATCGGTTGTACCTGAAGGAGCAGTTTTACACGTTGCGAATGGAGGAAGGTACGAAAATCTCAGATCATCTGAGTGTTCTCAATGACA
TCATTTCGAAGCTGGAGGTGATCGAAGTTAAGATAGAGGATGAGGATAAGACACTCATGTTTATCTTGTCACTTCCAACTTCTAATGAACACATGAAGCTAATCTTGATG
TACGGGAAGGAAACTTTAAATTTTACTGATGTTACTAGTAAACTCTTATCAAAAGAAAGAAGGCTGAAGAGTGAAGGGCGTACTTCACATGAGGATTCAACACTAGTAGC
TAGCAATTGGAAGAAGAAGAAAGAGTCCATGCAGAAGAAAGGGGAGATAGTGAATTCCTCTGAAGAAAGACATAATTCATCCTCATGTTATGCAAGGTGTGTGGTGGAAG
TTATGTCGATGGCTAAAGAACTTCCAGATTCTGCTTCCAGACGCAACAGCGTCGAGACGCTGTCTCGATACCACAGGCGCCAGACATGGAAAGCGCACAGCGTCGAGACG
CTATCAACACAGCGTCGAGACGCTGTCACGATGGAGGCGCGCATTAGGATTTCAAAAGGCGCGGTTCAAGTGCAGGCCGGTCCGGTTCTGACCGATTCAGCTGGGCTGGG
ACCTATTTGGTCCGGTTCAGCCAATTTTTGGCATGTTGAGGCCGAACCTTCTCAGATACAGAGGGGGAAGAGGCTAAAGGCGAGGATAGAGGCTCTCTCTCTTTGCTTTT
TTTTTTCCTCTACCTCTAATTTGTCTGAACAAGCCAAAGAGGGGAAAACTCTTCTTTCTCTCTCATTCTCTTCTTTTGGGTATGGTGGTGGAGACTCGAGTGTTGACATG
GAGGAGGGGGTTCAAGTCCCTTCAGGCGCTGACAGCCCATGGTTTTCTTTTATCCTTCACGCTTGCTCTTCCTCTCAAGCTTCGACCATGTTGTCCCTTACTCTTGATGG
TTTGTTGCTTTCTCTCAATTTCTTCTTGAACTTTTGTGTTGTTTGCAGAAAATTAGTGGAACCCATTTATATCGCCGCCACTACCGAGAAGAATGACCGCAACGAATTGT
AA
mRNA sequenceShow/hide mRNA sequence
ATGGATTTGAGAGCTGCAAGTGCAATCAGATTAAATTTGGCTAAGAACATTCTTGCAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGAAGCAAT
GTATCAGGCAAGGAGCATCTTGAATCGGTTGTACCTGAAGGAGCAGTTTTACACGTTGCGAATGGAGGAAGGTACGAAAATCTCAGATCATCTGAGTGTTCTCAATGACA
TCATTTCGAAGCTGGAGGTGATCGAAGTTAAGATAGAGGATGAGGATAAGACACTCATGTTTATCTTGTCACTTCCAACTTCTAATGAACACATGAAGCTAATCTTGATG
TACGGGAAGGAAACTTTAAATTTTACTGATGTTACTAGTAAACTCTTATCAAAAGAAAGAAGGCTGAAGAGTGAAGGGCGTACTTCACATGAGGATTCAACACTAGTAGC
TAGCAATTGGAAGAAGAAGAAAGAGTCCATGCAGAAGAAAGGGGAGATAGTGAATTCCTCTGAAGAAAGACATAATTCATCCTCATGTTATGCAAGGTGTGTGGTGGAAG
TTATGTCGATGGCTAAAGAACTTCCAGATTCTGCTTCCAGACGCAACAGCGTCGAGACGCTGTCTCGATACCACAGGCGCCAGACATGGAAAGCGCACAGCGTCGAGACG
CTATCAACACAGCGTCGAGACGCTGTCACGATGGAGGCGCGCATTAGGATTTCAAAAGGCGCGGTTCAAGTGCAGGCCGGTCCGGTTCTGACCGATTCAGCTGGGCTGGG
ACCTATTTGGTCCGGTTCAGCCAATTTTTGGCATGTTGAGGCCGAACCTTCTCAGATACAGAGGGGGAAGAGGCTAAAGGCGAGGATAGAGGCTCTCTCTCTTTGCTTTT
TTTTTTCCTCTACCTCTAATTTGTCTGAACAAGCCAAAGAGGGGAAAACTCTTCTTTCTCTCTCATTCTCTTCTTTTGGGTATGGTGGTGGAGACTCGAGTGTTGACATG
GAGGAGGGGGTTCAAGTCCCTTCAGGCGCTGACAGCCCATGGTTTTCTTTTATCCTTCACGCTTGCTCTTCCTCTCAAGCTTCGACCATGTTGTCCCTTACTCTTGATGG
TTTGTTGCTTTCTCTCAATTTCTTCTTGAACTTTTGTGTTGTTTGCAGAAAATTAGTGGAACCCATTTATATCGCCGCCACTACCGAGAAGAATGACCGCAACGAATTGT
AA
Protein sequenceShow/hide protein sequence
MDLRAASAIRLNLAKNILANVHGISTAKELWEKLEAMYQARSILNRLYLKEQFYTLRMEEGTKISDHLSVLNDIISKLEVIEVKIEDEDKTLMFILSLPTSNEHMKLILM
YGKETLNFTDVTSKLLSKERRLKSEGRTSHEDSTLVASNWKKKKESMQKKGEIVNSSEERHNSSSCYARCVVEVMSMAKELPDSASRRNSVETLSRYHRRQTWKAHSVET
LSTQRRDAVTMEARIRISKGAVQVQAGPVLTDSAGLGPIWSGSANFWHVEAEPSQIQRGKRLKARIEALSLCFFFSSTSNLSEQAKEGKTLLSLSFSSFGYGGGDSSVDM
EEGVQVPSGADSPWFSFILHACSSSQASTMLSLTLDGLLLSLNFFLNFCVVCRKLVEPIYIAATTEKNDRNEL