; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002203 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002203
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold1:32888735..32897276
RNA-Seq ExpressionSpg002203
SyntenySpg002203
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040194.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2851.59Show/hide
Query:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL
        +STGFSSKPGG LVFS  LSI     S  + Y+     WPRGNR  LQTGELDK                  VEIELP+PDTLP SAE S ++SSTW+EL
Subjt:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL

Query:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT
        Y ESVH+  + +   + +  S          +  DDV WLHAIFRAK AGGPGGGVT
Subjt:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT

KAA0052218.1 uncharacterized protein E6C27_scaffold207G00290 [Cucumis melo var. makuwa]2.5e-2654.07Show/hide
Query:  STGFSSKPGGSLVFSGVLSILGCTSQIRSYIWPRGNRSSLQTGELDKTVEIELPVPDTLPASAERSRTSSSTWVELYIESVHIVNVLFN----------I
        ST F SK GG LVFS  LSIL           PRGNR +L TG+LDKTVEIELPVPDTLP SAE S+++SSTW+ELY ESVH+  + +            
Subjt:  STGFSSKPGGSLVFSGVLSILGCTSQIRSYIWPRGNRSSLQTGELDKTVEIELPVPDTLPASAERSRTSSSTWVELYIESVHIVNVLFN----------I

Query:  QEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVTLP
         EG    +  DD+ WLHA+F AK AG PGGGV  P
Subjt:  QEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVTLP

KAA0058280.1 uncharacterized protein E6C27_scaffold274G006090 [Cucumis melo var. makuwa]4.2e-2648.5Show/hide
Query:  STGFSSKPGGSLVFSGVLSILGC-TSQIRSYI---------------WPRGNRSSLQTGELDKT---------------VEIELPVPDTLPASAERSRTS
        STGFSSKPGG LVFS  LSIL    S  + Y+                PRGNR SLQT ELDKT               V IELPVPD LP SAE S ++
Subjt:  STGFSSKPGGSLVFSGVLSILGC-TSQIRSYI---------------WPRGNRSSLQTGELDKT---------------VEIELPVPDTLPASAERSRTS

Query:  SSTWVELYIESVHI-------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT
        SSTW+ELY ESVH+             ++++FN+  G    +  DDV WLHA+FRAK  GGPGGGVT
Subjt:  SSTWVELYIESVHI-------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT

TYK11835.1 uncharacterized protein E5676_scaffold152G00520 [Cucumis melo var. makuwa]2.4e-2953.29Show/hide
Query:  STGFSSKPGGSLVFSGVLSILGCTS-QIRSYIWPRGNRSSLQTGELDKTV---------------EIELPVPDTLPASAERSRTSSSTWVELYIESVHI-
        STGFSSKPGG LV S  LSIL       + Y+ PRGNR SLQTGELDKTV                IELPVPDTLP SAE S ++SSTW+ELY ESVH+ 
Subjt:  STGFSSKPGGSLVFSGVLSILGCTS-QIRSYIWPRGNRSSLQTGELDKTV---------------EIELPVPDTLPASAERSRTSSSTWVELYIESVHI-

Query:  ------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT
                    ++++FN+  G    +  DDV WLHA+FRAK  GGPGGGVT
Subjt:  ------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT

TYK25935.1 gag/pol protein [Cucumis melo var. makuwa]2.0e-2851.59Show/hide
Query:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL
        +STGFSSKPGG LVFS  LSI     S  + Y+     WPRGNR  LQTGELDK                  VEIELP+PDTLP SAE S ++SSTW+EL
Subjt:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL

Query:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT
        Y ESVH+  + +   + +  S          +  DDV WLHAIFRAK AGGPGGGVT
Subjt:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT

TrEMBL top hitse value%identityAlignment
A0A5A7T9P7 Gag/pol protein9.7e-2951.59Show/hide
Query:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL
        +STGFSSKPGG LVFS  LSI     S  + Y+     WPRGNR  LQTGELDK                  VEIELP+PDTLP SAE S ++SSTW+EL
Subjt:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL

Query:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT
        Y ESVH+  + +   + +  S          +  DDV WLHAIFRAK AGGPGGGVT
Subjt:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT

A0A5A7UAH6 CCHC-type domain-containing protein1.2e-2654.07Show/hide
Query:  STGFSSKPGGSLVFSGVLSILGCTSQIRSYIWPRGNRSSLQTGELDKTVEIELPVPDTLPASAERSRTSSSTWVELYIESVHIVNVLFN----------I
        ST F SK GG LVFS  LSIL           PRGNR +L TG+LDKTVEIELPVPDTLP SAE S+++SSTW+ELY ESVH+  + +            
Subjt:  STGFSSKPGGSLVFSGVLSILGCTSQIRSYIWPRGNRSSLQTGELDKTVEIELPVPDTLPASAERSRTSSSTWVELYIESVHIVNVLFN----------I

Query:  QEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVTLP
         EG    +  DD+ WLHA+F AK AG PGGGV  P
Subjt:  QEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVTLP

A0A5A7UT17 CCHC-type domain-containing protein2.0e-2648.5Show/hide
Query:  STGFSSKPGGSLVFSGVLSILGC-TSQIRSYI---------------WPRGNRSSLQTGELDKT---------------VEIELPVPDTLPASAERSRTS
        STGFSSKPGG LVFS  LSIL    S  + Y+                PRGNR SLQT ELDKT               V IELPVPD LP SAE S ++
Subjt:  STGFSSKPGGSLVFSGVLSILGC-TSQIRSYI---------------WPRGNRSSLQTGELDKT---------------VEIELPVPDTLPASAERSRTS

Query:  SSTWVELYIESVHI-------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT
        SSTW+ELY ESVH+             ++++FN+  G    +  DDV WLHA+FRAK  GGPGGGVT
Subjt:  SSTWVELYIESVHI-------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT

A0A5D3CJX3 CCHC-type domain-containing protein1.1e-2953.29Show/hide
Query:  STGFSSKPGGSLVFSGVLSILGCTS-QIRSYIWPRGNRSSLQTGELDKTV---------------EIELPVPDTLPASAERSRTSSSTWVELYIESVHI-
        STGFSSKPGG LV S  LSIL       + Y+ PRGNR SLQTGELDKTV                IELPVPDTLP SAE S ++SSTW+ELY ESVH+ 
Subjt:  STGFSSKPGGSLVFSGVLSILGCTS-QIRSYIWPRGNRSSLQTGELDKTV---------------EIELPVPDTLPASAERSRTSSSTWVELYIESVHI-

Query:  ------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT
                    ++++FN+  G    +  DDV WLHA+FRAK  GGPGGGVT
Subjt:  ------------VNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVT

A0A5D3DR18 Gag/pol protein9.7e-2951.59Show/hide
Query:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL
        +STGFSSKPGG LVFS  LSI     S  + Y+     WPRGNR  LQTGELDK                  VEIELP+PDTLP SAE S ++SSTW+EL
Subjt:  LSTGFSSKPGGSLVFSGVLSILGC-TSQIRSYI-----WPRGNRSSLQTGELDK-----------------TVEIELPVPDTLPASAERSRTSSSTWVEL

Query:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT
        Y ESVH+  + +   + +  S          +  DDV WLHAIFRAK AGGPGGGVT
Subjt:  YIESVHIVNVLFNIQEGQRVS----------LEEDDVRWLHAIFRAKLAGGPGGGVT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATGCCTTTCACCCCACCCCAGGGGTCCTATATATAGTAAAGGCCAAGACCTTTTTCCTAATAGGAAAAGCATTCCCTTCCCATTCCCTATCTCTATAAGACTCCC
ACAAGTCGTTCTTGGCCCGGAGAATAGTAGGGAAGACTCTAGAGGTTGTCCAAGGAGGATTGGAGAAGAAAACACCACAAGGAGTTCCACACAAGGCATAAAACCCAACA
ATGCGAAGTGGTGTCCCACCAAAGGCGAGCCTAATCAAGAGAAGCCAGCGTTGAGAGCCCCTCAAGCGCTAGGGCAAGCTGTCAAGCAAAGGGCGCGTGTCTTTGCGCTC
ATAAAAGAGAAGGATGAAGTTGGTGATGTCGTAGCGGCAGTAAGTGTTGGAATCAAGCATGCTAAATCGATTCTCATTAGTTATCTAGGCCTTAGAAGCATTGCCCAATC
GCTAGGAGTCCTTAAAATGCTAATTAAGTGTTATTTACTTGTTGTCCAATGGCTGAAAAGCCTTGAATATAGCCATTCGGTGAATACCCATTGTCCGTTGAGTTATTTGG
ATCTTTTGAGCGGTTTAAGGAGTTCTTTAGGAGCGCTCGTAAGCATTTTAGTCAAGTTGCATAGTCATGGTGCTCATAGAAAGGTGAGCTGGGTGTGTGTGCGGAGCGTG
ATGCGGAAATCACGTGTTAAGATTGCGCTACATGACGCGGAAATCATGTTATTGGTGCGGAGCGTGACGCGTAAATCACGTCACTTTAAGGAGAGAGTGAAGGAGGAAGA
AGAAGAAGAATTTCGTGGTTTTGAGGTTGAAGAAGATAAGAAAAGCTCATGCAAAGCCAGCCATAGCAGAATCTGGGTGCAAAACAGGAAACCTCTGAGCACAGGATTTT
CGAGCAAACCAGGAGGATCTCTGGTGTTCTCTGGTGTTTTGAGCATTCTGGGGTGTACAAGTCAAATCAGAAGCTATATTTGGCCAAGAGGAAATAGGTCGAGTCTACAA
ACCGGGGAACTAGATAAGACAGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAGCGTCTGCTGAACGTTCCAGAACAAGCTCCAGTACGTGGGTGGAGTTGTATAT
TGAGTCTGTTCATATTGTAAATGTTTTGTTCAACATTCAGGAGGGTCAGCGGGTATCGTTAGAGGAGGACGATGTCCGTTGGCTTCACGCCATCTTTCGAGCTAAGCTAG
CAGGTGGTCCGGGAGGGGGTGTGACATTACCAGTACCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGATGCCTTTCACCCCACCCCAGGGGTCCTATATATAGTAAAGGCCAAGACCTTTTTCCTAATAGGAAAAGCATTCCCTTCCCATTCCCTATCTCTATAAGACTCCC
ACAAGTCGTTCTTGGCCCGGAGAATAGTAGGGAAGACTCTAGAGGTTGTCCAAGGAGGATTGGAGAAGAAAACACCACAAGGAGTTCCACACAAGGCATAAAACCCAACA
ATGCGAAGTGGTGTCCCACCAAAGGCGAGCCTAATCAAGAGAAGCCAGCGTTGAGAGCCCCTCAAGCGCTAGGGCAAGCTGTCAAGCAAAGGGCGCGTGTCTTTGCGCTC
ATAAAAGAGAAGGATGAAGTTGGTGATGTCGTAGCGGCAGTAAGTGTTGGAATCAAGCATGCTAAATCGATTCTCATTAGTTATCTAGGCCTTAGAAGCATTGCCCAATC
GCTAGGAGTCCTTAAAATGCTAATTAAGTGTTATTTACTTGTTGTCCAATGGCTGAAAAGCCTTGAATATAGCCATTCGGTGAATACCCATTGTCCGTTGAGTTATTTGG
ATCTTTTGAGCGGTTTAAGGAGTTCTTTAGGAGCGCTCGTAAGCATTTTAGTCAAGTTGCATAGTCATGGTGCTCATAGAAAGGTGAGCTGGGTGTGTGTGCGGAGCGTG
ATGCGGAAATCACGTGTTAAGATTGCGCTACATGACGCGGAAATCATGTTATTGGTGCGGAGCGTGACGCGTAAATCACGTCACTTTAAGGAGAGAGTGAAGGAGGAAGA
AGAAGAAGAATTTCGTGGTTTTGAGGTTGAAGAAGATAAGAAAAGCTCATGCAAAGCCAGCCATAGCAGAATCTGGGTGCAAAACAGGAAACCTCTGAGCACAGGATTTT
CGAGCAAACCAGGAGGATCTCTGGTGTTCTCTGGTGTTTTGAGCATTCTGGGGTGTACAAGTCAAATCAGAAGCTATATTTGGCCAAGAGGAAATAGGTCGAGTCTACAA
ACCGGGGAACTAGATAAGACAGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAGCGTCTGCTGAACGTTCCAGAACAAGCTCCAGTACGTGGGTGGAGTTGTATAT
TGAGTCTGTTCATATTGTAAATGTTTTGTTCAACATTCAGGAGGGTCAGCGGGTATCGTTAGAGGAGGACGATGTCCGTTGGCTTCACGCCATCTTTCGAGCTAAGCTAG
CAGGTGGTCCGGGAGGGGGTGTGACATTACCAGTACCGTGA
Protein sequenceShow/hide protein sequence
MGCLSPHPRGPIYSKGQDLFPNRKSIPFPFPISIRLPQVVLGPENSREDSRGCPRRIGEENTTRSSTQGIKPNNAKWCPTKGEPNQEKPALRAPQALGQAVKQRARVFAL
IKEKDEVGDVVAAVSVGIKHAKSILISYLGLRSIAQSLGVLKMLIKCYLLVVQWLKSLEYSHSVNTHCPLSYLDLLSGLRSSLGALVSILVKLHSHGAHRKVSWVCVRSV
MRKSRVKIALHDAEIMLLVRSVTRKSRHFKERVKEEEEEEFRGFEVEEDKKSSCKASHSRIWVQNRKPLSTGFSSKPGGSLVFSGVLSILGCTSQIRSYIWPRGNRSSLQ
TGELDKTVEIELPVPDTLPASAERSRTSSSTWVELYIESVHIVNVLFNIQEGQRVSLEEDDVRWLHAIFRAKLAGGPGGGVTLPVP