CuGenDBv2

Lag0021834 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021834
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr7:12800389..12805552
RNA-Seq Expression: Lag0021834
Synteny: Lag0021834
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
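The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts and the gene span computed. The `Region` type and `parse_location` function are hypothetical helpers written for this illustration, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval (hypothetical helper type for this example)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Region(seqid, int(start), int(end))


region = parse_location("chr7:12800389..12805552")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption, the gene spans 5,164 bp on chr7.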


Homology
GenBank top hits | e value | % identity | Alignment
KAF7810708.1 cytochrome p450 [Senna tora] | 9.9e-19 | 39.46
Query:  GRYTSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRR-----------------
        G+ +S  S +    S MSD DWEE+DLRAAS IRL LAKN+L NV GISTAKELW+KLEG+YQA+ ISNRL LKEQF+  R                   
Subjt:  GRYTSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRR-----------------

Query:  -------------------------------------------------KSGHIKKDWPNREGSPKGSGSDADIVSLVRGDNEFL
                                                         KSGH+KK+ P       GS  DA  VSLVRG+ +FL
Subjt:  -------------------------------------------------KSGHIKKDWPNREGSPKGSGSDADIVSLVRGDNEFL

KAF7812914.1 Zinc finger, CCHC-type [Senna tora] | 1.3e-18 | 56.14
Query:  SSRGSKMSR--MSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRRKSGHIKKDWPNREGSPKGSGSD
        SS+ SK S   MSD DWEE+DLRAAS IRL LAKN+L NV GISTAKELW+KLEG+YQA+ ISNRL LKEQF+     + G           S  GS  +
Subjt:  SSRGSKMSR--MSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRRKSGHIKKDWPNREGSPKGSGSD

Query:  ADIVSLVRGDNEFL
        A  VS   G   FL
Subjt:  ADIVSLVRGDNEFL

KAF7823890.1 cytochrome p450 [Senna tora] | 6.8e-20 | 53.78
Query:  GRYTSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRRKSGHIKKDWPNREGSPK
        G+ +S  S +    S MS  DWEE+DLRAAS I L LAKN+L NV  ISTAKELW+KLEG+YQA+ ISNRL LKEQF+  R  + G           S  
Subjt:  GRYTSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRRKSGHIKKDWPNREGSPK

Query:  GSGSDADIVSLVRGDNEFL
        GS  DA  VSL+RG+ +FL
Subjt:  GSGSDADIVSLVRGDNEFL

XP_022139673.1 uncharacterized protein LOC111010521 [Momordica charantia] | 7.8e-24 | 82.67
Query:  TSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        +SGGSSRGSK S MS EDWEEMDLRAASAIR +LAKNIL NVH ISTAKELWEKLE +YQA+ ISNRLYLKEQF+
Subjt:  TSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

XP_035543057.1 uncharacterized protein LOC118346128 [Juglans regia] | 9.9e-19 | 76.56
Query:  SRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        S MSDEDWE++DLRAASAIRL LAKN+L N+HGISTAKELWEKLE +YQ + +SNR+YLKEQF+
Subjt:  SRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

TrEMBL top hits | e value | % identity | Alignment
A0A2G2VS38 CCHC-type domain-containing protein | 9.0e-18 | 69.01
Query:  SSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        SS+ S+ SR+SDE+WEE+D++A S IRL LAK +LTNV G+ST KELWEKLE +YQ +SISNRLYLKEQF+
Subjt:  SSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

A0A444X3K2 Uncharacterized protein | 4.5e-17 | 71.21
Query:  KMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        K+S + DE+WEE+DLRAASAIRL LAKN+L NV G+  AKELW+KLEG+YQA+ ISNRL LKEQF+
Subjt:  KMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

A0A6J1CG82 uncharacterized protein LOC111010521 | 3.8e-24 | 82.67
Query:  TSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        +SGGSSRGSK S MS EDWEEMDLRAASAIR +LAKNIL NVH ISTAKELWEKLE +YQA+ ISNRLYLKEQF+
Subjt:  TSGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

A0A6P9EJZ4 uncharacterized protein LOC118346128 | 4.8e-19 | 76.56
Query:  SRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        S MSDEDWE++DLRAASAIRL LAKN+L N+HGISTAKELWEKLE +YQ + +SNR+YLKEQF+
Subjt:  SRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

V4T614 Uncharacterized protein (Fragment) | 9.0e-18 | 68.83
Query:  SGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQR
        S  S+ GS  + +SDEDWEE+D RAASAIRL LAKN+L NV  I TAKELWEKLE +YQ +SISNRLYLKEQF+  R
Subjt:  SGGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQR

SwissProt top hits | e value | % identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.3e-10 | 43.94
Query:  KMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY
        K   M  EDW ++D RAASAIRL+L+ +++ N+    TA+ +W +LE +Y +++++N+LYLK+Q Y
Subjt:  KMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY

Arabidopsis top hits | e value | % identity | Alignment
No hits found
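
The record does not say how the % identity column is computed. One common convention, assumed here, is the number of identical positions divided by the aligned length, where identical positions are the letters on the middle line of each alignment (`+` marks a conservative substitution and a space marks a mismatch). The sketch below applies that assumption to the Juglans regia hit; `percent_identity` is a hypothetical helper, and gapped or multi-segment alignments may be scored differently.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned positions whose midline character is a letter.

    Assumption: a letter on the midline marks an identical residue pair;
    '+' (similar) and ' ' (mismatch) do not count as identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Query and midline copied from the XP_035543057.1 [Juglans regia] alignment.
query = "SRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFY"
midline = "S MSDEDWE++DLRAASAIRL LAKN+L N+HGISTAKELWEKLE +YQ + +SNR+YLKEQF+"

value = percent_identity(midline, len(query))
print(f"{value:.2f}")
```

For this hit the midline carries 49 letters over 64 aligned positions, which agrees with the 76.56 listed in the table.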

Sequences
CDS sequence
ATGCATTCTAGTGAAAGTGTGAGGTCAGAGGACCTTGTGTGTCTAAAAGGAAAGTTAGGTGAATACAAGGTGCCTTGGATAAAAATGGTCGAGGGGTGGTATGTCACCAA
TGGATTAAGAGGAGCATTGAGTGCCTTGCATATGGCCAATGGGCGATGTGATGAGTATCTACGTTGTGAAAGTCATCGGTTGCCTTGGGTAATATGGCCAAGGGGCGATG
TACAGTTTGATTGTGAAGTCATCGGGTGCCTTGGATGTGAGCCACTTACTCAGTACCGTGGTTTTGTACTGACCCACCACCAGGTAAAATTTTGGGGGCGTTACACCAGT
GGTGGTTCTAGTAGAGGTTCGAAGATGTCTAGAATGAGTGATGAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGTGCAATCAGACTAAATTTGGCTAAGAACATTCT
TACAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGAAGGAATGTATCAGGCAAGGAGCATCTCGAATCGGTTGTACTTGAAGGAACAGTTTTACA
TGCAGCGAAGGAGGAAGTCTGGACACATAAAAAAGGATTGGCCTAACAGAGAAGGTTCGCCAAAGGGCTCTGGGTCAGATGCTGACATTGTCTCCCTCGTCAGGGGAGAC
AATGAATTCCTCCGAAGAAAGACAAAATTCATCCTCATGGTATGTCTGTTTTACCATGATAGAAGATGTGATGTTAACGGTTCCACAAGTTTGCACACGGGCATTGGCTT
GGCAGTTATGCAAGGTGTGTGGTGGAAGTTATGTCGATGGCTGAAGAACTTCCAGTACCCGAAAGTCTCACCAACCTATCATGGCTTCACATTTTTTACATCTCTGTTAA
CAACATCACTAGAGGCATCCAAAACTTTAGAACATCAGTTCAAATCGACTAAAGGGGCAATCCTCTCCTACAGACAAGCTCGAGCATACTACCAAATGGAGGCAATTGAC
AACCACAACGACAACGACAATGGCAACATGAACTTTACTAAGGGGATACCTGTTATAGTTGTTGCTATTTTTCACCAACACCTTGGAAGGATCTTCTTTGCAAATCACCA
TAGGGACACCACCACTTGA
mRNA sequence
ATGCATTCTAGTGAAAGTGTGAGGTCAGAGGACCTTGTGTGTCTAAAAGGAAAGTTAGGTGAATACAAGGTGCCTTGGATAAAAATGGTCGAGGGGTGGTATGTCACCAA
TGGATTAAGAGGAGCATTGAGTGCCTTGCATATGGCCAATGGGCGATGTGATGAGTATCTACGTTGTGAAAGTCATCGGTTGCCTTGGGTAATATGGCCAAGGGGCGATG
TACAGTTTGATTGTGAAGTCATCGGGTGCCTTGGATGTGAGCCACTTACTCAGTACCGTGGTTTTGTACTGACCCACCACCAGGTAAAATTTTGGGGGCGTTACACCAGT
GGTGGTTCTAGTAGAGGTTCGAAGATGTCTAGAATGAGTGATGAAGATTGGGAGGAAATGGATTTGAGAGCTGCAAGTGCAATCAGACTAAATTTGGCTAAGAACATTCT
TACAAATGTGCATGGAATTTCGACAGCCAAAGAGCTTTGGGAGAAGCTTGAAGGAATGTATCAGGCAAGGAGCATCTCGAATCGGTTGTACTTGAAGGAACAGTTTTACA
TGCAGCGAAGGAGGAAGTCTGGACACATAAAAAAGGATTGGCCTAACAGAGAAGGTTCGCCAAAGGGCTCTGGGTCAGATGCTGACATTGTCTCCCTCGTCAGGGGAGAC
AATGAATTCCTCCGAAGAAAGACAAAATTCATCCTCATGGTATGTCTGTTTTACCATGATAGAAGATGTGATGTTAACGGTTCCACAAGTTTGCACACGGGCATTGGCTT
GGCAGTTATGCAAGGTGTGTGGTGGAAGTTATGTCGATGGCTGAAGAACTTCCAGTACCCGAAAGTCTCACCAACCTATCATGGCTTCACATTTTTTACATCTCTGTTAA
CAACATCACTAGAGGCATCCAAAACTTTAGAACATCAGTTCAAATCGACTAAAGGGGCAATCCTCTCCTACAGACAAGCTCGAGCATACTACCAAATGGAGGCAATTGAC
AACCACAACGACAACGACAATGGCAACATGAACTTTACTAAGGGGATACCTGTTATAGTTGTTGCTATTTTTCACCAACACCTTGGAAGGATCTTCTTTGCAAATCACCA
TAGGGACACCACCACTTGA
Protein sequence
MHSSESVRSEDLVCLKGKLGEYKVPWIKMVEGWYVTNGLRGALSALHMANGRCDEYLRCESHRLPWVIWPRGDVQFDCEVIGCLGCEPLTQYRGFVLTHHQVKFWGRYTS
GGSSRGSKMSRMSDEDWEEMDLRAASAIRLNLAKNILTNVHGISTAKELWEKLEGMYQARSISNRLYLKEQFYMQRRRKSGHIKKDWPNREGSPKGSGSDADIVSLVRGD
NEFLRRKTKFILMVCLFYHDRRCDVNGSTSLHTGIGLAVMQGVWWKLCRWLKNFQYPKVSPTYHGFTFFTSLLTTSLEASKTLEHQFKSTKGAILSYRQARAYYQMEAID
NHNDNDNGNMNFTKGIPVIVVAIFHQHLGRIFFANHHRDTTT
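
The CDS and protein sequences above can be checked against each other by translation. The following is a minimal sketch assuming the standard genetic code (the record does not name a translation table). `translate` is a hypothetical helper, and only the first 18 and last 12 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, built in TCAG order (assumed table; not stated in the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


cds_start = "ATGCATTCTAGTGAAAGT"  # first 18 nt of the CDS sequence
cds_end = "ACCACCACTTGA"          # last 12 nt of the CDS sequence, in frame
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to `MHSSES` and `TTT`, matching the first six and last three residues of the protein sequence; the final `TGA` is the stop codon.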