CuGenDBv2

Lag0028587 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028587
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: protein CROWDED NUCLEI 3
Genome location: chr8:25875303..25878047
RNA-Seq Expression: Lag0028587
Synteny: Lag0028587
Gene Ontology terms: GO:0006997 - nucleus organization (biological process)
                     GO:0005652 - nuclear lamina (cellular component)
InterPro domains: IPR040418 - Protein crowded nuclei


Homology
GenBank top hits (e value, % identity, alignment)
KAG7032453.1 Protein CROWDED NUCLEI 2 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 9.7e-26 | % identity: 91.43
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHNLGL+L+EKK WASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
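The % identity column can be checked against the alignment itself. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch) and divides the number of identical positions by the alignment length. The function name percent_identity is hypothetical and is not part of CuGenDB; the input strings are the Query and midline of the KAG7032453.1 alignment above.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Assumption: letters in the midline mark identical residues,
    while '+' and spaces mark non-identical positions.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query and midline of the first GenBank hit (KAG7032453.1)
query = "DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR"
midline = "DYQHNLGL+L+EKK WASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +"

# 64 identical positions out of 70 aligned columns
print(percent_identity(midline, len(query)))  # prints 91.43
```

The result agrees with the 91.43 listed for this hit. The alignment length is taken from the Query line rather than from the midline, so the calculation still works if trailing spaces in the midline were trimmed.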

XP_004147138.1 protein CROWDED NUCLEI 3 [Cucumis sativus] | e value: 2.6e-26 | % identity: 85.33
Query:  IDYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYTWF
        +DYQHNLGLLLIEKK+WASK+D+LGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +  ++
Subjt:  IDYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYTWF

XP_022930031.1 protein CROWDED NUCLEI 1-like isoform X1 [Cucurbita moschata] | e value: 9.7e-26 | % identity: 91.43
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHNLGL+L+EKK WASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

XP_038907101.1 protein CROWDED NUCLEI 1 isoform X1 [Benincasa hispida] | e value: 4.4e-26 | % identity: 92.86
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHNLGLLLIEKK+WASKY+QLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

XP_038907102.1 protein CROWDED NUCLEI 1 isoform X2 [Benincasa hispida] | e value: 4.4e-26 | % identity: 92.86
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHNLGLLLIEKK+WASKY+QLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CSZ3 protein CROWDED NUCLEI 3 | e value: 1.2e-24 | % identity: 88.57
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHN+GLLLIEKK+WA K+DQL QDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

A0A5D3BM77 Protein CROWDED NUCLEI 3 | e value: 1.2e-24 | % identity: 88.57
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHN+GLLLIEKK+WA K+DQL QDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

A0A6J1E0M3 protein CROWDED NUCLEI 1-like isoform X2 | e value: 3.4e-24 | % identity: 70.97
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYTWF------VERLLMIEKILGD
        DYQHNLGLLL+EKKEWASKYD+LGQ+LAETEEI KREQSAH+IALSEVETR DNLKKALAAEKQ+V S +  ++       E  L  EK L D
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYTWF------VERLLMIEKILGD

A0A6J1EPS4 protein CROWDED NUCLEI 1-like isoform X1 | e value: 4.7e-26 | % identity: 91.43
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHNLGL+L+EKK WASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

A0A6J1K004 protein CROWDED NUCLEI 1-like isoform X1 | e value: 4.7e-26 | % identity: 91.43
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR
        DYQHNLGL+L+EKK WASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV S +
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFR

SwissProt top hits (e value, % identity, alignment)
A0A166B1A6 Nuclear matrix constituent protein 1 | e value: 4.7e-15 | % identity: 60.61
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        DYQ+N+GLLLIEKKEW SK+++L Q   ET++  K+EQ AHLIA+S+ E R +NL KAL  EKQ V
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

F4HRT5 Protein CROWDED NUCLEI 1 | e value: 2.0e-13 | % identity: 56.25
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQ
        +YQH++GLLLIEKKEW+S+Y+ L Q   E  E  K+E++AHLIA+++VE R + L+KAL  EKQ
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQ

I0J0E7 Nuclear matrix constituent protein 1 | e value: 2.5e-16 | % identity: 63.64
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        +YQ+N+GLLLIEKKEW+S ++++   LAE EEI KREQ+AH+IAL+E E R DNL+KAL  EKQ V
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

Q0DY81 Nuclear matrix constituent protein 1a | e value: 8.0e-15 | % identity: 59.09
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        +YQ+N+GLLLIEKKEW +K D++ Q L + EEI KREQ+AHL A+SE E R ++++KAL  EKQ V
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

Q9SAF6 Protein CROWDED NUCLEI 2 | e value: 6.8e-14 | % identity: 61.54
Query:  YQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        YQHN+GLLL+E KE  SK++QL Q   E +EI KREQS+HL AL+ VE R +NL+KAL  EKQ V
Subjt:  YQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

Arabidopsis top hits (e value, % identity, alignment)
AT1G13220.1 nuclear matrix constituent protein-related | e value: 4.8e-15 | % identity: 61.54
Query:  YQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        YQHN+GLLL+E KE  SK++QL Q   E +EI KREQS+HL AL+ VE R +NL+KAL  EKQ V
Subjt:  YQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

AT1G13220.2 nuclear matrix constituent protein-related | e value: 4.8e-15 | % identity: 61.54
Query:  YQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        YQHN+GLLL+E KE  SK++QL Q   E +EI KREQS+HL AL+ VE R +NL+KAL  EKQ V
Subjt:  YQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

AT1G67230.1 little nuclei1 | e value: 1.4e-14 | % identity: 56.25
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQ
        +YQH++GLLLIEKKEW+S+Y+ L Q   E  E  K+E++AHLIA+++VE R + L+KAL  EKQ
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQ

AT1G68790.1 little nuclei3 | e value: 1.0e-12 | % identity: 53.03
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV
        DYQHN+GLLLIEKK+W S  ++L Q   E  E+ KRE++++ I L+E + R +NL+KAL  EKQ V
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHV

AT5G65770.1 little nuclei4 | e value: 3.0e-09 | % identity: 40.28
Query:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYT
        DYQHN+GLLL+EK E +S+Y+++   + E++    RE+SA++ AL+E + R ++LKK +   K+ + S   T
Subjt:  DYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYT


Sequences
CDS sequence
ATGTTCATTGATTACCAGCACAATTTAGGACTTCTTTTGATAGAGAAGAAAGAGTGGGCTTCAAAGTATGACCAACTAGGGCAAGATTTAGCAGAAACTGAGGAGATCTT
CAAACGTGAACAATCAGCACATTTAATTGCACTATCCGAAGTTGAAACGAGGAGGGATAATTTGAAGAAAGCTCTAGCTGCTGAGAAGCAACATGTGTTTAGTTTTCGCT
ACACATGGTTTGTAGAAAGATTGTTGATGATTGAAAAGATCCTTGGGGATTTCTTTTGGGAAGGTGCAAAGGAAGATGGTGGAATGCATAATGTGAACAGGGCAAGAACT
CGTCAAGCTAAATGGATATGGCAGGCTTCTGAAAACAAATTCACAATGACCATCCTGAAGCTTTACAAACTTTCTGATTGTGTGGTGGGATTTCTTGAACATTTTTTGGA
AGGTGTAGGGCTCTCCCTCAATTCCTCCAAATCTTCTATAGTTGGAATCGATGTCGAGGAGGTGGAGGTAATTCAGCAAGCTGTCCGTTTGGGATGTTGTCATCTTGCTA
AATGGATTTGGACTTCTCTTCCTCTCGAATATGGGGGCCTTGGTATTGGTTCATTGAAACAGAGGAATACTGCTCTTCTCATTAAATGGTTGTGGAGATTTGCTCAAGAA
GAGCAAGCTTTATGGAGAAAGGTAGTTGATAGTACCTATACGGCAGTGAGCCAAGTGGTTGGATGTCTCTTCCCCCAAAAGGGACCTCAAGGGCAAGACCGTGGTTGA
mRNA sequence
ATGTTCATTGATTACCAGCACAATTTAGGACTTCTTTTGATAGAGAAGAAAGAGTGGGCTTCAAAGTATGACCAACTAGGGCAAGATTTAGCAGAAACTGAGGAGATCTT
CAAACGTGAACAATCAGCACATTTAATTGCACTATCCGAAGTTGAAACGAGGAGGGATAATTTGAAGAAAGCTCTAGCTGCTGAGAAGCAACATGTGTTTAGTTTTCGCT
ACACATGGTTTGTAGAAAGATTGTTGATGATTGAAAAGATCCTTGGGGATTTCTTTTGGGAAGGTGCAAAGGAAGATGGTGGAATGCATAATGTGAACAGGGCAAGAACT
CGTCAAGCTAAATGGATATGGCAGGCTTCTGAAAACAAATTCACAATGACCATCCTGAAGCTTTACAAACTTTCTGATTGTGTGGTGGGATTTCTTGAACATTTTTTGGA
AGGTGTAGGGCTCTCCCTCAATTCCTCCAAATCTTCTATAGTTGGAATCGATGTCGAGGAGGTGGAGGTAATTCAGCAAGCTGTCCGTTTGGGATGTTGTCATCTTGCTA
AATGGATTTGGACTTCTCTTCCTCTCGAATATGGGGGCCTTGGTATTGGTTCATTGAAACAGAGGAATACTGCTCTTCTCATTAAATGGTTGTGGAGATTTGCTCAAGAA
GAGCAAGCTTTATGGAGAAAGGTAGTTGATAGTACCTATACGGCAGTGAGCCAAGTGGTTGGATGTCTCTTCCCCCAAAAGGGACCTCAAGGGCAAGACCGTGGTTGA
Protein sequence
MFIDYQHNLGLLLIEKKEWASKYDQLGQDLAETEEIFKREQSAHLIALSEVETRRDNLKKALAAEKQHVFSFRYTWFVERLLMIEKILGDFFWEGAKEDGGMHNVNRART
RQAKWIWQASENKFTMTILKLYKLSDCVVGFLEHFLEGVGLSLNSSKSSIVGIDVEEVEVIQQAVRLGCCHLAKWIWTSLPLEYGGLGIGSLKQRNTALLIKWLWRFAQE
EQALWRKVVDSTYTAVSQVVGCLFPQKGPQGQDRG
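
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration that assumes the standard nuclear genetic code; the names CODON_TABLE and translate are hypothetical helpers, not CuGenDB tools. It is run here on the first 54 bases and the last 12 bases of the CDS listed above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 54 bases of the CDS sequence
cds_start = "ATGTTCATTGATTACCAGCACAATTTAGGACTTCTTTTGATAGAGAAGAAAGAG"
print(translate(cds_start))  # prints MFIDYQHNLGLLLIEKKE

# Last 12 bases of the CDS sequence, ending in the TGA stop codon
print(translate("GACCGTGGTTGA"))  # prints DRG
```

Both results match the start (MFIDYQHNLGLLLIEKKE) and the end (DRG) of the protein sequence. Passing the full CDS, with or without line breaks, should reproduce the whole protein in the same way.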