; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014966 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014966
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionnuclear pore complex protein NUP1
Genome locationtig00002486:575132..593056
RNA-Seq ExpressionSgr014966
SyntenySgr014966
Gene Ontology termsGO:0005622 - intracellular (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602242.1 Nuclear pore complex protein NUP1, partial [Cucurbita argyrosperma subsp. sororia]1.9e-1945.03Show/hide
Query:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF
        S   DFV++DEE  SNGP TDISF+RREK+DG LVA+SKPSDTEAIT             +  P       P   SE N    Q  + +P  +++  + F
Subjt:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF

Query:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
         F             AS PSIT NA  PES  RPE             P FGFGDKLPSQKE  +SAPTFA
Subjt:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

KAG7032922.1 Nuclear pore complex protein NUP1 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-1945.03Show/hide
Query:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF
        S   DFV++DEE  SNGP TDISF+RREK+DG LVA+SKPSDTEAIT             +  P       P   SE N    Q  + +P  +++  + F
Subjt:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF

Query:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
         F             AS PSIT NA  PES  RPE             P FGFGDKLPSQKE  +SAPTFA
Subjt:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

XP_022133602.1 nuclear pore complex protein NUP1 [Momordica charantia]1.7e-2042.47Show/hide
Query:  SSSFPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQH
        SS  PS+  +W    S   DFV+M+EE YSNGPATDIS  RREK+DGPLVAVSKPSDT+  T   D     +  +   P++E N +  +      A  + 
Subjt:  SSSFPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQH

Query:  ATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
        + +           F F             AS PSITA+ + PE ASR E             P FGFGDK P QKELI SAPTFA
Subjt:  ATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

XP_022990208.1 nuclear pore complex protein NUP1-like [Cucurbita maxima]5.5e-1944.77Show/hide
Query:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEI-MLN
        S   DFV++DEE  SNGP TDISF+RREK+DG LVA+SKP+DTEAIT             +  P       P   SE N    Q  + +P  + +  +L+
Subjt:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEI-MLN

Query:  FLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
        F               AS PSITANA  PES  RPE             P FGFGDKLPSQKE  +SAPTFA
Subjt:  FLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

XP_023527371.1 nuclear pore complex protein NUP1-like [Cucurbita pepo subsp. pepo]9.4e-1944.44Show/hide
Query:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF
        S   DFV++DEE  SNGP TDISF+RREK+DG L A+SKP+DTEAIT             +  P       P  ASE N    Q  + +P ++ +  + F
Subjt:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF

Query:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
         F             AS PSI ANA  PES  RPE             P FGFGDKLPSQKE  +SAPTFA
Subjt:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

TrEMBL top hitse value%identityAlignment
A0A6J1BX57 nuclear pore complex protein NUP18.3e-2142.47Show/hide
Query:  SSSFPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQH
        SS  PS+  +W    S   DFV+M+EE YSNGPATDIS  RREK+DGPLVAVSKPSDT+  T   D     +  +   P++E N +  +      A  + 
Subjt:  SSSFPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQH

Query:  ATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
        + +           F F             AS PSITA+ + PE ASR E             P FGFGDK P QKELI SAPTFA
Subjt:  ATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

A0A6J1FF52 nuclear pore complex protein NUP1-like isoform X18.1e-1637.84Show/hide
Query:  AGLANSRVCATPSIKNTVASTSTDAFRPTWSSS-------FPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITE
        + L  S+V +  SI       S+   + T SSS        PS+  +     S Q DFV+MD+E YSNGP +  SFERREK+D  LVAV KPSDTEAIT 
Subjt:  AGLANSRVCATPSIKNTVASTSTDAFRPTWSSS-------FPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITE

Query:  GIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------
                    +  P     A P   SE      Q  + +P ++ E    F F             AS  S TAN I PES +RPE             
Subjt:  GIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------

Query:  PRFGFGDKLPSQKELIASAPTF
        P FGFG+KLPSQK+ + S+PTF
Subjt:  PRFGFGDKLPSQKELIASAPTF

A0A6J1FKS9 nuclear pore complex protein NUP1-like isoform X28.1e-1637.84Show/hide
Query:  AGLANSRVCATPSIKNTVASTSTDAFRPTWSSS-------FPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITE
        + L  S+V +  SI       S+   + T SSS        PS+  +     S Q DFV+MD+E YSNGP +  SFERREK+D  LVAV KPSDTEAIT 
Subjt:  AGLANSRVCATPSIKNTVASTSTDAFRPTWSSS-------FPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITE

Query:  GIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------
                    +  P     A P   SE      Q  + +P ++ E    F F             AS  S TAN I PES +RPE             
Subjt:  GIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------

Query:  PRFGFGDKLPSQKELIASAPTF
        P FGFG+KLPSQK+ + S+PTF
Subjt:  PRFGFGDKLPSQKELIASAPTF

A0A6J1GY88 nuclear pore complex protein NUP1-like6.0e-1945.03Show/hide
Query:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF
        S   DFV++DEE  SNGP TDISF+RREK+D  LVA+SKPSDTEAIT             +  P       P   SE N    Q  + +P ++ E    F
Subjt:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNF

Query:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
         F             AS PSIT NA  PES  RPE             P FGFGDKLPSQKE  +SAPTFA
Subjt:  LFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

A0A6J1JSL2 nuclear pore complex protein NUP1-like2.7e-1944.77Show/hide
Query:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEI-MLN
        S   DFV++DEE  SNGP TDISF+RREK+DG LVA+SKP+DTEAIT             +  P       P   SE N    Q  + +P  + +  +L+
Subjt:  SKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRPPLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEI-MLN

Query:  FLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA
        F               AS PSITANA  PES  RPE             P FGFGDKLPSQKE  +SAPTFA
Subjt:  FLFQQILLQRMEIRMQASSPSITANAIGPESASRPEN------------PRFGFGDKLPSQKELIASAPTFA

SwissProt top hitse value%identityAlignment
Q9XI00 F-box protein 71.1e-1473.08Show/hide
Query:  DLYGVNARHVVSCGSSSRKPFVDPALIYHCWPDELLFQVFARMTPYDLGRAS
        DLYGV+ R V   GS+SRKP  DPALI+ C PDELLF+VFARM PYDLGRAS
Subjt:  DLYGVNARHVVSCGSSSRKPFVDPALIYHCWPDELLFQVFARMTPYDLGRAS

Arabidopsis top hitse value%identityAlignment
AT1G21760.1 F-box protein 78.0e-1673.08Show/hide
Query:  DLYGVNARHVVSCGSSSRKPFVDPALIYHCWPDELLFQVFARMTPYDLGRAS
        DLYGV+ R V   GS+SRKP  DPALI+ C PDELLF+VFARM PYDLGRAS
Subjt:  DLYGVNARHVVSCGSSSRKPFVDPALIYHCWPDELLFQVFARMTPYDLGRAS

AT1G77600.1 ARM repeat superfamily protein3.1e-0437.7Show/hide
Query:  SESLEVHDISEEKCASEIVESIWR----------LLLNDDFEQIPCKVLMLCYDKDCREFR
        SE L    IS  K A + +  +++          + + D+FEQIPCK+L+LC +K+C EFR
Subjt:  SESLEVHDISEEKCASEIVESIWR----------LLLNDDFEQIPCKVLMLCYDKDCREFR

AT1G77600.2 ARM repeat superfamily protein3.1e-0437.7Show/hide
Query:  SESLEVHDISEEKCASEIVESIWR----------LLLNDDFEQIPCKVLMLCYDKDCREFR
        SE L    IS  K A + +  +++          + + D+FEQIPCK+L+LC +K+C EFR
Subjt:  SESLEVHDISEEKCASEIVESIWR----------LLLNDDFEQIPCKVLMLCYDKDCREFR

AT1G77600.3 ARM repeat superfamily protein3.1e-0437.7Show/hide
Query:  SESLEVHDISEEKCASEIVESIWR----------LLLNDDFEQIPCKVLMLCYDKDCREFR
        SE L    IS  K A + +  +++          + + D+FEQIPCK+L+LC +K+C EFR
Subjt:  SESLEVHDISEEKCASEIVESIWR----------LLLNDDFEQIPCKVLMLCYDKDCREFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGTTTGGTTTTGAAGGGGAAAGTTATTATAGTTGCAAAACTTGGGATGAAGTTGGAAATCCCAGGTGGTGCAGTACTTGAGAATCAGATGGCGATAAAGATTGA
ATTGAAACAGCCTCACCAAATGGATGAGTCGTCGCTGGATCTTATGTTTGACATCGGCACTGAATTGAGCAAGCACACTCGCCCAACCAAGGATTACATTGTCAAATCCC
TACAATCTTACGCCCTAGATGCTGCAAGGGAGTCAGAATCTCTGGAAGTTCATGATATCAGTGAGGAAAAATGCGCTTCAGAAATTGTTGAAAGCATATGGAGATTATTG
TTGAACGATGACTTTGAACAGATCCCATGTAAAGTCTTGATGCTTTGCTATGATAAAGATTGTAGGGAGTTCAGGTGTTCTATTTCAGATCTCTACGGGGTTAACGCCAG
ACATGTCGTGTCATGCGGGAGTTCCAGTAGAAAGCCATTTGTTGATCCAGCATTAATATATCATTGCTGGCCAGATGAGTTACTGTTTCAGGTCTTTGCCAGAATGACTC
CTTATGACCTGGGACGGGCATCTGTCTCCGTCGAAAATGGAGATACACTATTGGTCTTTGGCGTTGTTGGTGTTATTGAAAATGGTTGTTTCACCCGTAGATCTGAAGGC
AGATCTGCTGTATACATTATGGCTGGACTGGCAAATTCCAGAGTTTGTGCAACCCCTAGCATAAAGAATACTGTAGCCTCAACCTCAACAGATGCTTTCAGACCAACATG
GTCGTCGTCATTTCCGTCAGCATGGGAGCAATGGAGAATTTTGGGGTCAAAACAAGGGGATTTTGTGAATATGGATGAGGAAAGATATTCTAATGGGCCAGCGACTGATA
TATCATTTGAGAGGCGAGAAAAAATTGACGGCCCATTGGTGGCGGTGAGTAAGCCAAGTGATACTGAAGCCATTACAGAAGGAATTGATTGCTTTCTGATGTTAAGGCCT
CCACTGTTACCACCGCCAACAAATGAACAAAATGCCGTTCCTGTTGTAGCTTCTGAAGGCAATGTTGCACCAATTCAACATGCTACTGTTCTACCACATTTAAGTTTGGA
GATAATGCTAAATTTCCTATTCCAGCAAATACTGCTACAGAGAATGGAAATAAGAATGCAGGCATCTTCACCTAGCATTACAGCCAATGCGATAGGTCCTGAATCAGCCT
CGAGGCCTGAAAATCCTAGATTTGGGTTTGGAGACAAGTTGCCGTCGCAGAAGGAATTGATTGCTTCGGCCCCCACGTTTGCTGCGGAAACAAGCACTATACAACACTTC
TTTATTTATAACCCCATGCCTATGTTTATTGATCCCACAGGGCGGAATGCTTCTGCGAGTGTGCACTTGTTCTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGTTTGGTTTTGAAGGGGAAAGTTATTATAGTTGCAAAACTTGGGATGAAGTTGGAAATCCCAGGTGGTGCAGTACTTGAGAATCAGATGGCGATAAAGATTGA
ATTGAAACAGCCTCACCAAATGGATGAGTCGTCGCTGGATCTTATGTTTGACATCGGCACTGAATTGAGCAAGCACACTCGCCCAACCAAGGATTACATTGTCAAATCCC
TACAATCTTACGCCCTAGATGCTGCAAGGGAGTCAGAATCTCTGGAAGTTCATGATATCAGTGAGGAAAAATGCGCTTCAGAAATTGTTGAAAGCATATGGAGATTATTG
TTGAACGATGACTTTGAACAGATCCCATGTAAAGTCTTGATGCTTTGCTATGATAAAGATTGTAGGGAGTTCAGGTGTTCTATTTCAGATCTCTACGGGGTTAACGCCAG
ACATGTCGTGTCATGCGGGAGTTCCAGTAGAAAGCCATTTGTTGATCCAGCATTAATATATCATTGCTGGCCAGATGAGTTACTGTTTCAGGTCTTTGCCAGAATGACTC
CTTATGACCTGGGACGGGCATCTGTCTCCGTCGAAAATGGAGATACACTATTGGTCTTTGGCGTTGTTGGTGTTATTGAAAATGGTTGTTTCACCCGTAGATCTGAAGGC
AGATCTGCTGTATACATTATGGCTGGACTGGCAAATTCCAGAGTTTGTGCAACCCCTAGCATAAAGAATACTGTAGCCTCAACCTCAACAGATGCTTTCAGACCAACATG
GTCGTCGTCATTTCCGTCAGCATGGGAGCAATGGAGAATTTTGGGGTCAAAACAAGGGGATTTTGTGAATATGGATGAGGAAAGATATTCTAATGGGCCAGCGACTGATA
TATCATTTGAGAGGCGAGAAAAAATTGACGGCCCATTGGTGGCGGTGAGTAAGCCAAGTGATACTGAAGCCATTACAGAAGGAATTGATTGCTTTCTGATGTTAAGGCCT
CCACTGTTACCACCGCCAACAAATGAACAAAATGCCGTTCCTGTTGTAGCTTCTGAAGGCAATGTTGCACCAATTCAACATGCTACTGTTCTACCACATTTAAGTTTGGA
GATAATGCTAAATTTCCTATTCCAGCAAATACTGCTACAGAGAATGGAAATAAGAATGCAGGCATCTTCACCTAGCATTACAGCCAATGCGATAGGTCCTGAATCAGCCT
CGAGGCCTGAAAATCCTAGATTTGGGTTTGGAGACAAGTTGCCGTCGCAGAAGGAATTGATTGCTTCGGCCCCCACGTTTGCTGCGGAAACAAGCACTATACAACACTTC
TTTATTTATAACCCCATGCCTATGTTTATTGATCCCACAGGGCGGAATGCTTCTGCGAGTGTGCACTTGTTCTTGTAG
Protein sequenceShow/hide protein sequence
MVGLVLKGKVIIVAKLGMKLEIPGGAVLENQMAIKIELKQPHQMDESSLDLMFDIGTELSKHTRPTKDYIVKSLQSYALDAARESESLEVHDISEEKCASEIVESIWRLL
LNDDFEQIPCKVLMLCYDKDCREFRCSISDLYGVNARHVVSCGSSSRKPFVDPALIYHCWPDELLFQVFARMTPYDLGRASVSVENGDTLLVFGVVGVIENGCFTRRSEG
RSAVYIMAGLANSRVCATPSIKNTVASTSTDAFRPTWSSSFPSAWEQWRILGSKQGDFVNMDEERYSNGPATDISFERREKIDGPLVAVSKPSDTEAITEGIDCFLMLRP
PLLPPPTNEQNAVPVVASEGNVAPIQHATVLPHLSLEIMLNFLFQQILLQRMEIRMQASSPSITANAIGPESASRPENPRFGFGDKLPSQKELIASAPTFAAETSTIQHF
FIYNPMPMFIDPTGRNASASVHLFL