CuGenDBv2

Sgr022620 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022620
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: nuclear pore complex protein NUP1
Genome location: tig00000289:1661803..1676227
RNA-Seq Expression: Sgr022620
Synteny: Sgr022620
Gene Ontology terms: GO:0009987 - cellular process (biological process)
InterPro domains: IPR036561 - Mitochondrial glycoprotein superfamily
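
The genome location above follows a `contig:start..end` pattern. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and computes the span of the locus. The function names are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive at both ends, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval in bp.

    Assumption: coordinates are 1-based and inclusive at both ends.
    """
    return end - start + 1


contig, start, end = parse_location("tig00000289:1661803..1676227")
print(contig, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 14425 bp on contig tig00000289.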


Homology
GenBank top hits (e value, % identity, alignment)
KAG6578832.1 Nuclear pore complex protein NUP1, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.8e-19; % identity: 47.33
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+TATEN N  AGSPF+FAS+LVNEKE AK    S+  +E++  +T  FG  V +  
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI

Query:  PANAASEYGNQNAGSPFEFASPLVNEKESAKVCSASALKAESSCSIPPAS
         +  A +  + +AG     +  L+    S+     + +    S S+P +S
Subjt:  PANAASEYGNQNAGSPFEFASPLVNEKESAKVCSASALKAESSCSIPPAS

KAG7016358.1 Nuclear pore complex protein NUP1 [Cucurbita argyrosperma subsp. argyrosperma]; e value: 4.8e-20; % identity: 56.52
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+TATEN N  AGSPF+FAS+LVNEKE AK    S+  +E++  +T  FG      +
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI

Query:  PANAASEYGNQNAGS
        P  + SE       S
Subjt:  PANAASEYGNQNAGS

XP_022133602.1 nuclear pore complex protein NUP1 [Momordica charantia]; e value: 5.7e-21; % identity: 35.94
Query:  DFVEMDEEGYSNGPTTDISSDRRERADGPVVAVTIPTNEQNA----------VPFVASEGNVAPIQQATVPTT------FKFGANATFPIPANT------
        DFV+M+EEGYSNGP TDISS RRE+ DGP+VAV+ P++              V   +    +  + ++  P T      F F A +   I A+       
Subjt:  DFVEMDEEGYSNGPTTDISSDRRERADGPVVAVTIPTNEQNA----------VPFVASEGNVAPIQQATVPTT------FKFGANATFPIPANT------

Query:  ------ATENVNKNAGSP-FEFASALVNEKE---------------------------SAKDSIVSLNEATVLTTYKFGDKVTRPIPANAASEYGNQNAG
               +  + K A +P F F      +KE                           +++ ++     A V TT+KFGDK T PI  NA +E GN+N+G
Subjt:  ------ATENVNKNAGSP-FEFASALVNEKE---------------------------SAKDSIVSLNEATVLTTYKFGDKVTRPIPANAASEYGNQNAG

Query:  SPFEFASPLVNEKESAKVCSASALKAESSCSIP--PASKQTIECLI--AASTTGIT
        SPF+F+SPLV+EKESAKV SAS  KAESS SIP    SK+++   +   + +TG+T
Subjt:  SPFEFASPLVNEKESAKVCSASALKAESSCSIP--PASKQTIECLI--AASTTGIT

XP_022938795.1 nuclear pore complex protein NUP1-like isoform X2 [Cucurbita moschata]; e value: 4.8e-20; % identity: 56.52
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+TATEN N  AGSPF+FAS+LVNEKE AK    S+  +E++  +T  FG      +
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI

Query:  PANAASEYGNQNAGS
        P  + SE       S
Subjt:  PANAASEYGNQNAGS

XP_022992758.1 nuclear pore complex protein NUP1-like isoform X2 [Cucurbita maxima]; e value: 1.8e-19; % identity: 48.95
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAK-------------DSIVSLNEATVLTT
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+T TEN N  AGSPF+FAS+LVNEKE AK              SI+S      L +
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAK-------------DSIVSLNEATVLTT

Query:  YKFGDKVTRPIPANAA-------SEYGNQNAGSPFEFASPLVN
         K GDK +     +         S   +    S F F+SP  N
Subjt:  YKFGDKVTRPIPANAA-------SEYGNQNAGSPFEFASPLVN

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BX57 nuclear pore complex protein NUP1; e value: 2.7e-21; % identity: 35.94
Query:  DFVEMDEEGYSNGPTTDISSDRRERADGPVVAVTIPTNEQNA----------VPFVASEGNVAPIQQATVPTT------FKFGANATFPIPANT------
        DFV+M+EEGYSNGP TDISS RRE+ DGP+VAV+ P++              V   +    +  + ++  P T      F F A +   I A+       
Subjt:  DFVEMDEEGYSNGPTTDISSDRRERADGPVVAVTIPTNEQNA----------VPFVASEGNVAPIQQATVPTT------FKFGANATFPIPANT------

Query:  ------ATENVNKNAGSP-FEFASALVNEKE---------------------------SAKDSIVSLNEATVLTTYKFGDKVTRPIPANAASEYGNQNAG
               +  + K A +P F F      +KE                           +++ ++     A V TT+KFGDK T PI  NA +E GN+N+G
Subjt:  ------ATENVNKNAGSP-FEFASALVNEKE---------------------------SAKDSIVSLNEATVLTTYKFGDKVTRPIPANAASEYGNQNAG

Query:  SPFEFASPLVNEKESAKVCSASALKAESSCSIP--PASKQTIECLI--AASTTGIT
        SPF+F+SPLV+EKESAKV SAS  KAESS SIP    SK+++   +   + +TG+T
Subjt:  SPFEFASPLVNEKESAKVCSASALKAESSCSIP--PASKQTIECLI--AASTTGIT

A0A6J1FF52 nuclear pore complex protein NUP1-like isoform X1; e value: 2.0e-19; % identity: 72.15
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+TATEN N  AGSPF+FAS+LVNEKE AK    S+
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL

A0A6J1FKS9 nuclear pore complex protein NUP1-like isoform X2; e value: 2.3e-20; % identity: 56.52
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+TATEN N  AGSPF+FAS+LVNEKE AK    S+  +E++  +T  FG      +
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVSL--NEATVLTTYKFGDKVTRPI

Query:  PANAASEYGNQNAGS
        P  + SE       S
Subjt:  PANAASEYGNQNAGS

A0A6J1JQT4 nuclear pore complex protein NUP1-like isoform X1; e value: 4.4e-19; % identity: 47.62
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKD-----------------SIVSLNEAT
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+T TEN N  AGSPF+FAS+LVNEKE AK                  SI+S     
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKD-----------------SIVSLNEAT

Query:  VLTTYKFGDKVTRPIPANAA-------SEYGNQNAGSPFEFASPLVN
         L + K GDK +     +         S   +    S F F+SP  N
Subjt:  VLTTYKFGDKVTRPIPANAA-------SEYGNQNAGSPFEFASPLVN

A0A6J1K059 nuclear pore complex protein NUP1-like isoform X2; e value: 8.8e-20; % identity: 48.95
Query:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAK-------------DSIVSLNEATVLTT
        VT  TNEQNAVP V SEGNVAP  QA+ PTTFKFG  ATFPIPA+T TEN N  AGSPF+FAS+LVNEKE AK              SI+S      L +
Subjt:  VTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAK-------------DSIVSLNEATVLTT

Query:  YKFGDKVTRPIPANAA-------SEYGNQNAGSPFEFASPLVN
         K GDK +     +         S   +    S F F+SP  N
Subjt:  YKFGDKVTRPIPANAA-------SEYGNQNAGSPFEFASPLVN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G32605.1 Mitochondrial glycoprotein family protein; e value: 3.0e-12; % identity: 77.5
Query:  DLDEKMRDVLHNYIEELAVDESLFPFLQAWLYVKEHRNLL
        +LDEKMRDV H ++EE  V+ESLFPFLQAWLYVK+HRNLL
Subjt:  DLDEKMRDVLHNYIEELAVDESLFPFLQAWLYVKEHRNLL
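
The percent identity reported for a hit can be checked against the middle line of its alignment. The sketch below assumes the usual BLAST pairwise convention, in which the middle line repeats the residue letter at identical positions, shows `+` at conservative substitutions, and is blank elsewhere. It is applied to the Arabidopsis AT4G32605.1 alignment shown above; the function name is hypothetical.

```python
def midline_stats(query, midline):
    """Count identities and positives from a BLAST-style middle line.

    Assumptions: a letter in the middle line marks an identical residue,
    '+' marks a conservative substitution, and the alignment length is
    the length of the aligned query string (gap columns included).
    """
    length = len(query)
    # Trailing blanks of the middle line are often lost in plain text.
    midline = midline.ljust(length)
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, length


query = "DLDEKMRDVLHNYIEELAVDESLFPFLQAWLYVKEHRNLL"
midline = "+LDEKMRDV H ++EE  V+ESLFPFLQAWLYVK+HRNLL"

identities, positives, length = midline_stats(query, midline)
print(f"identities: {identities}/{length} = {100.0 * identities / length:.1f}%")
print(f"positives:  {positives}/{length} = {100.0 * positives / length:.1f}%")
```

For this alignment the count is 31 identities over 40 aligned positions, which matches the listed 77.5 % identity.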


Sequences
CDS sequence
GATTTTGTGGAAATGGATGAGGAAGGATATTCTAATGGGCCAACGACTGATATATCATCTGATAGGCGAGAAAGAGCTGATGGTCCAGTAGTCGCGGTTACCATCCCAAC
AAATGAACAAAATGCTGTTCCTTTTGTAGCTTCTGAAGGCAATGTTGCACCAATTCAACAAGCTACTGTTCCTACCACATTTAAATTTGGAGCTAATGCTACATTTCCTA
TTCCAGCAAATACTGCTACAGAGAATGTAAATAAGAATGCAGGGTCTCCATTTGAGTTTGCATCAGCTTTAGTTAATGAAAAAGAAAGTGCTAAAGATAGCATAGTAAGC
CTGAACGAAGCTACTGTTCTTACCACTTATAAATTTGGAGATAAAGTTACACGTCCTATTCCCGCAAATGCTGCTTCAGAATATGGAAATCAGAATGCAGGGTCTCCATT
TGAGTTTGCATCACCTTTAGTTAATGAAAAAGAAAGTGCTAAAGTATGTAGCGCTTCAGCTCTTAAAGCAGAGAGTTCTTGCAGTATTCCTCCTGCGTCAAAGCAGACCA
TCGAGTGCTTAATAGCTGCTTCCACTACCGGTATCACGGACATGGACGTTCCTCTTCCCGACGAAGCAGGTTCTAATTTCCATGAAGGTTATTTGGACCGCTTGAAACAG
ACGAGGAAGGATGAGAAAGAGCCATGTGATAGATCAATTCCGCCATTGATCTCATCGATTCAATCTCTACAACGTCACATTGCCGGAGGGAAGGAAAGTCGGTGGTTGGT
TCGAGACGGAGGAGGATCAGCGAGAGGAAATCTACGGTGGAAGAAGGAAGTATGGAAAAGACGGTTTCTAGTGTCTGAGACAATGTTTAGGGTTCAGGAGGGCAAGCTGA
TGGTCGGTTCGAGACAGGAGGCTTGGCGAGATGAATTATACGGACTGAAAACGAGTGGTCGAGACAGAGGAGGATTGGCAAGAACTGAAGAATTCTCCCTCGGAATAAGG
AAGTTTGTAGACAGTGTTAGAGATCAATGGCTGTTAGAGATCAATGGCTACTTTATTCTGCCAATTGACTATGAACAGATCCCATGTAAAGTCTTGATGCATGCTATGAT
AAAGATTGTCCCAGTGCATGGAACTTGTTCTTCCGTTCTTGTGGAAGATTTATTCTCAGCCCATCTCTCTATTGAGGAAAGGACTAGACACTTAATCTCTCTTTTCAATA
TCCGTCACGAAAAGGCTCTCAGATACGTTTTGCTGCAAAAACAGAGAGATCTAGATGAAAAGATGCGAGATGTGCTTCACAATTATATAGAAGAGCTAGCTGTAGACGAG
TCTCTCTTTCCATTTCTTCAAGCATGGCTTTATGTGAAAGAACATCGAAATCTTTTGGGAGCACGTATCGGATTCCTCTCAGCCCTCGCACGGGCTGTCGGAGAAGTGGT
CAATCAGGACACCGTCGAAGAGCGGTCGAAGCTTCCCCTACTTGGGCAGCCCCAGCATCTCCTCTGCTCCACCAATGTACCTGCCCCTGAGGAAAAGTTTTGGAGGGCTC
CCCATGTTTATTTCATTCCTGACGATACATGGGGGTCAATCCTGGTTGTTCGGACGCCGAGGAAACGGAATGCTTGGAGCGTTTTCATAAAGGGAAGGAATCTGGTTAAC
TGGTTGCCGACAAATGAATTTAAGGTTCTATTTGTTCAGGTATTTCCGTCTTTGCCTTGTAGTGACATTGGGTCTTATTGCTTCGGTTACTGA
mRNA sequence
GATTTTGTGGAAATGGATGAGGAAGGATATTCTAATGGGCCAACGACTGATATATCATCTGATAGGCGAGAAAGAGCTGATGGTCCAGTAGTCGCGGTTACCATCCCAAC
AAATGAACAAAATGCTGTTCCTTTTGTAGCTTCTGAAGGCAATGTTGCACCAATTCAACAAGCTACTGTTCCTACCACATTTAAATTTGGAGCTAATGCTACATTTCCTA
TTCCAGCAAATACTGCTACAGAGAATGTAAATAAGAATGCAGGGTCTCCATTTGAGTTTGCATCAGCTTTAGTTAATGAAAAAGAAAGTGCTAAAGATAGCATAGTAAGC
CTGAACGAAGCTACTGTTCTTACCACTTATAAATTTGGAGATAAAGTTACACGTCCTATTCCCGCAAATGCTGCTTCAGAATATGGAAATCAGAATGCAGGGTCTCCATT
TGAGTTTGCATCACCTTTAGTTAATGAAAAAGAAAGTGCTAAAGTATGTAGCGCTTCAGCTCTTAAAGCAGAGAGTTCTTGCAGTATTCCTCCTGCGTCAAAGCAGACCA
TCGAGTGCTTAATAGCTGCTTCCACTACCGGTATCACGGACATGGACGTTCCTCTTCCCGACGAAGCAGGTTCTAATTTCCATGAAGGTTATTTGGACCGCTTGAAACAG
ACGAGGAAGGATGAGAAAGAGCCATGTGATAGATCAATTCCGCCATTGATCTCATCGATTCAATCTCTACAACGTCACATTGCCGGAGGGAAGGAAAGTCGGTGGTTGGT
TCGAGACGGAGGAGGATCAGCGAGAGGAAATCTACGGTGGAAGAAGGAAGTATGGAAAAGACGGTTTCTAGTGTCTGAGACAATGTTTAGGGTTCAGGAGGGCAAGCTGA
TGGTCGGTTCGAGACAGGAGGCTTGGCGAGATGAATTATACGGACTGAAAACGAGTGGTCGAGACAGAGGAGGATTGGCAAGAACTGAAGAATTCTCCCTCGGAATAAGG
AAGTTTGTAGACAGTGTTAGAGATCAATGGCTGTTAGAGATCAATGGCTACTTTATTCTGCCAATTGACTATGAACAGATCCCATGTAAAGTCTTGATGCATGCTATGAT
AAAGATTGTCCCAGTGCATGGAACTTGTTCTTCCGTTCTTGTGGAAGATTTATTCTCAGCCCATCTCTCTATTGAGGAAAGGACTAGACACTTAATCTCTCTTTTCAATA
TCCGTCACGAAAAGGCTCTCAGATACGTTTTGCTGCAAAAACAGAGAGATCTAGATGAAAAGATGCGAGATGTGCTTCACAATTATATAGAAGAGCTAGCTGTAGACGAG
TCTCTCTTTCCATTTCTTCAAGCATGGCTTTATGTGAAAGAACATCGAAATCTTTTGGGAGCACGTATCGGATTCCTCTCAGCCCTCGCACGGGCTGTCGGAGAAGTGGT
CAATCAGGACACCGTCGAAGAGCGGTCGAAGCTTCCCCTACTTGGGCAGCCCCAGCATCTCCTCTGCTCCACCAATGTACCTGCCCCTGAGGAAAAGTTTTGGAGGGCTC
CCCATGTTTATTTCATTCCTGACGATACATGGGGGTCAATCCTGGTTGTTCGGACGCCGAGGAAACGGAATGCTTGGAGCGTTTTCATAAAGGGAAGGAATCTGGTTAAC
TGGTTGCCGACAAATGAATTTAAGGTTCTATTTGTTCAGGTATTTCCGTCTTTGCCTTGTAGTGACATTGGGTCTTATTGCTTCGGTTACTGA
Protein sequence
DFVEMDEEGYSNGPTTDISSDRRERADGPVVAVTIPTNEQNAVPFVASEGNVAPIQQATVPTTFKFGANATFPIPANTATENVNKNAGSPFEFASALVNEKESAKDSIVS
LNEATVLTTYKFGDKVTRPIPANAASEYGNQNAGSPFEFASPLVNEKESAKVCSASALKAESSCSIPPASKQTIECLIAASTTGITDMDVPLPDEAGSNFHEGYLDRLKQ
TRKDEKEPCDRSIPPLISSIQSLQRHIAGGKESRWLVRDGGGSARGNLRWKKEVWKRRFLVSETMFRVQEGKLMVGSRQEAWRDELYGLKTSGRDRGGLARTEEFSLGIR
KFVDSVRDQWLLEINGYFILPIDYEQIPCKVLMHAMIKIVPVHGTCSSVLVEDLFSAHLSIEERTRHLISLFNIRHEKALRYVLLQKQRDLDEKMRDVLHNYIEELAVDE
SLFPFLQAWLYVKEHRNLLGARIGFLSALARAVGEVVNQDTVEERSKLPLLGQPQHLLCSTNVPAPEEKFWRAPHVYFIPDDTWGSILVVRTPRKRNAWSVFIKGRNLVN
WLPTNEFKVLFVQVFPSLPCSDIGSYCFGY
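
The protein sequence should be the in-frame translation of the CDS. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) and reading from the first base of the CDS; the `translate` helper is hypothetical and is applied here only to the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence codon by codon from its first base.

    Trailing bases that do not fill a complete codon are ignored, and
    codons containing characters other than T, C, A, G become 'X'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


cds_start = "GATTTTGTGGAAATGGATGAGGAAGGATAT"  # first 30 nt of the CDS
cds_end = "TGCTTCGGTTACTGA"                   # last 15 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The first fragment translates to DFVEMDEEGY and the last to CFGY followed by a stop, consistent with the start and end of the protein sequence above. The CDS as listed begins with GAT rather than an ATG start codon.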