; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026577 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026577
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontranscription termination/antitermination protein NusG
Genome locationtig00153033:1448583..1451682
RNA-Seq ExpressionSgr026577
SyntenySgr026577
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR006645 - NusG, N-terminal
IPR008991 - Translation protein SH3-like domain superfamily
IPR014722 - Ribosomal protein L2, domain 2
IPR036735 - NusG, N-terminal domain superfamily
IPR043425 - NusG-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147896.1 uncharacterized protein LOC101211195 [Cucumis sativus]2.1e-14984.04Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE
        MA GLL WS       + L S S PA SFSLSSS+RTQLS+SA++ET   AADD  QLS RERR+LRN RREIKTTTNWREEVEERLC+KPKKEFA+WTE
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE

Query:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA
        KLNLDYLAKLGPQWWVMRVARVR QEIVERLAR LARN+PDLDFKIYYP+V+EKR+LKNGTYTV P+ VFPGSVFIRC+MNKEIHDFIRECDGVGGFVGA
Subjt:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA

Query:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL
        KVGNTKRQINKPKPV EADMEAIFKEAK+EQERHDQAFLEKEQEEAPN+S L+TDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEG LKKL
Subjt:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL

Query:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK
        NRK+GKVTVGFTLFGKETLVDL++GDI+V TK
Subjt:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK

XP_008448915.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]6.3e-15487.05Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE
        MA GLL WS      PI L S SLPA SFSLSSS+RTQLSISA++ET   AADDV QLSAR+RR+LRN RREIKTTTNWREEVEERLC+KPKKEFA+WTE
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE

Query:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA
        KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARN+PDLDFKIYYP+V+EKR+LKNGTYTVKP+ VFPGSVFIRC+MNKEIHDFIRECDGVGGFVGA
Subjt:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA

Query:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL
        KVGNTKRQINKPKPV EADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSS L+TDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEG LKKL
Subjt:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL

Query:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK
        NRK+GKVTVGFTLFGKETLVDL++GDI+V TK
Subjt:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK

XP_022151589.1 uncharacterized protein LOC111019492 [Momordica charantia]6.7e-15689.97Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET-AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKL
        MA GLLPWS       I LRS S PA SFSLSSSK TQLSISAALET AADDV QLSARERRRLRN RREIKTTTNWREEVEERLCKKPKKEFASWTEKL
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET-AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKL

Query:  NLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKV
        NLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARN+PDLDFKIYYP+VQEKRRLKNGTY VKPR VFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKV
Subjt:  NLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKV

Query:  GNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNR
        GNTKRQINKPKPV EADMEAIFKEAKEEQERHDQ FLEKEQE+APNS++ +TDLDTNGTTATKPKGR KKAVNALSPGSTVRVASGTFAEFEG LKKLNR
Subjt:  GNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNR

Query:  KNGKVTVGFTLFGKETLVDLNVGDIVVMT
        K+GKVTVGFTLFGKETLVDL++GDIVV T
Subjt:  KNGKVTVGFTLFGKETLVDLNVGDIVVMT

XP_022931956.1 uncharacterized protein LOC111438223 [Cucurbita moschata]1.0e-14885.41Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN
        MA GLL W+     LP  LRSPS P+ SFSLSSS RTQLSISAALETAADDV QLSARERRRLRN RRE K TTNWREEVEERLCKKPKKEFA+WTEKLN
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN

Query:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG
        LDYLAKLGPQWWVMRV+RVRGQEIVERLARSLARN+PDLDFKIYYP+V EKR+LKNG+YTVKP+ VFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVG
Subjt:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG

Query:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRK
        NTKRQINKPKPV + DMEAIFKEAKEEQERHDQAFLEKE+E+APN S+LETDLDTNGTTATK KGR KKAVN LSPGSTVRV+SGTFAEFEG LKK+NRK
Subjt:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRK

Query:  NGKVTVGFTLFGKETLVDLNVGDIVVMTK
        + KVTVGFTLFGKETLV+L++GDI+V TK
Subjt:  NGKVTVGFTLFGKETLVDLNVGDIVVMTK

XP_038880828.1 transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida]2.8e-15487.31Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET--AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEK
        MA GLL WS      PI LRS S PA SFSLSS KRTQLSISA +ET  AADD+ QLSARERR+LRN RREIKTTTNWREEVEERLCKKPKKEFA+WTEK
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET--AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEK

Query:  LNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAK
        LNLDYL+KLGPQWWVMRVARVRGQEIVERLARSLARN+PDLDFKIYYP+VQEKR+LKNGTYTVKP+ +FPGSVFIRCIMNKEIHDFIRECDGVGGFVGAK
Subjt:  LNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAK

Query:  VGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLN
        VGNTKRQINKPKPV EADMEAIFKEAKEEQERHDQAFLEKEQ+ APNSS LETDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEG LKKLN
Subjt:  VGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLN

Query:  RKNGKVTVGFTLFGKETLVDLNVGDIVVMTK
        RK+GKVTVGFTLFGKETLVDL++GDI+V TK
Subjt:  RKNGKVTVGFTLFGKETLVDLNVGDIVVMTK

TrEMBL top hitse value%identityAlignment
A0A0A0KZV1 NGN domain-containing protein1.0e-14984.04Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE
        MA GLL WS       + L S S PA SFSLSSS+RTQLS+SA++ET   AADD  QLS RERR+LRN RREIKTTTNWREEVEERLC+KPKKEFA+WTE
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE

Query:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA
        KLNLDYLAKLGPQWWVMRVARVR QEIVERLAR LARN+PDLDFKIYYP+V+EKR+LKNGTYTV P+ VFPGSVFIRC+MNKEIHDFIRECDGVGGFVGA
Subjt:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA

Query:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL
        KVGNTKRQINKPKPV EADMEAIFKEAK+EQERHDQAFLEKEQEEAPN+S L+TDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEG LKKL
Subjt:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL

Query:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK
        NRK+GKVTVGFTLFGKETLVDL++GDI+V TK
Subjt:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK

A0A1S3BKU2 transcription termination/antitermination protein NusG3.0e-15487.05Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE
        MA GLL WS      PI L S SLPA SFSLSSS+RTQLSISA++ET   AADDV QLSAR+RR+LRN RREIKTTTNWREEVEERLC+KPKKEFA+WTE
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET---AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTE

Query:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA
        KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARN+PDLDFKIYYP+V+EKR+LKNGTYTVKP+ VFPGSVFIRC+MNKEIHDFIRECDGVGGFVGA
Subjt:  KLNLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGA

Query:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL
        KVGNTKRQINKPKPV EADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSS L+TDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEG LKKL
Subjt:  KVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKL

Query:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK
        NRK+GKVTVGFTLFGKETLVDL++GDI+V TK
Subjt:  NRKNGKVTVGFTLFGKETLVDLNVGDIVVMTK

A0A6J1DDH4 uncharacterized protein LOC1110194923.3e-15689.97Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET-AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKL
        MA GLLPWS       I LRS S PA SFSLSSSK TQLSISAALET AADDV QLSARERRRLRN RREIKTTTNWREEVEERLCKKPKKEFASWTEKL
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALET-AADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKL

Query:  NLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKV
        NLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARN+PDLDFKIYYP+VQEKRRLKNGTY VKPR VFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKV
Subjt:  NLDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKV

Query:  GNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNR
        GNTKRQINKPKPV EADMEAIFKEAKEEQERHDQ FLEKEQE+APNS++ +TDLDTNGTTATKPKGR KKAVNALSPGSTVRVASGTFAEFEG LKKLNR
Subjt:  GNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNR

Query:  KNGKVTVGFTLFGKETLVDLNVGDIVVMT
        K+GKVTVGFTLFGKETLVDL++GDIVV T
Subjt:  KNGKVTVGFTLFGKETLVDLNVGDIVVMT

A0A6J1EV13 uncharacterized protein LOC1114382235.0e-14985.41Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN
        MA GLL W+     LP  LRSPS P+ SFSLSSS RTQLSISAALETAADDV QLSARERRRLRN RRE K TTNWREEVEERLCKKPKKEFA+WTEKLN
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN

Query:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG
        LDYLAKLGPQWWVMRV+RVRGQEIVERLARSLARN+PDLDFKIYYP+V EKR+LKNG+YTVKP+ VFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVG
Subjt:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG

Query:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRK
        NTKRQINKPKPV + DMEAIFKEAKEEQERHDQAFLEKE+E+APN S+LETDLDTNGTTATK KGR KKAVN LSPGSTVRV+SGTFAEFEG LKK+NRK
Subjt:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRK

Query:  NGKVTVGFTLFGKETLVDLNVGDIVVMTK
        + KVTVGFTLFGKETLV+L++GDI+V TK
Subjt:  NGKVTVGFTLFGKETLVDLNVGDIVVMTK

A0A6J1HQX6 uncharacterized protein LOC1114658891.6e-14784.19Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN
        MA  LL W+       + LRSPS P+ SFSLSSS RTQLSISAALETAADDV QLSARERRRLRN RRE K TTNWREEVEERLCKKPKKEFA+WTEKLN
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN

Query:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG
        LDYL+KLGPQWWVMRV+RVRGQEIVERLARSLARN+PDLDFKIYYP+V EKR+LKNG+YTVKP+ VFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVG
Subjt:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG

Query:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRK
        NTKRQINKPKPV + DMEAIFKEAKEEQERHDQAFLEK++E+APN S+LET LDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEG LKK+NRK
Subjt:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRK

Query:  NGKVTVGFTLFGKETLVDLNVGDIVVMTK
        +GKVTVGFTLFGKETLV L++GDI+V TK
Subjt:  NGKVTVGFTLFGKETLVDLNVGDIVVMTK

SwissProt top hitse value%identityAlignment
P29397 Transcription termination/antitermination protein NusG4.9e-0828.31Show/hide
Query:  YTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGT
        Y  K R +FPG VF+  IMN E ++F+R    V GFV +          +P PV + +M  I + A  E           E EE                
Subjt:  YTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGT

Query:  TATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTVGFTLFGKETLVDLNVGDI
             K +  K       G  V++ SG F +F G +K+++ +  ++ V  T+FG+ET V L+V ++
Subjt:  TATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTVGFTLFGKETLVDLNVGDI

P65591 Transcription termination/antitermination protein NusG1.7e-0821.56Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  R  +PG V +   M  +    ++    V GF+G +        
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTV
        N+P P+ + + E I ++ +                                T   KPK + +  V     G  VRV  G FA+F G ++++N +  K+ V
Subjt:  NKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTV

Query:  GFTLFGKETLVDLNVGDI
           +FG+ET V+L    +
Subjt:  GFTLFGKETLVDLNVGDI

P65592 Transcription termination/antitermination protein NusG1.7e-0821.56Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  R  +PG V +   M  +    ++    V GF+G +        
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTV
        N+P P+ + + E I ++ +                                T   KPK + +  V     G  VRV  G FA+F G ++++N +  K+ V
Subjt:  NKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTV

Query:  GFTLFGKETLVDLNVGDI
           +FG+ET V+L    +
Subjt:  GFTLFGKETLVDLNVGDI

Q06795 Transcription termination/antitermination protein NusG2.2e-0824.19Show/hide
Query:  WWVMRVARVRGQEIVERLARSL-ARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP
        W+V+        ++   L + + +    D  F++  P  +E+  +KNG   V  + VFPG V +  +M  +    +R   GV GFVG+         +KP
Subjt:  WWVMRVARVRGQEIVERLARSL-ARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP

Query:  KPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTVGFT
         P+   + E I K    ++ + D  F  KE                                       TV+V  G FA F G +++++    KV V   
Subjt:  KPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTVGFT

Query:  LFGKETLVDLNVGDI
        +FG+ET V+L    I
Subjt:  LFGKETLVDLNVGDI

Q9HWC4 Transcription termination/antitermination protein NusG5.8e-0920.64Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V+       + ++  L   +     + +F       +E   ++NG      R  FPG V ++  MN+     +++   V GF+G          
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTV
        +KP P+ + + +AI +   +                                +  KPK +T        PG TVRV  G FA+F G ++++N +  ++ V
Subjt:  NKPKPVCEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTV

Query:  GFTLFGKETLVDLNVGDI
           +FG+ T V+L    +
Subjt:  GFTLFGKETLVDLNVGDI

Arabidopsis top hitse value%identityAlignment
AT3G09210.1 plastid transcriptionally active 132.1e-9153.91Show/hide
Query:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN
        +  GLL WS          RS  +P+     +   +TQ SI+A +    +  HQL+A+ERR+LRN RRE K   +WREEVEE+L KKPKK +A+WTE+LN
Subjt:  MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLN

Query:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG
        LD LA+ GPQWW +RV+R+RG E  + LAR+LAR FP+++F +Y P+VQ KR+LKNG+ +VKP+PVFPG +FIRCI+NKEIHD IR+ DGVGGF+G+KVG
Subjt:  LDYLAKLGPQWWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVG

Query:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLE--KEQEEA-----------PNSSMLETDLDT-------NGTTATKPKGRTKKAVNALSPGSTV
        NTKRQINKP+PV ++D+EAIFK+AKE QE+ D  F E  + +EEA            NS ++ET  ++         T AT+ K + KK    L+ GSTV
Subjt:  NTKRQINKPKPVCEADMEAIFKEAKEEQERHDQAFLE--KEQEEA-----------PNSSMLETDLDT-------NGTTATKPKGRTKKAVNALSPGSTV

Query:  RVASGTFAEFEGCLKKLNRKNGKVTVGFTLFGKETLVDLNVGDIV
        RV SGTFAEF G LKKLNRK  K TVGFTLFGKETLV++++ ++V
Subjt:  RVASGTFAEFEGCLKKLNRKNGKVTVGFTLFGKETLVDLNVGDIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTGGGCTTCTGCCATGGAGTCCATGCCACGCCCACCTTCCAATTTATCTACGCTCTCCCTCCCTCCCTGCGCGTTCCTTCTCTCTCTCTTCCTCCAAACGCAC
CCAACTATCAATCTCCGCCGCCCTCGAAACCGCCGCCGACGATGTCCACCAGCTCTCGGCCCGGGAGAGAAGGAGGCTGAGAAACGTGAGGAGAGAGATTAAAACCACAA
CCAATTGGAGAGAAGAAGTGGAAGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCTTCTTGGACTGAGAAGCTCAACCTCGATTACCTCGCTAAATTGGGTCCTCAA
TGGTGGGTTATGCGGGTCGCTCGCGTCAGAGGTCAGGAAATTGTCGAACGCCTCGCTCGTTCTCTTGCTAGGAACTTCCCCGACCTCGATTTCAAGATATATTACCCGGC
TGTCCAGGAGAAGAGGAGATTAAAGAACGGTACTTACACGGTTAAACCAAGACCTGTTTTCCCAGGATCTGTATTTATAAGGTGTATCATGAACAAAGAGATACATGACT
TTATCAGAGAGTGTGATGGAGTTGGAGGCTTTGTTGGTGCGAAGGTTGGAAACACGAAACGACAAATAAACAAACCAAAGCCAGTTTGTGAAGCTGACATGGAAGCAATC
TTCAAAGAGGCAAAGGAAGAACAAGAAAGACACGACCAGGCCTTTCTAGAGAAAGAGCAAGAGGAAGCTCCGAATTCTAGCATGCTCGAAACTGACTTAGATACAAATGG
TACTACCGCTACAAAGCCCAAAGGAAGAACGAAGAAGGCTGTTAATGCTTTGTCTCCAGGCTCAACCGTTCGGGTGGCATCTGGGACTTTTGCAGAATTTGAAGGCTGTC
TTAAGAAGCTGAACCGTAAAAATGGAAAGGTAACTGTGGGATTCACACTATTTGGAAAGGAAACCCTTGTAGACCTTAACGTTGGTGATATTGTAGTGATGACGAAGGAT
CCCACTGGAGCCGTAGTGGAAGTTTGTGATGGAAATTATTTGGTATCTTTCACACCAGTTGGGTTAAGGAGAGCCTCTACTCTAAATGGGAAGAAGCTCATGGAGTTGGA
GGATAAGCAAACAGCCCAAGCTGTCAGCTCACAGGCACATCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTGGGCTTCTGCCATGGAGTCCATGCCACGCCCACCTTCCAATTTATCTACGCTCTCCCTCCCTCCCTGCGCGTTCCTTCTCTCTCTCTTCCTCCAAACGCAC
CCAACTATCAATCTCCGCCGCCCTCGAAACCGCCGCCGACGATGTCCACCAGCTCTCGGCCCGGGAGAGAAGGAGGCTGAGAAACGTGAGGAGAGAGATTAAAACCACAA
CCAATTGGAGAGAAGAAGTGGAAGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCTTCTTGGACTGAGAAGCTCAACCTCGATTACCTCGCTAAATTGGGTCCTCAA
TGGTGGGTTATGCGGGTCGCTCGCGTCAGAGGTCAGGAAATTGTCGAACGCCTCGCTCGTTCTCTTGCTAGGAACTTCCCCGACCTCGATTTCAAGATATATTACCCGGC
TGTCCAGGAGAAGAGGAGATTAAAGAACGGTACTTACACGGTTAAACCAAGACCTGTTTTCCCAGGATCTGTATTTATAAGGTGTATCATGAACAAAGAGATACATGACT
TTATCAGAGAGTGTGATGGAGTTGGAGGCTTTGTTGGTGCGAAGGTTGGAAACACGAAACGACAAATAAACAAACCAAAGCCAGTTTGTGAAGCTGACATGGAAGCAATC
TTCAAAGAGGCAAAGGAAGAACAAGAAAGACACGACCAGGCCTTTCTAGAGAAAGAGCAAGAGGAAGCTCCGAATTCTAGCATGCTCGAAACTGACTTAGATACAAATGG
TACTACCGCTACAAAGCCCAAAGGAAGAACGAAGAAGGCTGTTAATGCTTTGTCTCCAGGCTCAACCGTTCGGGTGGCATCTGGGACTTTTGCAGAATTTGAAGGCTGTC
TTAAGAAGCTGAACCGTAAAAATGGAAAGGTAACTGTGGGATTCACACTATTTGGAAAGGAAACCCTTGTAGACCTTAACGTTGGTGATATTGTAGTGATGACGAAGGAT
CCCACTGGAGCCGTAGTGGAAGTTTGTGATGGAAATTATTTGGTATCTTTCACACCAGTTGGGTTAAGGAGAGCCTCTACTCTAAATGGGAAGAAGCTCATGGAGTTGGA
GGATAAGCAAACAGCCCAAGCTGTCAGCTCACAGGCACATCCATGA
Protein sequenceShow/hide protein sequence
MASGLLPWSPCHAHLPIYLRSPSLPARSFSLSSSKRTQLSISAALETAADDVHQLSARERRRLRNVRREIKTTTNWREEVEERLCKKPKKEFASWTEKLNLDYLAKLGPQ
WWVMRVARVRGQEIVERLARSLARNFPDLDFKIYYPAVQEKRRLKNGTYTVKPRPVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVCEADMEAI
FKEAKEEQERHDQAFLEKEQEEAPNSSMLETDLDTNGTTATKPKGRTKKAVNALSPGSTVRVASGTFAEFEGCLKKLNRKNGKVTVGFTLFGKETLVDLNVGDIVVMTKD
PTGAVVEVCDGNYLVSFTPVGLRRASTLNGKKLMELEDKQTAQAVSSQAHP