; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G15550 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G15550
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Descriptiontranscription termination/antitermination protein NusG
Genome locationChr4:12905440..12908408
RNA-Seq ExpressionCSPI04G15550
SyntenyCSPI04G15550
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR006645 - NusG, N-terminal
IPR008991 - Translation protein SH3-like domain superfamily
IPR014722 - Ribosomal protein L2, domain 2
IPR036735 - NusG, N-terminal domain superfamily
IPR043425 - NusG-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147896.1 uncharacterized protein LOC101211195 [Cucumis sativus]5.0e-17699.69Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVRSQEIVERLAR LARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

XP_008448915.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]1.3e-16895.4Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MA GLL+WS +SLCSTS PALSFSLSSSRRTQLS+SASVETPAAAADD QQLS R+RRKLRNERREIKTTTNWREEVEERLCRKPKKEFA WTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTV PKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQAFLEKEQEEAPN+SALKTDLDTNGTTATKHKGR KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

XP_022151589.1 uncharacterized protein LOC111019492 [Momordica charantia]6.4e-15589.23Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MA GLL WSS+SL S+SFPALSFSLSSS+ TQLS+SA++ET  AAADD QQLS RERR+LRNERREIKTTTNWREEVEERLC+KPKKEFA+WTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLARSLARNYPDLDFKIYYPSV+EKR+LKNGTY V P+AVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQ FLEKEQE+APN++  KTDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVET
        VTVGFTLFGKETLVDLDIGDI+VET
Subjt:  VTVGFTLFGKETLVDLDIGDIIVET

XP_022931956.1 uncharacterized protein LOC111438223 [Cucurbita moschata]6.0e-15387.42Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MACGLL W+++ L S SFP+LSFSLSSS RTQLS+SA++ET   AADD  QLS RERR+LRNERRE K TTNWREEVEERLC+KPKKEFANWTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRV+RVR QEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTV PKAVFPGSVFIRC+MNKE+HDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVS+ DMEAIFKEAK+EQERHDQAFLEKE+E+APN S L+TDLDTNGTTATKHKGRPKKAVNTLSPGSTVRV+SGTFAEFEGSLKK+NRKS K
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLV+LDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

XP_038880828.1 transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida]2.4e-16291.72Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MACGLL+WS +SL S SFPALSFSLSS +RTQLS+SA+VETP +AADD QQLS RERRKLRNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        L+KLGPQWWVMRVARVR QEIVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKA+FPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQAFLEKEQ+ APN+SAL+TDLDTNGTTATK KGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

TrEMBL top hitse value%identityAlignment
A0A0A0KZV1 NGN domain-containing protein2.4e-17699.69Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVRSQEIVERLAR LARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

A0A1S3BKU2 transcription termination/antitermination protein NusG6.4e-16995.4Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MA GLL+WS +SLCSTS PALSFSLSSSRRTQLS+SASVETPAAAADD QQLS R+RRKLRNERREIKTTTNWREEVEERLCRKPKKEFA WTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTV PKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQAFLEKEQEEAPN+SALKTDLDTNGTTATKHKGR KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

A0A6J1DDH4 uncharacterized protein LOC1110194923.1e-15589.23Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MA GLL WSS+SL S+SFPALSFSLSSS+ TQLS+SA++ET  AAADD QQLS RERR+LRNERREIKTTTNWREEVEERLC+KPKKEFA+WTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLARSLARNYPDLDFKIYYPSV+EKR+LKNGTY V P+AVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQ FLEKEQE+APN++  KTDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVET
        VTVGFTLFGKETLVDLDIGDI+VET
Subjt:  VTVGFTLFGKETLVDLDIGDIIVET

A0A6J1EV13 uncharacterized protein LOC1114382232.9e-15387.42Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MACGLL W+++ L S SFP+LSFSLSSS RTQLS+SA++ET   AADD  QLS RERR+LRNERRE K TTNWREEVEERLC+KPKKEFANWTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRV+RVR QEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTV PKAVFPGSVFIRC+MNKE+HDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVS+ DMEAIFKEAK+EQERHDQAFLEKE+E+APN S L+TDLDTNGTTATKHKGRPKKAVNTLSPGSTVRV+SGTFAEFEGSLKK+NRKS K
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLV+LDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

A0A6J1HQX6 uncharacterized protein LOC1114658894.2e-15287.12Show/hide
Query:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY
        MAC LL W+++SL S SFP+LSFSLSSS RTQLS+SA++ET   AADD  QLS RERR+LRNERRE K TTNWREEVEERLC+KPKKEFANWTEKLNLDY
Subjt:  MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK
        L+KLGPQWWVMRV+RVR QEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTV PKAVFPGSVFIRC+MNKE+HDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVS+ DMEAIFKEAK+EQERHDQAFLEK++E+APN S L+T LDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKK+NRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIIVETK
        VTVGFTLFGKETLV LDIGDIIVETK
Subjt:  VTVGFTLFGKETLVDLDIGDIIVETK

SwissProt top hitse value%identityAlignment
P35872 Transcription termination/antitermination protein NusG8.5e-0924.32Show/hide
Query:  QWWVMRVARVRSQEIVERLARSL-ARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMN-----KEIHDFIRECDGVGGFVGAKVGNTK
        +W+ +     + ++    L + + A    D  F++  P+ +     + G   V  K +FPG +FI+  +       E  + +R   G+ GFVGA +    
Subjt:  QWWVMRVARVRSQEIVERLARSL-ARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMN-----KEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
            +P P+S  ++  I                                L+ +G    K      KA      G  VRV SG FA+F G++ ++N + GK
Subjt:  RQINKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDII
        V V  T+FG+ET V+LD   ++
Subjt:  VTVGFTLFGKETLVDLDIGDII

P65591 Transcription termination/antitermination protein NusG6.5e-0922.02Show/hide
Query:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T++ +  +PG V +   M  +    ++    V GF+G +        
Subjt:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G FA+F G ++++N +  K+ V
Subjt:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDI
           +FG+ET V+L+   +
Subjt:  GFTLFGKETLVDLDIGDI

P65592 Transcription termination/antitermination protein NusG6.5e-0922.02Show/hide
Query:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T++ +  +PG V +   M  +    ++    V GF+G +        
Subjt:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G FA+F G ++++N +  K+ V
Subjt:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDI
           +FG+ET V+L+   +
Subjt:  GFTLFGKETLVDLDIGDI

Q06795 Transcription termination/antitermination protein NusG1.0e-0925.58Show/hide
Query:  WWVMRVARVRSQEIVERLARSL-ARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP
        W+V+        ++   L + + +    D  F++  P  +E+  +KNG   V  K VFPG V +  VM  +    +R   GV GFVG+         +KP
Subjt:  WWVMRVARVRSQEIVERLARSL-ARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP

Query:  KPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFT
         P+   + E I K    ++ + D  F  KE                                       TV+V  G FA F GS+++++    KV V   
Subjt:  KPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFT

Query:  LFGKETLVDLDIGDI
        +FG+ET V+L+   I
Subjt:  LFGKETLVDLDIGDI

Q9HWC4 Transcription termination/antitermination protein NusG5.0e-0920.18Show/hide
Query:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V+       + ++  L   +     + +F       +E  +++NG    + +  FPG V ++  MN+     +++   V GF+G          
Subjt:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        +KP P+++ + +AI +   D  +                                  K +PK       PG TVRV  G FA+F G ++++N +  ++ V
Subjt:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDI
           +FG+ T V+L+   +
Subjt:  GFTLFGKETLVDLDIGDI

Arabidopsis top hitse value%identityAlignment
AT3G09210.1 plastid transcriptionally active 131.9e-8852.49Show/hide
Query:  GLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAK
        GLL WS  SL  + +  ++       +TQ S++A V       +   QL+ +ERR+LRNERRE K   +WREEVEE+L +KPKK +A WTE+LNLD LA+
Subjt:  GLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
         GPQWW +RV+R+R  E  + LAR+LAR +P+++F +Y PSV+ KRKLKNG+ +V PK VFPG +FIRC++NKEIHD IR+ DGVGGF+G+KVGNTKRQI
Subjt:  LGPQWWVMRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLE--KEQEEA-----------PNTSALKTDLDT-------NGTTATKHKGRPKKAVNTLSPGSTVRVASGT
        NKP+PV ++D+EAIFK+AK+ QE+ D  F E  + +EEA            N+  ++T  ++         T AT+ K + KK    L+ GSTVRV SGT
Subjt:  NKPKPVSEADMEAIFKEAKDEQERHDQAFLE--KEQEEA-----------PNTSALKTDLDT-------NGTTATKHKGRPKKAVNTLSPGSTVRVASGT

Query:  FAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE
        FAEF G+LKKLNRK+ K TVGFTLFGKETLV++DI +++ E
Subjt:  FAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGTGGGCTTCTGGTTTGGAGCTCAGTTTCCCTTTGCTCTACCTCTTTCCCTGCACTTTCCTTCTCTCTCTCTTCTTCCAGACGTACCCAATTATCCGTCTCCGC
CTCCGTCGAAACCCCCGCCGCCGCCGCTGACGATGCTCAGCAGCTGTCGGTGAGGGAGCGGAGGAAGCTGAGGAACGAGAGAAGAGAGATCAAAACCACCACCAATTGGA
GGGAAGAAGTAGAGGAGAGGCTCTGCAGGAAGCCCAAGAAGGAATTTGCCAATTGGACTGAGAAGCTCAATCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTT
ATGCGTGTTGCTCGTGTTCGAAGCCAAGAAATTGTCGAACGCCTTGCTCGTTCTCTTGCTAGGAACTACCCTGACCTCGATTTTAAGATATATTACCCGTCGGTTAAGGA
GAAGAGGAAATTAAAGAATGGTACTTACACCGTTACACCAAAAGCTGTTTTTCCTGGATCTGTATTTATAAGGTGTGTCATGAACAAGGAGATTCATGACTTCATTAGAG
AGTGTGATGGAGTTGGAGGCTTTGTTGGTGCCAAGGTCGGAAACACTAAACGACAGATAAACAAACCAAAGCCAGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAG
GCAAAGGATGAGCAAGAAAGACACGACCAGGCTTTCCTAGAGAAAGAGCAAGAGGAAGCTCCAAATACTAGCGCACTCAAAACTGACTTAGATACAAATGGTACTACTGC
TACGAAGCACAAAGGAAGACCCAAAAAAGCTGTTAATACCTTGTCGCCAGGGTCAACAGTTCGAGTGGCATCAGGTACTTTTGCAGAATTTGAAGGCTCTCTTAAGAAGC
TGAACCGTAAAAGTGGAAAGGTAACTGTGGGATTTACACTATTTGGGAAGGAAACCCTTGTAGATCTTGACATTGGTGATATTATAGTAGAGACAAAATGA
mRNA sequenceShow/hide mRNA sequence
AACCAATCCGACAAATGGGAGTTTCAATAACGACCTCTCATTGGCGTTTATCGTCACCGGCGACTCCACACCCGACCCAATAATCGGAAACTTGTTTCCGTAAACAAAAT
GGCCTGTGGGCTTCTGGTTTGGAGCTCAGTTTCCCTTTGCTCTACCTCTTTCCCTGCACTTTCCTTCTCTCTCTCTTCTTCCAGACGTACCCAATTATCCGTCTCCGCCT
CCGTCGAAACCCCCGCCGCCGCCGCTGACGATGCTCAGCAGCTGTCGGTGAGGGAGCGGAGGAAGCTGAGGAACGAGAGAAGAGAGATCAAAACCACCACCAATTGGAGG
GAAGAAGTAGAGGAGAGGCTCTGCAGGAAGCCCAAGAAGGAATTTGCCAATTGGACTGAGAAGCTCAATCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTAT
GCGTGTTGCTCGTGTTCGAAGCCAAGAAATTGTCGAACGCCTTGCTCGTTCTCTTGCTAGGAACTACCCTGACCTCGATTTTAAGATATATTACCCGTCGGTTAAGGAGA
AGAGGAAATTAAAGAATGGTACTTACACCGTTACACCAAAAGCTGTTTTTCCTGGATCTGTATTTATAAGGTGTGTCATGAACAAGGAGATTCATGACTTCATTAGAGAG
TGTGATGGAGTTGGAGGCTTTGTTGGTGCCAAGGTCGGAAACACTAAACGACAGATAAACAAACCAAAGCCAGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGC
AAAGGATGAGCAAGAAAGACACGACCAGGCTTTCCTAGAGAAAGAGCAAGAGGAAGCTCCAAATACTAGCGCACTCAAAACTGACTTAGATACAAATGGTACTACTGCTA
CGAAGCACAAAGGAAGACCCAAAAAAGCTGTTAATACCTTGTCGCCAGGGTCAACAGTTCGAGTGGCATCAGGTACTTTTGCAGAATTTGAAGGCTCTCTTAAGAAGCTG
AACCGTAAAAGTGGAAAGGTAACTGTGGGATTTACACTATTTGGGAAGGAAACCCTTGTAGATCTTGACATTGGTGATATTATAGTAGAGACAAAATGAATGAATTTACT
TTCTCAGTCATGTAAAGTGTGGACGAAACTCATGGAGTTGGAGGATAAGCAAGCAGCTCCAGCTCTCAGCTCTCAAAGCATATTCATGAACCAAAGAACATAAAAGTACT
GTTGCTTGAATTGACAGACAGCCCACGTTGATGAACATGTCCACAAATCAGCTTTGCTTCTTTGAAATTACTCTGCATTCGGAGATTCTTACTGGAAATCAAATAGGATC
TTGAAGGAATTGGCAGCCAAATGTATTCTTGTTAAACTTTTGAGAAAATCAAACTAGGAAATATCATGATGAAATCATACCTGAGTAGCTCTTGTAAATTCAAATATCTC
TGAAGTCACAAATTCACCATAATTATAGAGGTGTTCAAGATTCGATTCGATCTACCCGTCAAACCGGACCAAACTAAAAAGTTTGATTTTGGATGTATACCAAGCCAAAC
CAACTTGTTTGTTTGCTCCAAAC
Protein sequenceShow/hide protein sequence
MACGLLVWSSVSLCSTSFPALSFSLSSSRRTQLSVSASVETPAAAADDAQQLSVRERRKLRNERREIKTTTNWREEVEERLCRKPKKEFANWTEKLNLDYLAKLGPQWWV
MRVARVRSQEIVERLARSLARNYPDLDFKIYYPSVKEKRKLKNGTYTVTPKAVFPGSVFIRCVMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSEADMEAIFKE
AKDEQERHDQAFLEKEQEEAPNTSALKTDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK