; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012404 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012404
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontranscription termination/antitermination protein NusG
Genome locationchr1:40835253..40837208
RNA-Seq ExpressionLag0012404
SyntenyLag0012404
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR006645 - NusG, N-terminal
IPR008991 - Translation protein SH3-like domain superfamily
IPR014722 - Ribosomal protein L2, domain 2
IPR036735 - NusG, N-terminal domain superfamily
IPR043425 - NusG-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147896.1 uncharacterized protein LOC101211195 [Cucumis sativus]1.2e-16190.8Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY
        MACGLL+WS +SL S SFP+LSFSLSSS+ TQLS+SA++ET   AADD QQLS RERR+LRNERRE+KTTTNWREEVEERLCRKPKKEFA WTEKLNLDY
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLAR LARNYPDLDFKIYYPSV+EKR+LKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQAFLEKEQEEAPN+S L+TDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVETK
        VTVGFTLFGKETLVDLDIGDI+VETK
Subjt:  VTVGFTLFGKETLVDLDIGDIVVETK

XP_008448915.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]9.7e-16493.25Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY
        MA GLLIWSPISL S S P+LSFSLSSS+ TQLSISA++ET   AADDVQQLSAR+RR+LRNERRE+KTTTNWREEVEERLCRKPKKEFATWTEKLNLDY
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSV+EKR+LKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSS L+TDLDTNGTTATKHKGR KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVETK
        VTVGFTLFGKETLVDLDIGDI+VETK
Subjt:  VTVGFTLFGKETLVDLDIGDIVVETK

XP_022151589.1 uncharacterized protein LOC111019492 [Momordica charantia]9.1e-16293.81Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET-AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLA
        MA GLL WS ISLRS SFP+LSFSLSSSKCTQLSISAALET AADDVQQLSARERRRLRNERRE+KTTTNWREEVEERLC+KPKKEFA+WTEKLNLDYLA
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET-AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLA

Query:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
        KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
Subjt:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ

Query:  INKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVT
        INKPKPVSEADMEAIFKEAKEEQERHDQ FLEKEQE+APNS++ +TDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEGSLKKLNRKSGKVT
Subjt:  INKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVT

Query:  VGFTLFGKETLVDLDIGDIVVET
        VGFTLFGKETLVDLDIGDIVVET
Subjt:  VGFTLFGKETLVDLDIGDIVVET

XP_022931956.1 uncharacterized protein LOC111438223 [Cucurbita moschata]5.0e-16091.64Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAK
        MACGLL W+ + LRSPSFPSLSFSLSSS  TQLSISAALETAADDV QLSARERRRLRNERRE K TTNWREEVEERLC+KPKKEFA WTEKLNLDYLAK
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        LGPQWWVMRV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKR+LKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVGNTKRQI
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        NKPKPVS+ DMEAIFKEAKEEQERHDQAFLEKE+E+APN SVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRV+SGTFAEFEGSLKK+NRKS KVTV
Subjt:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDIVVETK
        GFTLFGKETLV+LDIGDI+VETK
Subjt:  GFTLFGKETLVDLDIGDIVVETK

XP_038880828.1 transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida]1.4e-16593.85Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET--AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYL
        MACGLLIWSPISLRS SFP+LSFSLSS K TQLSISA +ET  AADD+QQLSARERR+LRNERRE+KTTTNWREEVEERLC+KPKKEFATWTEKLNLDYL
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET--AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKR
        +KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTYTVKPKA+FPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKR
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKR

Query:  QINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKV
        QINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQ+ APNSS LETDLDTNGTTATK KGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKV
Subjt:  QINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKV

Query:  TVGFTLFGKETLVDLDIGDIVVETK
        TVGFTLFGKETLVDLDIGDI+VETK
Subjt:  TVGFTLFGKETLVDLDIGDIVVETK

TrEMBL top hitse value%identityAlignment
A0A0A0KZV1 NGN domain-containing protein5.7e-16290.8Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY
        MACGLL+WS +SL S SFP+LSFSLSSS+ TQLS+SA++ET   AADD QQLS RERR+LRNERRE+KTTTNWREEVEERLCRKPKKEFA WTEKLNLDY
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLAR LARNYPDLDFKIYYPSV+EKR+LKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAK+EQERHDQAFLEKEQEEAPN+S L+TDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVETK
        VTVGFTLFGKETLVDLDIGDI+VETK
Subjt:  VTVGFTLFGKETLVDLDIGDIVVETK

A0A1S3BKU2 transcription termination/antitermination protein NusG4.7e-16493.25Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY
        MA GLLIWSPISL S S P+LSFSLSSS+ TQLSISA++ET   AADDVQQLSAR+RR+LRNERRE+KTTTNWREEVEERLCRKPKKEFATWTEKLNLDY
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET---AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSV+EKR+LKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSS L+TDLDTNGTTATKHKGR KKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVETK
        VTVGFTLFGKETLVDLDIGDI+VETK
Subjt:  VTVGFTLFGKETLVDLDIGDIVVETK

A0A6J1DDH4 uncharacterized protein LOC1110194924.4e-16293.81Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET-AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLA
        MA GLL WS ISLRS SFP+LSFSLSSSKCTQLSISAALET AADDVQQLSARERRRLRNERRE+KTTTNWREEVEERLC+KPKKEFA+WTEKLNLDYLA
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALET-AADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLA

Query:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
        KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
Subjt:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ

Query:  INKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVT
        INKPKPVSEADMEAIFKEAKEEQERHDQ FLEKEQE+APNS++ +TDLDTNGTTATK KGR KKAVN LSPGSTVRVASGTFAEFEGSLKKLNRKSGKVT
Subjt:  INKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVT

Query:  VGFTLFGKETLVDLDIGDIVVET
        VGFTLFGKETLVDLDIGDIVVET
Subjt:  VGFTLFGKETLVDLDIGDIVVET

A0A6J1EV13 uncharacterized protein LOC1114382232.4e-16091.64Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAK
        MACGLL W+ + LRSPSFPSLSFSLSSS  TQLSISAALETAADDV QLSARERRRLRNERRE K TTNWREEVEERLC+KPKKEFA WTEKLNLDYLAK
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        LGPQWWVMRV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKR+LKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVGNTKRQI
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        NKPKPVS+ DMEAIFKEAKEEQERHDQAFLEKE+E+APN SVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRV+SGTFAEFEGSLKK+NRKS KVTV
Subjt:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDIVVETK
        GFTLFGKETLV+LDIGDI+VETK
Subjt:  GFTLFGKETLVDLDIGDIVVETK

A0A6J1HQX6 uncharacterized protein LOC1114658893.5e-15991.33Show/hide
Query:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAK
        MAC LL W+ +SLRSPSFPSLSFSLSSS  TQLSISAALETAADDV QLSARERRRLRNERRE K TTNWREEVEERLC+KPKKEFA WTEKLNLDYL+K
Subjt:  MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        LGPQWWVMRV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKR+LKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVGNTKRQI
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        NKPKPVS+ DMEAIFKEAKEEQERHDQAFLEK++E+APN SVLET LDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKK+NRKSGKVTV
Subjt:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDIVVETK
        GFTLFGKETLV LDIGDI+VETK
Subjt:  GFTLFGKETLVDLDIGDIVVETK

SwissProt top hitse value%identityAlignment
P35872 Transcription termination/antitermination protein NusG1.1e-0827.46Show/hide
Query:  DLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMN-----KEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSEADMEAIFKEAKEEQERHD
        D  F++  P+ +     + G   V  K +FPG +FI+  +       E  + +R   G+ GFVGA +        +P P+S  ++  I + +        
Subjt:  DLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMN-----KEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSEADMEAIFKEAKEEQERHD

Query:  QAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIV
           L K  +EAP + V                            G  VRV SG FA+F G++ ++N + GKV V  T+FG+ET V+LD   +V
Subjt:  QAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIV

P65591 Transcription termination/antitermination protein NusG1.9e-0822.02Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G +        
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G FA+F G ++++N +  K+ V
Subjt:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDI
           +FG+ET V+L+   +
Subjt:  GFTLFGKETLVDLDIGDI

P65592 Transcription termination/antitermination protein NusG1.9e-0822.02Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G +        
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G FA+F G ++++N +  K+ V
Subjt:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDI
           +FG+ET V+L+   +
Subjt:  GFTLFGKETLVDLDIGDI

Q06795 Transcription termination/antitermination protein NusG3.7e-0927.27Show/hide
Query:  DLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSEADMEAIFKEAKEEQERHDQAFLE
        D  F++  P  +E+  +KNG   V  K VFPG V +  +M  +    +R   GV GFVG+         +KP P+   + E I K    ++ + D  F  
Subjt:  DLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSEADMEAIFKEAKEEQERHDQAFLE

Query:  KEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        KE                                       TV+V  G FA F GS+++++    KV V   +FG+ET V+L+   I
Subjt:  KEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

Q9HWC4 Transcription termination/antitermination protein NusG4.1e-0819.72Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        +  +W+V+       + ++  L   +     + +F       +E   ++NG      +  FPG V ++  MN+     +++   V GF+G          
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV
        +KP P+++ + +AI +   +  +                                  K +PK       PG TVRV  G FA+F G ++++N +  ++ V
Subjt:  NKPKPVSEADMEAIFKEAKEEQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDI
           +FG+ T V+L+   +
Subjt:  GFTLFGKETLVDLDIGDI

Arabidopsis top hitse value%identityAlignment
AT3G09210.1 plastid transcriptionally active 139.6e-9356.51Show/hide
Query:  GLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGP
        GLL WS    RS   PS+   ++    TQ SI+A +    +   QL+A+ERR+LRNERRE K   +WREEVEE+L +KPKK +ATWTE+LNLD LA+ GP
Subjt:  GLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGP

Query:  QWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP
        QWW +RV+R+RG E  + LAR+LAR +P+++F +Y PSVQ KR+LKNG+ +VKPK VFPG +FIRCI+NKEIHD IR+ DGVGGF+G+KVGNTKRQINKP
Subjt:  QWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP

Query:  KPVSEADMEAIFKEAKEEQERHDQAFLE--KEQEEA-----------PNSSVLETDLDT-------NGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAE
        +PV ++D+EAIFK+AKE QE+ D  F E  + +EEA            NS V+ET  ++         T AT+ K + KK    L+ GSTVRV SGTFAE
Subjt:  KPVSEADMEAIFKEAKEEQERHDQAFLE--KEQEEA-----------PNSSVLETDLDT-------NGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAE

Query:  FEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVE
        F G+LKKLNRK+ K TVGFTLFGKETLV++DI ++V E
Subjt:  FEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGTGGGCTTCTGATTTGGAGTCCAATTTCTCTTCGCTCTCCCTCTTTCCCTTCCCTTTCCTTCTCCCTCTCATCTTCCAAATGCACCCAGTTATCAATCTCCGC
CGCCCTCGAAACCGCCGCCGACGATGTCCAGCAGCTTTCGGCTCGGGAGAGGAGGAGGCTGAGGAACGAGAGGAGAGAGGTCAAAACCACTACCAATTGGAGAGAAGAAG
TGGAAGAGAGGCTCTGCAGGAAGCCCAAGAAGGAATTTGCTACTTGGACTGAGAAGCTCAACCTCGATTACCTCGCTAAATTGGGCCCTCAATGGTGGGTTATGCGTGTC
GCTCGTGTTAGGGGGCAGGAAATTGTCGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTCGATTTCAAGATATATTACCCATCGGTCCAGGAGAAGAGGAG
ATTAAAGAATGGTACTTACACGGTTAAACCGAAAGCTGTATTTCCTGGATCTGTATTTATAAGGTGTATCATGAACAAGGAGATACATGACTTCATTAGAGAGTGTGATG
GAGTTGGAGGCTTCGTTGGTGCGAAGGTCGGAAACACGAAACGACAGATAAATAAACCAAAGCCAGTGTCTGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAA
GAGCAAGAAAGACATGACCAGGCATTTCTAGAGAAAGAGCAAGAGGAAGCTCCAAACTCTAGCGTGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTACAAAGCA
CAAAGGTAGACCGAAAAAAGCTGTTAATACTTTATCTCCAGGGTCAACGGTTCGGGTGGCATCTGGGACTTTTGCAGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTA
AAAGTGGAAAGGTAACTGTGGGATTCACACTATTTGGGAAGGAAACCCTTGTAGACCTTGACATTGGCGATATTGTAGTGGAGACGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTGTGGGCTTCTGATTTGGAGTCCAATTTCTCTTCGCTCTCCCTCTTTCCCTTCCCTTTCCTTCTCCCTCTCATCTTCCAAATGCACCCAGTTATCAATCTCCGC
CGCCCTCGAAACCGCCGCCGACGATGTCCAGCAGCTTTCGGCTCGGGAGAGGAGGAGGCTGAGGAACGAGAGGAGAGAGGTCAAAACCACTACCAATTGGAGAGAAGAAG
TGGAAGAGAGGCTCTGCAGGAAGCCCAAGAAGGAATTTGCTACTTGGACTGAGAAGCTCAACCTCGATTACCTCGCTAAATTGGGCCCTCAATGGTGGGTTATGCGTGTC
GCTCGTGTTAGGGGGCAGGAAATTGTCGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTCGATTTCAAGATATATTACCCATCGGTCCAGGAGAAGAGGAG
ATTAAAGAATGGTACTTACACGGTTAAACCGAAAGCTGTATTTCCTGGATCTGTATTTATAAGGTGTATCATGAACAAGGAGATACATGACTTCATTAGAGAGTGTGATG
GAGTTGGAGGCTTCGTTGGTGCGAAGGTCGGAAACACGAAACGACAGATAAATAAACCAAAGCCAGTGTCTGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAA
GAGCAAGAAAGACATGACCAGGCATTTCTAGAGAAAGAGCAAGAGGAAGCTCCAAACTCTAGCGTGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTACAAAGCA
CAAAGGTAGACCGAAAAAAGCTGTTAATACTTTATCTCCAGGGTCAACGGTTCGGGTGGCATCTGGGACTTTTGCAGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTA
AAAGTGGAAAGGTAACTGTGGGATTCACACTATTTGGGAAGGAAACCCTTGTAGACCTTGACATTGGCGATATTGTAGTGGAGACGAAGTGA
Protein sequenceShow/hide protein sequence
MACGLLIWSPISLRSPSFPSLSFSLSSSKCTQLSISAALETAADDVQQLSARERRRLRNERREVKTTTNWREEVEERLCRKPKKEFATWTEKLNLDYLAKLGPQWWVMRV
ARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSEADMEAIFKEAKE
EQERHDQAFLEKEQEEAPNSSVLETDLDTNGTTATKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVETK