; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016621 (gene) of Snake gourd v1 genome

Gene IDTan0016621
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription termination/antitermination protein NusG
Genome locationLG03:64652665..64656045
RNA-Seq ExpressionTan0016621
SyntenyTan0016621
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR006645 - NusG, N-terminal
IPR008991 - Translation protein SH3-like domain superfamily
IPR014722 - Ribosomal protein L2, domain 2
IPR036735 - NusG, N-terminal domain superfamily
IPR043425 - NusG-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147896.1 uncharacterized protein LOC101211195 [Cucumis sativus]1.2e-15888.96Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MACGLL WS +SL S  FPALSFSLSSS+RTQLS+SA++ET   AADD Q LS RERR+LRNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLAR LARNYPD DFKIYYPSV+EKR+LKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSE DMEAIFKEAK+EQERHDQAFLEKEQEEA N+S LKTDLDTNGT ATKHKGRPKKAVNTLSPGSTV+VASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVKTQ
        VTVGFTLFGKETLVDLDIGDI+V+T+
Subjt:  VTVGFTLFGKETLVDLDIGDIVVKTQ

XP_008448915.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]1.7e-16091.1Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MA GLL WSPISL S   PALSFSLSSS+RTQLSISA++ET   AADDVQ LSAR+RR+LRNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDY
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPD DFKIYYPSV+EKR+LKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSE DMEAIFKEAKEEQERHDQAFLEKEQEEA NSS LKTDLDTNGT ATKHKGR KKAVNTLSPGSTV+VASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVKTQ
        VTVGFTLFGKETLVDLDIGDI+V+T+
Subjt:  VTVGFTLFGKETLVDLDIGDIVVKTQ

XP_022151589.1 uncharacterized protein LOC111019492 [Momordica charantia]4.7e-15892.26Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET-AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLA
        MA GLL WS ISLRS  FPALSFSLSSSK TQLSISAALET AADDVQ LSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYLA
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET-AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLA

Query:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
        KLGPQWWVMRVARVRGQEIVERLARSLARNYPD DFKIYYPSVQEKRRLKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
Subjt:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ

Query:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT
        INKPKPVSE DMEAIFKEAKEEQERHDQ FLEKEQE+A NS++ KTDLDTNGT ATK KGR KKAVN LSPGSTV+VASGTFAEFEGSLKKLNRKSGKVT
Subjt:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT

Query:  VGFTLFGKETLVDLDIGDIVVKT
        VGFTLFGKETLVDLDIGDIVV+T
Subjt:  VGFTLFGKETLVDLDIGDIVVKT

XP_022931956.1 uncharacterized protein LOC111438223 [Cucurbita moschata]4.0e-15789.47Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAK
        MACGLLTW+ + LRSP FP+LSFSLSSS RTQLSISAALETAADDV  LSARERRRLRNERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYLAK
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        LGPQWWVMRV+RVRGQEIVERLARSLARNYPD DFKIYYPSV EKR+LKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVGNTKRQI
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTV
        NKPKPVS+ DMEAIFKEAKEEQERHDQAFLEKE+E+A N SVL+TDLDTNGT ATKHKGRPKKAVNTLSPGSTV+V+SGTFAEFEGSLKK+NRKS KVTV
Subjt:  NKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDIVVKTQ
        GFTLFGKETLV+LDIGDI+V+T+
Subjt:  GFTLFGKETLVDLDIGDIVVKTQ

XP_038880828.1 transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida]4.1e-16291.69Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET--AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLL WSPISLRS  FPALSFSLSS KRTQLSISA +ET  AADD+Q LSARERR+LRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET--AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKR
        +KLGPQWWVMRVARVRGQEIVERLARSLARNYPD DFKIYYPSVQEKR+LKNGTYTVKPKA+FPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKR
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKR

Query:  QINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKV
        QINKPKPVSE DMEAIFKEAKEEQERHDQAFLEKEQ+ A NSS L+TDLDTNGT ATK KGRPKKAVNTLSPGSTV+VASGTFAEFEGSLKKLNRKSGKV
Subjt:  QINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKV

Query:  TVGFTLFGKETLVDLDIGDIVVKTQ
        TVGFTLFGKETLVDLDIGDI+V+T+
Subjt:  TVGFTLFGKETLVDLDIGDIVVKTQ

TrEMBL top hitse value%identityAlignment
A0A0A0KZV1 NGN domain-containing protein5.9e-15988.96Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MACGLL WS +SL S  FPALSFSLSSS+RTQLS+SA++ET   AADD Q LS RERR+LRNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVR QEIVERLAR LARNYPD DFKIYYPSV+EKR+LKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSE DMEAIFKEAK+EQERHDQAFLEKEQEEA N+S LKTDLDTNGT ATKHKGRPKKAVNTLSPGSTV+VASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVKTQ
        VTVGFTLFGKETLVDLDIGDI+V+T+
Subjt:  VTVGFTLFGKETLVDLDIGDIVVKTQ

A0A1S3BKU2 transcription termination/antitermination protein NusG8.3e-16191.1Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MA GLL WSPISL S   PALSFSLSSS+RTQLSISA++ET   AADDVQ LSAR+RR+LRNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDY
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET---AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK
        LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPD DFKIYYPSV+EKR+LKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVGAKVGNTK
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTK

Query:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK
        RQINKPKPVSE DMEAIFKEAKEEQERHDQAFLEKEQEEA NSS LKTDLDTNGT ATKHKGR KKAVNTLSPGSTV+VASGTFAEFEGSLKKLNRKSGK
Subjt:  RQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGK

Query:  VTVGFTLFGKETLVDLDIGDIVVKTQ
        VTVGFTLFGKETLVDLDIGDI+V+T+
Subjt:  VTVGFTLFGKETLVDLDIGDIVVKTQ

A0A6J1DDH4 uncharacterized protein LOC1110194922.3e-15892.26Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET-AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLA
        MA GLL WS ISLRS  FPALSFSLSSSK TQLSISAALET AADDVQ LSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYLA
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALET-AADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLA

Query:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
        KLGPQWWVMRVARVRGQEIVERLARSLARNYPD DFKIYYPSVQEKRRLKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
Subjt:  KLGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ

Query:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT
        INKPKPVSE DMEAIFKEAKEEQERHDQ FLEKEQE+A NS++ KTDLDTNGT ATK KGR KKAVN LSPGSTV+VASGTFAEFEGSLKKLNRKSGKVT
Subjt:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT

Query:  VGFTLFGKETLVDLDIGDIVVKT
        VGFTLFGKETLVDLDIGDIVV+T
Subjt:  VGFTLFGKETLVDLDIGDIVVKT

A0A6J1EV13 uncharacterized protein LOC1114382231.9e-15789.47Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAK
        MACGLLTW+ + LRSP FP+LSFSLSSS RTQLSISAALETAADDV  LSARERRRLRNERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYLAK
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        LGPQWWVMRV+RVRGQEIVERLARSLARNYPD DFKIYYPSV EKR+LKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVGNTKRQI
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTV
        NKPKPVS+ DMEAIFKEAKEEQERHDQAFLEKE+E+A N SVL+TDLDTNGT ATKHKGRPKKAVNTLSPGSTV+V+SGTFAEFEGSLKK+NRKS KVTV
Subjt:  NKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDIVVKTQ
        GFTLFGKETLV+LDIGDI+V+T+
Subjt:  GFTLFGKETLVDLDIGDIVVKTQ

A0A6J1HQX6 uncharacterized protein LOC1114658892.8e-15689.16Show/hide
Query:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAK
        MAC LLTW+ +SLRSP FP+LSFSLSSS RTQLSISAALETAADDV  LSARERRRLRNERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYL+K
Subjt:  MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAK

Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI
        LGPQWWVMRV+RVRGQEIVERLARSLARNYPD DFKIYYPSV EKR+LKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVGAKVGNTKRQI
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQI

Query:  NKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTV
        NKPKPVS+ DMEAIFKEAKEEQERHDQAFLEK++E+A N SVL+T LDTNGT ATKHKGRPKKAVNTLSPGSTV+VASGTFAEFEGSLKK+NRKSGKVTV
Subjt:  NKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTV

Query:  GFTLFGKETLVDLDIGDIVVKTQ
        GFTLFGKETLV LDIGDI+V+T+
Subjt:  GFTLFGKETLVDLDIGDIVVKTQ

SwissProt top hitse value%identityAlignment
P29397 Transcription termination/antitermination protein NusG5.4e-0827.71Show/hide
Query:  YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGT
        Y  K + +FPG VF+  IMN E ++F+R    V GFV +          +P PV + +M  I + A  E           E EE                
Subjt:  YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGT

Query:  IATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
             K +P K       G  V++ SG F +F G +K+++ +  ++ V  T+FG+ET V L + ++
Subjt:  IATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

P35872 Transcription termination/antitermination protein NusG7.1e-0826.84Show/hide
Query:  FKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMN-----KEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKEEQERHDQAF
        F++  P+ +     + G   V  K +FPG +FI+  +       E  + +R   G+ GFVGA +        +P P+S             ++ RH    
Subjt:  FKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMN-----KEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKEEQERHDQAF

Query:  LEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIV
                         L+ +G +  K      KA      G  V+V SG FA+F G++ ++N + GKV V  T+FG+ET V+LD   +V
Subjt:  LEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIV

P65591 Transcription termination/antitermination protein NusG3.2e-0822.37Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLAR-NYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
        +  +W+V++      + +   L   +AR    D+  +I  P V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G +       
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLAR-NYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ

Query:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT
         N+P P+S+ + E I ++         Q  +EK                            PK  V     G  V+V  G FA+F G ++++N +  K+ 
Subjt:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT

Query:  VGFTLFGKETLVDLDIGDI
        V   +FG+ET V+L+   +
Subjt:  VGFTLFGKETLVDLDIGDI

P65592 Transcription termination/antitermination protein NusG3.2e-0822.37Show/hide
Query:  LGPQWWVMRVARVRGQEIVERLARSLAR-NYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ
        +  +W+V++      + +   L   +AR    D+  +I  P V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G +       
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLAR-NYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQ

Query:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT
         N+P P+S+ + E I ++         Q  +EK                            PK  V     G  V+V  G FA+F G ++++N +  K+ 
Subjt:  INKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVT

Query:  VGFTLFGKETLVDLDIGDI
        V   +FG+ET V+L+   +
Subjt:  VGFTLFGKETLVDLDIGDI

Q06795 Transcription termination/antitermination protein NusG1.4e-0827.17Show/hide
Query:  FKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQ
        F++  P  +E+  +KNG   V  K VFPG V +  +M  +    +R   GV GFVG+         +KP P+   + E I K    ++ + D  F  KE 
Subjt:  FKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKEEQERHDQAFLEKEQ

Query:  EEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
                                              TV+V  G FA F GS+++++    KV V   +FG+ET V+L+   I
Subjt:  EEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

Arabidopsis top hitse value%identityAlignment
AT3G09210.1 plastid transcriptionally active 133.3e-9355.59Show/hide
Query:  GLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGP
        GLL WS    RS   P++   ++   +TQ SI+A +    +    L+A+ERR+LRNERRE K   +WREEVEE+L KKPKK +ATWTE+LNLD LA+ GP
Subjt:  GLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGP

Query:  QWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP
        QWW +RV+R+RG E  + LAR+LAR +P+ +F +Y PSVQ KR+LKNG+ +VKPK VFPG +FIRCI+NKEIHD IR+ DGVGGF+G+KVGNTKRQINKP
Subjt:  QWWVMRVARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKP

Query:  KPVSELDMEAIFKEAKEEQERHDQAFLE--KEQEEA-----------SNSSVLKTDLDT-------NGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAE
        +PV + D+EAIFK+AKE QE+ D  F E  + +EEA           SNS V++T  ++         T+AT+ K + KK    L+ GSTV+V SGTFAE
Subjt:  KPVSELDMEAIFKEAKEEQERHDQAFLE--KEQEEA-----------SNSSVLKTDLDT-------NGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAE

Query:  FEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVKTQ
        F G+LKKLNRK+ K TVGFTLFGKETLV++DI ++V + Q
Subjt:  FEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVKTQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGTGGCCTTCTGACTTGGAGTCCAATTTCTCTTCGCTCTCCCTGTTTCCCTGCCCTTTCCTTCTCTCTCTCTTCTTCCAAACGCACCCAATTATCAATCTCCGC
CGCCCTCGAAACCGCCGCCGACGATGTCCAGCATCTGTCGGCGCGGGAGAGGAGGAGGCTGAGGAACGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAG
TGGAAGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTCGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTATGCGTGTC
GCTCGTGTTAGAGGTCAAGAAATTGTCGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACTTCGATTTCAAGATATATTACCCGTCTGTCCAGGAGAAGAGGAG
ATTAAAGAATGGTACTTACACGGTTAAACCGAAAGCTGTCTTTCCTGGATCTGTATTTATAAGGTGTATCATGAACAAGGAGATACATGACTTCATTAGAGAGTGTGATG
GAGTTGGAGGCTTTGTTGGTGCGAAGGTCGGAAACACGAAACGGCAGATAAACAAACCAAAGCCAGTGTCTGAACTTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAA
GAGCAAGAAAGACATGACCAGGCTTTTCTAGAGAAAGAGCAAGAGGAAGCTTCAAACTCTAGCGTGCTCAAGACTGACTTGGATACAAATGGTACTATTGCTACAAAGCA
CAAAGGAAGACCAAAAAAAGCTGTTAATACTTTGTCGCCAGGGTCAACCGTTCAGGTGGCATCTGGGACTTTTGCAGAATTTGAAGGCTCTCTTAAGAAGTTGAACCGTA
AGAGTGGAAAGGTAACTGTGGGATTCACACTATTTGGGAAGGAAACCCTTGTAGACCTTGACATTGGTGATATTGTAGTCAAGACGCAGTGA
mRNA sequenceShow/hide mRNA sequence
CGACAAATGGGAGTTGCAATAACGACCACTTGTCGGCGTTATCGTCACCGGTGACTCAACAGCCGAGCCAGTAATCGGAAACTGGTTTCCGTAAAGACGATGGCCTGTGG
CCTTCTGACTTGGAGTCCAATTTCTCTTCGCTCTCCCTGTTTCCCTGCCCTTTCCTTCTCTCTCTCTTCTTCCAAACGCACCCAATTATCAATCTCCGCCGCCCTCGAAA
CCGCCGCCGACGATGTCCAGCATCTGTCGGCGCGGGAGAGGAGGAGGCTGAGGAACGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTGGAAGAGAGG
CTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTCGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTATGCGTGTCGCTCGTGTTAG
AGGTCAAGAAATTGTCGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACTTCGATTTCAAGATATATTACCCGTCTGTCCAGGAGAAGAGGAGATTAAAGAATG
GTACTTACACGGTTAAACCGAAAGCTGTCTTTCCTGGATCTGTATTTATAAGGTGTATCATGAACAAGGAGATACATGACTTCATTAGAGAGTGTGATGGAGTTGGAGGC
TTTGTTGGTGCGAAGGTCGGAAACACGAAACGGCAGATAAACAAACCAAAGCCAGTGTCTGAACTTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGAAAG
ACATGACCAGGCTTTTCTAGAGAAAGAGCAAGAGGAAGCTTCAAACTCTAGCGTGCTCAAGACTGACTTGGATACAAATGGTACTATTGCTACAAAGCACAAAGGAAGAC
CAAAAAAAGCTGTTAATACTTTGTCGCCAGGGTCAACCGTTCAGGTGGCATCTGGGACTTTTGCAGAATTTGAAGGCTCTCTTAAGAAGTTGAACCGTAAGAGTGGAAAG
GTAACTGTGGGATTCACACTATTTGGGAAGGAAACCCTTGTAGACCTTGACATTGGTGATATTGTAGTCAAGACGCAGTGAAGCAAATGACTTTCTCATTGATGTAAAGG
GGAATAAGCTCATGGAGATGGAGGATAAGTAAGCAGCTCCAGCTCTCAGCTCACAGGCACATCCATGAACCAATGAATTGAAGCGCTATGTTGCTTGAATTGCCAACAGC
TCTCACTTTGATGAACATGTCTAGAGATCAGCTGTGGTTCTTGAAATTTACTCTGCATTTGGAAATTCTGAATGGAAATCAACGAGGGTCTTGAAGGAATTGGCAGAAGC
CCAAATTGTATTTTTGTTTTGACTTTTGAGAAACCAAAATGGGGAAATATTATATCCTTGGGAACTTTCAGAATCAATTTTTTTTGGAAAAGGTAATATGGAATAGTGTT
GATCTAGCATTGTTGATAGCCAAAACCATCATGGCACGACAAGGTATAAATTCAGATGTCAAAGTATTCTATAAATTTCTAAAACTCTAGTGGTTGTTTGGGCCGTTGAG
TGGATTGTAATAATAGGGTTTATAATAATATGTGGGTTATTATAATCTGTGGAATGATACAATATTATTTAAAATACGGAG
Protein sequenceShow/hide protein sequence
MACGLLTWSPISLRSPCFPALSFSLSSSKRTQLSISAALETAADDVQHLSARERRRLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVMRV
ARVRGQEIVERLARSLARNYPDFDFKIYYPSVQEKRRLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGAKVGNTKRQINKPKPVSELDMEAIFKEAKE
EQERHDQAFLEKEQEEASNSSVLKTDLDTNGTIATKHKGRPKKAVNTLSPGSTVQVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIVVKTQ