; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G07680 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G07680
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptiontranscription termination/antitermination protein NusG
Genome locationClcChr04:21458223..21461218
RNA-Seq ExpressionClc04G07680
SyntenyClc04G07680
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domainsIPR006645 - NusG, N-terminal
IPR008991 - Translation protein SH3-like domain superfamily
IPR014722 - Ribosomal protein L2, domain 2
IPR036735 - NusG, N-terminal domain superfamily
IPR043425 - NusG-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147896.1 uncharacterized protein LOC101211195 [Cucumis sativus]1.8e-15283.77Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MACGLL WS +SL STSF ALSF LSSS+RTQLS+SA+VE P +AAD+ QQLS RERRKLRNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWV+RVARVR QEIVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAK+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

XP_008448915.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo]2.6e-15485.8Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MA GLL WSPISL STS  ALSF LSSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKLRNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWV+RVARVRGQEIVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

XP_022151589.1 uncharacterized protein LOC111019492 [Momordica charantia]3.0e-14783.67Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MA GLL WS ISLRS+SF ALSF LSSSK TQLSISAA+E  +AAD+VQQLSARERR+LRNERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWV+RVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN LSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET
        TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET

XP_022931956.1 uncharacterized protein LOC111438223 [Cucurbita moschata]5.7e-14681.98Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLLTW+ + LRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LRNERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWV+RV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVS+ DMEAIFKEAKEEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNTLSPGSTVRV+SG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

XP_038880828.1 transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida]1.7e-15887.21Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLL WSPISLRS SF ALSF LSS KRTQLSISA VE PSAAD++QQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        +KLGPQWWV+RVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKA+FPGSVFIRCIMNKEIHDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQAFLEKEQ++A NS ALETDLDTNGTTA K KGRPKKAVNTLSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

TrEMBL top hitse value%identityAlignment
A0A0A0KZV1 NGN domain-containing protein8.9e-15383.77Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MACGLL WS +SL STSF ALSF LSSS+RTQLS+SA+VE P +AAD+ QQLS RERRKLRNERREIKTTTNWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWV+RVARVR QEIVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAK+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

A0A1S3BKU2 transcription termination/antitermination protein NusG1.2e-15485.8Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY
        MA GLL WSPISL STS  ALSF LSSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKLRNERREIKTTTNWREEVEERLC+KPKKEFATWTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWV+RVARVRGQEIVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

A0A6J1DDH4 uncharacterized protein LOC1110194921.5e-14783.67Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MA GLL WS ISLRS+SF ALSF LSSSK TQLSISAA+E  +AAD+VQQLSARERR+LRNERREIKTTTNWREEVEERLCKKPKKEFA+WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWV+RVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN LSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET
        TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET

A0A6J1EV13 uncharacterized protein LOC1114382232.8e-14681.98Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLLTW+ + LRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LRNERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWV+RV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVS+ DMEAIFKEAKEEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNTLSPGSTVRV+SG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

A0A6J1HQX6 uncharacterized protein LOC1114658894.0e-14581.69Show/hide
Query:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MAC LLTW+ +SLRS SF +LSF LSSS RTQLSISAA+E  +AAD+V QLSARERR+LRNERRE K TTNWREEVEERLCKKPKKEFA WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        +KLGPQWWV+RV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVS+ DMEAIFKEAKEEQ RHDQAFLEK++EQA N   LET LDTNGTTA KHKGRPKKAVNTLSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKK+NRKSGKVTVGFTLFGKETLV LDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

SwissProt top hitse value%identityAlignment
P29397 Transcription termination/antitermination protein NusG8.3e-0723.24Show/hide
Query:  YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKE
        Y  K + +FPG VF+  IMN E ++F+R    V GFV +                             +P PV + +M  I + A               
Subjt:  YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKE

Query:  QEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
                         G    + K +P K       G  V++ SG F +F G +K+++ +  ++ V  T+FG+ET V L + ++
Subjt:  QEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

P65591 Transcription termination/antitermination protein NusG3.2e-0620.25Show/hide
Query:  LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G          
Subjt:  LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD

Query:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF
                         + N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G F
Subjt:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF

Query:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        A+F G ++++N +  K+ V   +FG+ET V+L+   +
Subjt:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

P65592 Transcription termination/antitermination protein NusG3.2e-0620.25Show/hide
Query:  LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G          
Subjt:  LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD

Query:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF
                         + N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G F
Subjt:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF

Query:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        A+F G ++++N +  K+ V   +FG+ET V+L+   +
Subjt:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

Q06795 Transcription termination/antitermination protein NusG1.7e-0724.76Show/hide
Query:  DLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADME
        D  F++  P  +E+  +KNG   V  K VFPG V +  +M  +    +R   GV GFVG+  +                         +KP P+   + E
Subjt:  DLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADME

Query:  AIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVD
         I K    ++ + D  F  KE                                       TV+V  G FA F GS+++++    KV V   +FG+ET V+
Subjt:  AIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVD

Query:  LDIGDI
        L+   I
Subjt:  LDIGDI

Q9HWC4 Transcription termination/antitermination protein NusG1.4e-0618.99Show/hide
Query:  LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD
        +  +W+V+       + ++  L   +     + +F       +E  +++NG      +  FPG V ++  MN+     +++   V GF+G          
Subjt:  LGPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD

Query:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF
                           +KP P+++ + +AI +                   + ++SG                K +PK       PG TVRV  G F
Subjt:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF

Query:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        A+F G ++++N +  ++ V   +FG+ T V+L+   +
Subjt:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

Arabidopsis top hitse value%identityAlignment
AT3G09210.1 plastid transcriptionally active 139.2e-8651.25Show/hide
Query:  GLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKL
        GLL WS    RS+   ++  P++   +TQ SI+A V      +   QL+A+ERR+LRNERRE K   +WREEVEE+L KKPKK +ATWTE+LNLD LA+ 
Subjt:  GLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKL

Query:  GPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDY
        GPQWW +RV+R+RG E  + LAR+LAR +P+++F +Y PSVQ KRKLKNG+ +VKPK VFPG +FIRCI+NKEIHD IR+ DGVGGF+G++         
Subjt:  GPQWWVLRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDY

Query:  AIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLE--KEQEQA-----------SNSGALETDLDT-------NGTTAIKHKGRPK
                     +KRQINKP+PV ++D+EAIFK+AKE Q + D  F E  + +E+A           SNS  +ET  ++         T A + K + K
Subjt:  AIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLE--KEQEQA-----------SNSGALETDLDT-------NGTTAIKHKGRPK

Query:  KAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE
        K    L+ GSTVRV SGTFAEF G+LKKLNRK+ K TVGFTLFGKETLV++DI +++ E
Subjt:  KAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGC
TGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGG
AAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTG
CGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAA
GAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGT
GTGATGGTGTTGGAGGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAG
ATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCA
AGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAA
CTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACC
CTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACAAAATGA
mRNA sequenceShow/hide mRNA sequence
CTTCATCAACCAAACGACAAATGGGAGTTGCAATAACGACCACCCATCGGCGTTTATCGTCACCGGTGACTCCACAGCCGAGCCAGTAATCGGAAAATGGCCTGTGGGCT
TCTGACTTGGAGCCCAATTTCTCTTCGCTCAACTTCTTTCCGTGCCCTTTCCTTCCCTCTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCTGCCGTCGAAATCC
CCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTACCAATTGGAGGGAAGAAGTAGAGGAG
AGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTTTGCGTGTGGCTCGTGT
TAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGA
ATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATCCATGACTTCATTAGAGAGTGTGATGGTGTTGGA
GGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAA
GCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTG
GCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAACTGTTCGGGTGGCG
TCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGA
CATTGGTGATATTATTGTAGAGACAAAATGAGTGAATGTACTTTCTCAGCCATGTGAAGTGTGGATGAAGCTCATGGATTTGGAGGATAAGCAAGCAGCTCCAGCTCCAG
CTCTCAGCTCTCAACTCATAAGCACGTTCACGAACCAATGAACACAAAAGTACTTTTGCTTGAATTGCCAAACAGCTTACTTTGATGAACATGTCCACAAATCAGCTGTG
CTTTTTTGAAATTACTCTGCATTTGGAGATTCTTATTGGAAATCAACCAAGGTCTTGAAGGAACTGGCTGAAGCCCAAATGTATTATTGTTAGACTATTCAGAAACCAAA
ATAGGGAATATCATGATGAACCATAGATAGCAATTGTAAATTCAAATGTTTCTGAAGTCAGAAATTTTAATCATGATTGTCTTAATTCAGGAAAATATTCTATAAACTAG
CTATGTTCTAAAGTATTTCCAAAATTGTGTTACCATTAAAATCATAGTAATTTTTGAGCTCCTATTCATTGTGGAC
Protein sequenceShow/hide protein sequence
MACGLLTWSPISLRSTSFRALSFPLSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTTNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVL
RVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQ
INKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKET
LVDLDIGDIIVETK