CuGenDBv2

CaUC04G073200 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC04G073200
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: transcription termination/antitermination protein NusG
Genome location: Ciama_Chr04:22410781..22413699
RNA-Seq Expression: CaUC04G073200
Synteny: CaUC04G073200
Gene Ontology terms: GO:0006355 - regulation of transcription, DNA-templated (biological process)
InterPro domains:
IPR006645 - NusG, N-terminal
IPR008991 - Translation protein SH3-like domain superfamily
IPR014722 - Ribosomal protein L2, domain 2
IPR036735 - NusG, N-terminal domain superfamily
IPR043425 - NusG-like


Homology
GenBank top hits | e value | % identity | Alignment
XP_004147896.1 uncharacterized protein LOC101211195 [Cucumis sativus] | 6.3e-153 | 83.77
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY
        MACGLL WS +SL STSFPALSF  SSS+RTQLS+SA+VE P +AAD+ QQLS RERRKLRNERREIKTT NWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWVMRVARVR QEIVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAK+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

XP_008448915.1 PREDICTED: transcription termination/antitermination protein NusG [Cucumis melo] | 8.8e-155 | 85.8
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY
        MA GLL WSPISL STS PALSF  SSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKLRNERREIKTT NWREEVEERLC+KPKKEFATWTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

XP_022151589.1 uncharacterized protein LOC111019492 [Momordica charantia] | 1.4e-147 | 83.67
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MA GLL WS ISLRS+SFPALSF  SSSK TQLSISAA+E  +AAD+VQQLSARERR+LRNERREIKTT NWREEVEERLCKKPKKEFA+WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN LSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET
        TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET

XP_022931956.1 uncharacterized protein LOC111438223 [Cucurbita moschata] | 5.2e-147 | 82.27
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLLTW+ + LRS SFP+LSF  SSS RTQLSISAA+E  +AAD+V QLSARERR+LRNERRE KTT NWREEVEERLCKKPKKEFA WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWVMRV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVS+ DMEAIFKEAKEEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNTLSPGSTVRV+SG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

XP_038880828.1 transcription termination/antitermination protein NusG isoform X1 [Benincasa hispida] | 5.9e-159 | 87.21
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLL WSPISLRS SFPALSF  SS KRTQLSISA VE PSAAD++QQLSARERRKLRNERREIKTT NWREEVEERLCKKPKKEFATWTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        +KLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKA+FPGSVFIRCIMNKEIHDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQAFLEKEQ++A NS ALETDLDTNGTTA K KGRPKKAVNTLSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KZV1 NGN domain-containing protein | 3.0e-153 | 83.77
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY
        MACGLL WS +SL STSFPALSF  SSS+RTQLS+SA+VE P +AAD+ QQLS RERRKLRNERREIKTT NWREEVEERLC+KPKKEFA WTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWVMRVARVR QEIVERLAR LARNYPDLDFKIYYPSV+EKRKLKNGTYTV PKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAK+EQ RHDQAFLEKEQE+A N+ AL+TDLDTNGTTA KHKGRPKKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

A0A1S3BKU2 transcription termination/antitermination protein NusG | 4.2e-155 | 85.8
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY
        MA GLL WSPISL STS PALSF  SSS+RTQLSISA+VE P +AAD+VQQLSAR+RRKLRNERREIKTT NWREEVEERLC+KPKKEFATWTEKLNLDY
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIP-SAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDY

Query:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA
        LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSV+EKRKLKNGTYTVKPKAVFPGSVFIRC+MNKEIHDFIRECDGVGGFVG +     
Subjt:  LAKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALA

Query:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS
                         +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQAFLEKEQE+A NS AL+TDLDTNGTTA KHKGR KKAVNTLSPGSTVRVAS
Subjt:  LWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVAS

Query:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
Subjt:  GTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

A0A6J1DDH4 uncharacterized protein LOC111019492 | 6.6e-148 | 83.67
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MA GLL WS ISLRS+SFPALSF  SSSK TQLSISAA+E  +AAD+VQQLSARERR+LRNERREIKTT NWREEVEERLCKKPKKEFA+WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKR+LKNGTY VKP+AVFPGSVFIRCIMNKEIHDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVSEADMEAIFKEAKEEQ RHDQ FLEKEQE+A NS   +TDLDTNGTTA K KGR KKAVN LSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET
        TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI+VET
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVET

A0A6J1EV13 uncharacterized protein LOC111438223 | 2.5e-147 | 82.27
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MACGLLTW+ + LRS SFP+LSF  SSS RTQLSISAA+E  +AAD+V QLSARERR+LRNERRE KTT NWREEVEERLCKKPKKEFA WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        AKLGPQWWVMRV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVS+ DMEAIFKEAKEEQ RHDQAFLEKE+EQA N   LETDLDTNGTTA KHKGRPKKAVNTLSPGSTVRV+SG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKK+NRKS KVTVGFTLFGKETLV+LDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

A0A6J1HQX6 uncharacterized protein LOC111465889 | 3.6e-146 | 81.98
Query:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL
        MAC LLTW+ +SLRS SFP+LSF  SSS RTQLSISAA+E  +AAD+V QLSARERR+LRNERRE KTT NWREEVEERLCKKPKKEFA WTEKLNLDYL
Subjt:  MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYL

Query:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL
        +KLGPQWWVMRV+RVRGQEIVERLARSLARNYPDLDFKIYYPSV EKRKLKNG+YTVKPKAVFPGSVFIRCIMNKE+HDFIRECDGVGGFVG +      
Subjt:  AKLGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALAL

Query:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG
                        +KRQINKPKPVS+ DMEAIFKEAKEEQ RHDQAFLEK++EQA N   LET LDTNGTTA KHKGRPKKAVNTLSPGSTVRVASG
Subjt:  WDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASG

Query:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK
        TFAEFEGSLKK+NRKSGKVTVGFTLFGKETLV LDIGDIIVETK
Subjt:  TFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVETK

SwissProt top hits | e value | % identity | Alignment
P29397 Transcription termination/antitermination protein NusG | 8.3e-07 | 23.24
Query:  YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKE
        Y  K + +FPG VF+  IMN E ++F+R    V GFV +                             +P PV + +M  I + A               
Subjt:  YTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKE

Query:  QEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
                         G    + K +P K       G  V++ SG F +F G +K+++ +  ++ V  T+FG+ET V L + ++
Subjt:  QEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

P65591 Transcription termination/antitermination protein NusG | 3.2e-06 | 20.25
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G          
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD

Query:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF
                         + N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G F
Subjt:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF

Query:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        A+F G ++++N +  K+ V   +FG+ET V+L+   +
Subjt:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

P65592 Transcription termination/antitermination protein NusG | 3.2e-06 | 20.25
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD
        +  +W+V++      + +   L   +AR      F      V++   ++NG  T+  +  +PG V +   M  +    ++    V GF+G          
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD

Query:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF
                         + N+P P+S+ + E I ++         Q  +EK                            PK  V     G  VRV  G F
Subjt:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF

Query:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        A+F G ++++N +  K+ V   +FG+ET V+L+   +
Subjt:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

Q06795 Transcription termination/antitermination protein NusG | 1.7e-07 | 24.76
Query:  DLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADME
        D  F++  P  +E+  +KNG   V  K VFPG V +  +M  +    +R   GV GFVG+  +                         +KP P+   + E
Subjt:  DLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQINKPKPVSEADME

Query:  AIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVD
         I K    ++ + D  F  KE                                       TV+V  G FA F GS+++++    KV V   +FG+ET V+
Subjt:  AIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVD

Query:  LDIGDI
        L+   I
Subjt:  LDIGDI

Q9HWC4 Transcription termination/antitermination protein NusG | 1.9e-06 | 18.99
Query:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD
        +  +W+V+       + ++  L   +     + +F       +E  +++NG      +  FPG V ++  MN+     +++   V GF+G          
Subjt:  LGPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWD

Query:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF
                           +KP P+++ + +AI +                   + ++SG                K +PK       PG TVRV  G F
Subjt:  YAIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTF

Query:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI
        A+F G ++++N +  ++ V   +FG+ T V+L+   +
Subjt:  AEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDI

Arabidopsis top hits | e value | % identity | Alignment
AT3G09210.1 plastid transcriptionally active 13 | 1.1e-86 | 51.53
Query:  GLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYLAKL
        GLL WS    RS+  P++  P +   +TQ SI+A V      +   QL+A+ERR+LRNERRE K   +WREEVEE+L KKPKK +ATWTE+LNLD LA+ 
Subjt:  GLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYLAKL

Query:  GPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDY
        GPQWW +RV+R+RG E  + LAR+LAR +P+++F +Y PSVQ KRKLKNG+ +VKPK VFPG +FIRCI+NKEIHD IR+ DGVGGF+G++         
Subjt:  GPQWWVMRVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDY

Query:  AIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLE--KEQEQA-----------SNSGALETDLDT-------NGTTAIKHKGRPK
                     +KRQINKP+PV ++D+EAIFK+AKE Q + D  F E  + +E+A           SNS  +ET  ++         T A + K + K
Subjt:  AIFSVVLIVWAIYSKRQINKPKPVSEADMEAIFKEAKEEQVRHDQAFLE--KEQEQA-----------SNSGALETDLDT-------NGTTAIKHKGRPK

Query:  KAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE
        K    L+ GSTVRV SGTFAEF G+LKKLNRK+ K TVGFTLFGKETLV++DI +++ E
Subjt:  KAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKETLVDLDIGDIIVE


Sequences
CDS sequence
ATGGCCTGTGGGCTTCTGACTTGGAGCCCAATTTCTCTTCGTTCAACTTCTTTCCCTGCCCTTTCCTTCCCTTTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGC
CGCCGTCGAAATCCCCTCCGCCGCCGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTAACAATTGGAGGG
AAGAAGTAGAGGAGAGGCTCTGCAAGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTATG
CGTGTGGCTCGTGTTAGAGGCCAAGAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAA
GAGGAAATTAAAGAATGGTACTTACACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATACATGACTTCATCAGAGAGT
GTGATGGTGTTGGAGGCTTTGTTGGGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAG
ATAAATAAACCAAAGCCGGTATCCGAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCA
AGCTTCAAACTCTGGCGCGCTCGAGACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAA
CTGTTCGGGTGGCGTCTGGGACTTTTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACC
CTCGTAGACCTTGACATTGGTGATATTATTGTAGAGACAAAATGA
mRNA sequence
CAAACGACAAATGGGAGTTGCAATAACGACCACCCATCGGCGTTTATCGTCACCGGTGACTCCACAGCCGAGCCAGTAATCGGAAAATGGCCTGTGGGCTTCTGACTTGG
AGCCCAATTTCTCTTCGTTCAACTTCTTTCCCTGCCCTTTCCTTCCCTTTCTCTTCTTCCAAACGTACCCAATTATCAATCTCCGCCGCCGTCGAAATCCCCTCCGCCGC
CGACGAAGTTCAGCAGCTGTCGGCCCGGGAGAGGAGGAAGCTGAGGAATGAGAGGAGAGAGATTAAAACCACTAACAATTGGAGGGAAGAAGTAGAGGAGAGGCTCTGCA
AGAAGCCCAAGAAGGAATTTGCCACTTGGACTGAGAAGCTCAACCTTGATTACCTCGCTAAATTGGGTCCTCAATGGTGGGTTATGCGTGTGGCTCGTGTTAGAGGCCAA
GAAATTGTTGAACGCCTCGCTCGTTCTCTTGCTAGGAACTACCCCGACCTAGATTTCAAGATATATTACCCGTCTGTTCAGGAGAAGAGGAAATTAAAGAATGGTACTTA
CACCGTTAAACCGAAAGCTGTTTTTCCTGGATCTGTATTTATTAGGTGTATCATGAACAAGGAGATACATGACTTCATCAGAGAGTGTGATGGTGTTGGAGGCTTTGTTG
GGAATGAGAGAACTGCATTGGCCCTTTGGGATTATGCAATTTTCAGTGTTGTTCTGATAGTTTGGGCAATATACAGTAAACGACAGATAAATAAACCAAAGCCGGTATCC
GAAGCTGACATGGAAGCAATCTTCAAAGAGGCAAAGGAAGAGCAAGTAAGACATGACCAGGCTTTCCTAGAGAAAGAGCAAGAGCAAGCTTCAAACTCTGGCGCGCTCGA
GACTGACTTAGATACAAATGGTACTACTGCTATAAAGCACAAAGGAAGACCGAAAAAAGCTGTTAATACTTTGTCTCCAGGGTCAACTGTTCGGGTGGCGTCTGGGACTT
TTGCTGAATTTGAAGGCTCTCTTAAGAAGCTGAACCGTAAAAGTGGAAAGGTAACTGTAGGATTCACACTATTTGGGAAGGAAACCCTCGTAGACCTTGACATTGGTGAT
ATTATTGTAGAGACAAAATGAGTGAATGTACTTTCTCAGCCATGTGAAGTGTGGATGAAGCTCATGGATTTGGAGGATAAGCAAGCAGCTCCAGCTCTCAGCTCTCAACT
CATAAGCACGTTCACGAACCAATGAACACAAAAGTACTTTTGCTTGAATTGCCAAACAGCTTACTTTGATGAACATGTCCACAAATCAGCTGTGCTTTTTTGAAATTACT
CTGCATTTGGAGATTCTTATTGGAAATCAACCAAGGTCTTGAAGGAACTGGCAGAAGCCCAAATGTATTATTGTTAGACTATTCAGAAACCAAAATAGGGAATATCATGA
TGAACCATAGATAGCAATTGTAAATTCAAATGTTTCTGAAGTCAGAAATTTGAATCATGATTGTCTTAATTCAGGAAAATATTCTATAAACTAGCTATGTTCTAAAGTA
Protein sequence
MACGLLTWSPISLRSTSFPALSFPFSSSKRTQLSISAAVEIPSAADEVQQLSARERRKLRNERREIKTTNNWREEVEERLCKKPKKEFATWTEKLNLDYLAKLGPQWWVM
RVARVRGQEIVERLARSLARNYPDLDFKIYYPSVQEKRKLKNGTYTVKPKAVFPGSVFIRCIMNKEIHDFIRECDGVGGFVGNERTALALWDYAIFSVVLIVWAIYSKRQ
INKPKPVSEADMEAIFKEAKEEQVRHDQAFLEKEQEQASNSGALETDLDTNGTTAIKHKGRPKKAVNTLSPGSTVRVASGTFAEFEGSLKKLNRKSGKVTVGFTLFGKET
LVDLDIGDIIVETK