; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G042230 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G042230
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionNusB domain-containing protein
Genome locationCicolChr02:37204639..37210704
RNA-Seq ExpressionCcUC02G042230
SyntenyCcUC02G042230
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0031564 - transcription antitermination (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR006027 - NusB/RsmB/TIM44
IPR011605 - NusB antitermination factor
IPR035926 - NusB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040119.1 NusB/RsmB/TIM44 [Cucumis melo var. makuwa]1.7e-14158.3Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGIFTHRDKVITIR-----------------------------------------------------------------------------
          PG SA KT +F   DKVIT R                                                                             
Subjt:  --PGDSADKTGIFTHRDKVITIR-----------------------------------------------------------------------------

Query:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV
                                        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+
Subjt:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV

Query:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERI
        DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI             
Subjt:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERI

Query:  KRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAV
                         EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAV
Subjt:  KRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAV

Query:  DLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        DLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  DLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

TYK21741.1 NusB/RsmB/TIM44 [Cucumis melo var. makuwa]1.4e-14675.65Show/hide
Query:  FSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKD
        FS SFSTL S K S F     DIGL DS  A Q                  PG SA KT +FT  DKVIT RPNFHFS HISGIC+FPF ASSIVPH K+
Subjt:  FSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKD

Query:  PMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMS
         MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMS
Subjt:  PMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMS

Query:  FGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVI
        FGGPPVTVET+EEADELLRKDE+DSTI                              EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVI
Subjt:  FGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVI

Query:  PSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        P TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  PSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

XP_004149639.3 uncharacterized protein LOC101216754 isoform X1 [Cucumis sativus]1.8e-15475.06Show/hide
Query:  MSLAPPTSPYPFPKSCLP---NHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGIFTH
        MSLAPPTS YP+     P   +HLSS +Q   SHP  F     FHLSFS SFSTL SLK S F    +D GL DS  ADQ       PG SA KT +FT 
Subjt:  MSLAPPTSPYPFPKSCLP---NHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGIFTH

Query:  RDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC
         DKVIT RPNFHFSYHISGIC+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSGKFCSPRAARELALSIVYAAC
Subjt:  RDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC

Query:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAA
        LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                              EAEILAA
Subjt:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAA

Query:  PPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK
        PPK+VYSKLILRFTRKLLVAV D WDSR LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK
Subjt:  PPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK

Query:  DIKEIDSTHAGDK
        DIKEIDS  A +K
Subjt:  DIKEIDSTHAGDK

XP_008449897.1 PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo]2.1e-15572.98Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SA KT +F   DKVIT RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSP
Subjt:  --PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP
        RAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                      
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP

Query:  NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG
                EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG
Subjt:  NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG

Query:  AAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        AAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  AAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

XP_038901769.1 uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida]1.0e-16274.36Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPP SPY       P PKSCLP+HLSS +Q   SH KLFP HS  H SFS S STL SLK        + +GL D   ADQ                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SAD TG+FT  DKVIT RPNFHFS HISGICRFPFRASSIVPH KD M HLCPQASLRASTSF ENCVAE+R+SI+VSS+ETIPKVDKSGKFCSP
Subjt:  --PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP
        RAARELALSIVYAACLEGSDPVRLFEKRLN RRE GYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDS I                      
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP

Query:  NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG
                EAEILAAPPKMVYSKLILRFTRKLLVAVVD WDSRVLKIEKVIP TWK+KPA RILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG
Subjt:  NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG

Query:  AAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        AAPRIINGCLRTFVKDIKEIDS+HA +KQEVRA
Subjt:  AAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

TrEMBL top hitse value%identityAlignment
A0A0A0KWZ5 NusB domain-containing protein8.6e-15575.06Show/hide
Query:  MSLAPPTSPYPFPKSCLP---NHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGIFTH
        MSLAPPTS YP+     P   +HLSS +Q   SHP  F     FHLSFS SFSTL SLK S F    +D GL DS  ADQ       PG SA KT +FT 
Subjt:  MSLAPPTSPYPFPKSCLP---NHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGIFTH

Query:  RDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC
         DKVIT RPNFHFSYHISGIC+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSGKFCSPRAARELALSIVYAAC
Subjt:  RDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC

Query:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAA
        LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                              EAEILAA
Subjt:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAA

Query:  PPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK
        PPK+VYSKLILRFTRKLLVAV D WDSR LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK
Subjt:  PPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK

Query:  DIKEIDSTHAGDK
        DIKEIDS  A +K
Subjt:  DIKEIDSTHAGDK

A0A1S3BNR2 uncharacterized protein LOC103491638 isoform X11.0e-15572.98Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SA KT +F   DKVIT RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSP
Subjt:  --PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP
        RAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                      
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAP

Query:  NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG
                EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG
Subjt:  NVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDG

Query:  AAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        AAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  AAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

A0A1S4DXQ5 uncharacterized protein LOC103491638 isoform X21.5e-13582.58Show/hide
Query:  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVR
        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVR
Subjt:  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVR

Query:  LFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSK
        LFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI                              EAEILAAPPKMVYSK
Subjt:  LFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSK

Query:  LILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDST
        LILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST
Subjt:  LILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDST

Query:  HAGDKQEVRA
         A +KQEVRA
Subjt:  HAGDKQEVRA

A0A5A7TFT7 NusB/RsmB/TIM448.3e-14258.3Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS +Q   SHP LFP    FHLSFS SFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGIFTHRDKVITIR-----------------------------------------------------------------------------
          PG SA KT +F   DKVIT R                                                                             
Subjt:  --PGDSADKTGIFTHRDKVITIR-----------------------------------------------------------------------------

Query:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV
                                        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+
Subjt:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV

Query:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERI
        DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTI             
Subjt:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERI

Query:  KRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAV
                         EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVIP TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAV
Subjt:  KRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAV

Query:  DLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        DLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  DLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

A0A5D3DDJ4 NusB/RsmB/TIM446.6e-14775.65Show/hide
Query:  FSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKD
        FS SFSTL S K S F     DIGL DS  A Q                  PG SA KT +FT  DKVIT RPNFHFS HISGIC+FPF ASSIVPH K+
Subjt:  FSNSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGIFTHRDKVITIRPNFHFSYHISGICRFPFRASSIVPHAKD

Query:  PMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMS
         MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMS
Subjt:  PMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMS

Query:  FGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVI
        FGGPPVTVET+EEADELLRKDE+DSTI                              EAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+R LKIEKVI
Subjt:  FGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVI

Query:  PSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        P TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  PSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

SwissProt top hitse value%identityAlignment
A7GWZ7 Transcription antitermination protein NusB2.7e-0434.15Show/hide
Query:  VAVVDEWDSRVLK---IEKVIPSTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        +  ++ +D+  LK   +++++    K K   R  I+EL IL L + E+   GT   ++INEA++LAK     +AP+ ING L
Subjt:  VAVVDEWDSRVLK---IEKVIPSTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

B1WXY6 Transcription antitermination protein NusB9.2e-0532.97Show/hide
Query:  RKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIK
        R+  + ++   + R  +I++ + +  K+    R+  ++  IL LA++EI  +    ++ INEAV+LAKR+ D    R ING LR F   IK
Subjt:  RKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIK

Q18B61 Transcription antitermination protein NusB2.9e-0639.19Show/hide
Query:  KIEKVIPSTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
        KI+++I    KN    R+  +++ IL L++ EI  + T +++ INEAV+LAK +CD  +P+ ING L + V +I
Subjt:  KIEKVIPSTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI

Q5N1J7 Transcription antitermination protein NusB2.1e-0452.08Show/hide
Query:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
        L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Subjt:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR

Q8GIR7 Transcription antitermination protein NusB2.1e-0452.08Show/hide
Query:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
        L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Subjt:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR

Arabidopsis top hitse value%identityAlignment
AT4G26370.1 antitermination NusB domain-containing protein3.9e-8364.9Show/hide
Query:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLS
        +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DEK+S I         
Subjt:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLS

Query:  NERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVI
                             EAE+L+APPK+VYSKL+LRF +KLL AVVD+WDS V+ IEK+ P  WK+ PAGRILE  ILHLAMSE+ V+ TRH IVI
Subjt:  NERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTWKNKPAGRILELCILHLAMSEITVIGTRHQIVI

Query:  NEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGD-KQEV
        NEAVDLAKRFCDG+APRIINGCLRTFVKD     +  A + KQEV
Subjt:  NEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGD-KQEV

AT4G26370.2 antitermination NusB domain-containing protein8.5e-3876.84Show/hide
Query:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQL
        +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DEK+S IG+ L
Subjt:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCTCTTCTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCC
ACACTCTCGCTTTCATCTTAGTTTCTCCAACTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATCGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTC
ATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGAATCTTCACCCATCGGGATAAAGTTATTACTATCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATA
TGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTG
TGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGT
CAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCAGGATATGAATTTGACAAGACATCATTAATG
GAATATAATCATATGAGCTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGCAAGGATGAAAAGGATTCTACAATTGGTAAACAATT
ATGTGACCTTCTATCTAATGAAAGAATCAAGAGGACTACTAGTTCCAAGGCCCCAAATGTCACTATGGATCAGCAAAAGGAGGCAGAAATCCTCGCAGCCCCACCAAAGA
TGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACGAATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTTCAACTTGG
AAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGA
TCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATTATTAACGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGC
AAGAAGTCCGGGCATAA
mRNA sequenceShow/hide mRNA sequence
CCTTCTTCCACTACTAGAAATGCCTTTGCCTCCAAATCCCGAGTGAGTAGCGTGGGCTGATTTTGGGGCATTTCACAATGTCTTTAGCTCCACCCACCTCCCCTTATCCG
TTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCTCTTCTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCCACACTCTCGCTTTCATCTTAGTTTCTCCAACTC
CTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATCGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGA
CTGGAATCTTCACCCATCGGGATAAAGTTATTACTATCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTT
CCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTC
TTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTG
ATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCAGGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGCTTTGGAGGCCCGCCA
GTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGCAAGGATGAAAAGGATTCTACAATTGGTAAACAATTATGTGACCTTCTATCTAATGAAAGAATCAAGAG
GACTACTAGTTCCAAGGCCCCAAATGTCACTATGGATCAGCAAAAGGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGTTTACAC
GAAAACTTTTGGTTGCAGTTGTGGACGAATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTTCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTT
TGTATTCTCCACCTGGCCATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACC
CCGTATTATTAACGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATAAGCCTGAACTTTTGTGG
TGCCTCAGGACCCTTTGTAAGGATATCGAAGAATTCGACTGAACTCATGCTAGCTCGAGTAAGTGATTCTGGAGTCTCAACTTTTGGGGGTACGTCTATTAAACCTGTTG
CTGTCAAAAGCAGCCAAAGTGCAGTGATTTGAAGAATCTCATACTTGCCGCAAACCACAGCAAGAATTTCTTTCCCTTTCCCTTGTAGGCGTGGATTTCTGTTATATCCA
AATTATCCATAGAGTTGAGTAATGAGGTGCATGATGTGTGAGAGAATGAGTTGCCAAATTAATCTAGTACTTGGAGAGATGTTATCATGCACATAATGTTGTCAATGTAT
TGAAGGGTTTACTTTTTTCTTTTTCTTTTTTTTAATTTTTAAGTATGTG
Protein sequenceShow/hide protein sequence
MSLAPPTSPYPFPKSCLPNHLSSSSQSSSSHPKLFPPHSRFHLSFSNSFSTLQSLKPSPFIARASDIGLRDSVHADQPGDSADKTGIFTHRDKVITIRPNFHFSYHISGI
CRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLM
EYNHMSFGGPPVTVETVEEADELLRKDEKDSTIGKQLCDLLSNERIKRTTSSKAPNVTMDQQKEAEILAAPPKMVYSKLILRFTRKLLVAVVDEWDSRVLKIEKVIPSTW
KNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA