; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G25850 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G25850
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionNusB domain-containing protein
Genome locationClcChr02:37380644..37386793
RNA-Seq ExpressionClc02G25850
SyntenyClc02G25850
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0031564 - transcription antitermination (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR006027 - NusB/RsmB/TIM44
IPR011605 - NusB antitermination factor
IPR035926 - NusB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040119.1 NusB/RsmB/TIM44 [Cucumis melo var. makuwa]4.2e-13555.66Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS TQ   SHP LFP   LFHLSFSTSFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGLFTHGDKVTTIR-----------------------------------------------------------------------------
          PG SA KT LF  GDKV T R                                                                             
Subjt:  --PGDSADKTGLFTHGDKVTTIR-----------------------------------------------------------------------------

Query:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV
                                        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+
Subjt:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV

Query:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSK
        DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    + 
Subjt:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSK

Query:  APNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQI
         P +++                                ++ RFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQI
Subjt:  APNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQI

Query:  VINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        VINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  VINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

TYK21741.1 NusB/RsmB/TIM44 [Cucumis melo var. makuwa]6.2e-13970.28Show/hide
Query:  PPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA
        PP+ L    FSTSFSTL S K S F     DIGL DS  A Q                  PG SA KT LFT GDKV T RPNFHFS HISGIC+FPF A
Subjt:  PPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA

Query:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT
        SSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKT
Subjt:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT

Query:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAV
        SLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++                                ++ RFTRKLLVAV
Subjt:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAV

Query:  VDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        VDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  VDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

XP_004149639.3 uncharacterized protein LOC101216754 isoform X1 [Cucumis sativus]5.6e-14871.36Show/hide
Query:  MSLAPPTSPYPFPKSCLP---NHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH
        MSLAPPTS YP+     P   +HLSS TQ   SHP  F     FHLSFSTSFSTL SLK S F    +D GL DS  ADQ       PG SA KT LFT 
Subjt:  MSLAPPTSPYPFPKSCLP---NHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH

Query:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC
        GDKV T RPNFHFSYHISGIC+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSGKFCSPRAARELALSIVYAAC
Subjt:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC

Query:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQ
        LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++                  
Subjt:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQ

Query:  QTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGC
                      ++ RFTRKLLVAV DGWDSR LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGC
Subjt:  QTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGC

Query:  LRTFVKDIKEIDSTHAGDK
        LRTFVKDIKEIDS  A +K
Subjt:  LRTFVKDIKEIDSTHAGDK

XP_008449897.1 PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo]5.1e-14969.48Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS TQ   SHP LFP   LFHLSFSTSFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SA KT LF  GDKV T RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSP
Subjt:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINA
        RAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++   
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINA

Query:  EGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA
                                     ++ RFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA
Subjt:  EGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA

Query:  KRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        KRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  KRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

XP_038901769.1 uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida]2.5e-15670.84Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPP SPY       P PKSCLP+HLSS TQ   SH KLFP HSL H SFSTS STL SLK        + +GL D   ADQ                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SAD TGLFT GDKV T RPNFHFS HISGICRFPFRASSIVPH KD M HLCPQASLRASTSF ENCVAE+R+SI+VSS+ETIPKVDKSGKFCSP
Subjt:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINA
        RAARELALSIVYAACLEGSDPVRLFEKRLN RRE GYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDS       I+    +  P +++   
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINA

Query:  EGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA
                                     ++ RFTRKLLVAVVDGWDSRVLKIEKVIPPTWK+KPA RILELCILHLAMSEITVIGTRHQIVINEAVDLA
Subjt:  EGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA

Query:  KRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        KRFCDGAAPRIINGCLRTFVKDIKEIDS+HA +KQEVRA
Subjt:  KRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

TrEMBL top hitse value%identityAlignment
A0A0A0KWZ5 NusB domain-containing protein2.7e-14871.36Show/hide
Query:  MSLAPPTSPYPFPKSCLP---NHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH
        MSLAPPTS YP+     P   +HLSS TQ   SHP  F     FHLSFSTSFSTL SLK S F    +D GL DS  ADQ       PG SA KT LFT 
Subjt:  MSLAPPTSPYPFPKSCLP---NHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH

Query:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC
        GDKV T RPNFHFSYHISGIC+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSGKFCSPRAARELALSIVYAAC
Subjt:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC

Query:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQ
        LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++                  
Subjt:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQ

Query:  QTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGC
                      ++ RFTRKLLVAV DGWDSR LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGC
Subjt:  QTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGC

Query:  LRTFVKDIKEIDSTHAGDK
        LRTFVKDIKEIDS  A +K
Subjt:  LRTFVKDIKEIDSTHAGDK

A0A1S3BNR2 uncharacterized protein LOC103491638 isoform X12.5e-14969.48Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS TQ   SHP LFP   LFHLSFSTSFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SA KT LF  GDKV T RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSP
Subjt:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINA
        RAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++   
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINA

Query:  EGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA
                                     ++ RFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA
Subjt:  EGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLA

Query:  KRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        KRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  KRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

A0A1S4DXQ5 uncharacterized protein LOC103491638 isoform X27.7e-12776.27Show/hide
Query:  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVR
        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVR
Subjt:  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVR

Query:  LFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEI
        LFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++                          
Subjt:  LFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEI

Query:  RSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
              ++ RFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
Subjt:  RSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI

Query:  KEIDSTHAGDKQEVRA
        KE DST A +KQEVRA
Subjt:  KEIDSTHAGDKQEVRA

A0A5A7TFT7 NusB/RsmB/TIM442.0e-13555.66Show/hide
Query:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPPTS Y       P PKS    HLSS TQ   SHP LFP   LFHLSFSTSFSTL S K S F     DIGL DS  A Q                
Subjt:  MSLAPPTSPY-------PFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGLFTHGDKVTTIR-----------------------------------------------------------------------------
          PG SA KT LF  GDKV T R                                                                             
Subjt:  --PGDSADKTGLFTHGDKVTTIR-----------------------------------------------------------------------------

Query:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV
                                        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+
Subjt:  --------------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKV

Query:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSK
        DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    + 
Subjt:  DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSK

Query:  APNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQI
         P +++                                ++ RFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQI
Subjt:  APNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQI

Query:  VINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        VINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  VINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

A0A5D3DDJ4 NusB/RsmB/TIM443.0e-13970.28Show/hide
Query:  PPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA
        PP+ L    FSTSFSTL S K S F     DIGL DS  A Q                  PG SA KT LFT GDKV T RPNFHFS HISGIC+FPF A
Subjt:  PPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA

Query:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT
        SSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKT
Subjt:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT

Query:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAV
        SLMEYNHMSFGGPPVTVET+EEADELLRKDE+DST      I+    +  P +++                                ++ RFTRKLLVAV
Subjt:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAV

Query:  VDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        VDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  VDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

SwissProt top hitse value%identityAlignment
A6QBK6 Transcription antitermination protein NusB2.1e-0437.35Show/hide
Query:  FTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        F+  L    ++  +    +IEK +   W     GR+ E  IL L   EI V  T   I+INEAV+LAK   D  +P+ ING L
Subjt:  FTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

A7GWZ7 Transcription antitermination protein NusB2.5e-0535.37Show/hide
Query:  VAVVDGWDSRVLK---IEKVIPPTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        +  ++ +D+  LK   +++++ P  K K   R  I+EL IL L + E+   GT   ++INEA++LAK     +AP+ ING L
Subjt:  VAVVDGWDSRVLK---IEKVIPPTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

B1WXY6 Transcription antitermination protein NusB7.2e-0543.75Show/hide
Query:  WKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIK
        W+ K   +I +  IL LA++EI  +    ++ INEAV+LAKR+ D    R ING LR F   IK
Subjt:  WKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIK

Q18B61 Transcription antitermination protein NusB3.8e-0639.19Show/hide
Query:  KIEKVIPPTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
        KI+++I    KN    R+  +++ IL L++ EI  + T +++ INEAV+LAK +CD  +P+ ING L + V +I
Subjt:  KIEKVIPPTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI

Q8GIR7 Transcription antitermination protein NusB2.1e-0452.08Show/hide
Query:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
        L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Subjt:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR

Arabidopsis top hitse value%identityAlignment
AT4G26370.1 antitermination NusB domain-containing protein4.0e-7559.36Show/hide
Query:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRT
        +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DEK+S      +I+  
Subjt:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRT

Query:  TSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGT
          S  P +++                                +V RF +KLL AVVD WDS V+ IEK+ PP WK+ PAGRILE  ILHLAMSE+ V+ T
Subjt:  TSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGT

Query:  RHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGD-KQEV
        RH IVINEAVDLAKRFCDG+APRIINGCLRTFVKD     +  A + KQEV
Subjt:  RHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGD-KQEV

AT4G26370.2 antitermination NusB domain-containing protein3.6e-3678.02Show/hide
Query:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTI
        +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DEK+S I
Subjt:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCTCTACTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCC
ACACTCTCTCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATTGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTC
ATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGACTCTTCACCCATGGGGATAAAGTTACTACTATCAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATA
TGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTG
TGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGT
CAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCGGGATATGAATTTGACAAGACATCATTAATG
GAATATAATCATATGAGTTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGCAAGGATGAAAAGGATTCTACAATTGTATCTAATGA
AAGAATCAAGAGGACTACTAGTTCCAAGGCCCCAAATGTCATATGGATCAATGCAGAAGGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATC
TTACGGATGAAATAAGGTCAGCCATTGTCTTCATGGTTTTCAGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACGGATGGGACAGCCGTGTGCTTAAAATTGAAAAA
GTCATTCCTCCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCTATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGT
CATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATTATTAATGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAA
CTCATGCTGGAGACAAGCAAGAAGTCCGGGCATGA
mRNA sequenceShow/hide mRNA sequence
ATTTTTCCCCATCCGATTCGGCTTATTTCCGACCAAACAAGTCGGAAGAAGTATTGGTTCAGAAACCATCTTCCTTCTTCCACTACTGGAAATGCCTTTGCCTCCAAATC
CCCAGTGAGTAGCGTGGGCTGAATTTGGGGCATTTTCAAAATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATCCTGCCTTCCTAACCATTTATCTTCCT
CTACTCAATCTTCATCTTCCCATCCCAAACTCTTTCCCCCACACTCTCTCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTC
ATTGCCAGAGCGTCCGACATTGGACTCCGAGATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGACTCTTCACCCATGGGGATAAAGTTACTACTAT
CAGGCCTAATTTTCATTTCTCCTACCACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTC
AAGCCTCACTTCGGGCTTCTACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGT
AAATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCG
ACGAGAATCGGGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGTTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTT
TACGCAAGGATGAAAAGGATTCTACAATTGTATCTAATGAAAGAATCAAGAGGACTACTAGTTCCAAGGCCCCAAATGTCATATGGATCAATGCAGAAGGAGGCAGAAAT
CCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGATGAAATAAGGTCAGCCATTGTCTTCATGGTTTTCAGGTTTACACGAAAACTTTTGGTTGCAGT
TGTGGACGGATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTCCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCACCTGGCTA
TGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATTATTAATGGGTGC
CTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATGAGCCTGAACTTTTGTGGTGCCTCAGGACCCTTTGTA
AGGATATCGAAGAATTTGACTCAACTCATGCTAGCTCGAGTAAGTGATTCTGGAGTCTCAACTTTTGGGGGTACGTCTATTAAACCTGTTGCTGTCAATAGCAGCTAAAG
TGCAGTGATTTGAAGAATCTCATACTTGCCGCAAACCACAGCAAGAATTTCTTTCCCTTCCCCTTGTAGGCGTGGATTTCTGTTATATCCAAATTATCCATAGAGTTAAG
TAATGAGGTGCATGATGTGTGAGAGAATAAGTTGTCAAATTAATCTAGTACTTGGAGAGATGTTATCATGCACATAATATTATCAATGTATTGAAGGGTTTACTTTTTTC
TTTTTCTTTTTTTTAATTTTTAAGTATGTGTCTCTCACCACTTTGTACGTGCCATATGGCTCAAAATGACTGTGGGGTAAGGCGAGTTGTCAATCCTTTCTCCCCAAAGT
TGTGTGCTATGTAAGCCGTGTGGTTCCTCTTGTATGCTTTCTAGCCTACAACATTTGTTTTCTCAAATGTTTTCCAAA
Protein sequenceShow/hide protein sequence
MSLAPPTSPYPFPKSCLPNHLSSSTQSSSSHPKLFPPHSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQPGDSADKTGLFTHGDKVTTIRPNFHFSYHISGI
CRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLM
EYNHMSFGGPPVTVETVEEADELLRKDEKDSTIVSNERIKRTTSSKAPNVIWINAEGGRNPRSPTKDGLQQTDLTDEIRSAIVFMVFRFTRKLLVAVVDGWDSRVLKIEK
VIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA