; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G049770 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G049770
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionNusB domain-containing protein
Genome locationCiama_Chr02:37468987..37474440
RNA-Seq ExpressionCaUC02G049770
SyntenyCaUC02G049770
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0031564 - transcription antitermination (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR006027 - NusB/RsmB/TIM44
IPR011605 - NusB antitermination factor
IPR035926 - NusB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040119.1 NusB/RsmB/TIM44 [Cucumis melo var. makuwa]2.9e-15162.99Show/hide
Query:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG
        MSLAPPTS Y +     P   +HLSS TQ   SHP LFP R LFHLSFSTSFSTL S K S F     DIGL DS  A Q                  PG
Subjt:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG

Query:  DSADKTGLFTHGDKVTTIR---------------------------------------------------------------------------------
         SA KT LF  GDKV T R                                                                                 
Subjt:  DSADKTGLFTHGDKVTTIR---------------------------------------------------------------------------------

Query:  ----------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSG
                                    PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSG
Subjt:  ----------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSG

Query:  KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLI
        KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLI
Subjt:  KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLI

Query:  LRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHA
        LRFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A
Subjt:  LRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHA

Query:  GDKQEVRA
         +KQEVRA
Subjt:  GDKQEVRA

TYK21741.1 NusB/RsmB/TIM44 [Cucumis melo var. makuwa]2.1e-15481.99Show/hide
Query:  PPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA
        PP  L    FSTSFSTL S K S F     DIGL DS  A Q                  PG SA KT LFT GDKV T RPNFHFS HISGIC+FPF A
Subjt:  PPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA

Query:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT
        SSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKT
Subjt:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT

Query:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLA
        SLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLA
Subjt:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLA

Query:  MSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        MSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  MSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

XP_004149639.3 uncharacterized protein LOC101216754 isoform X1 [Cucumis sativus]3.3e-16382.25Show/hide
Query:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH
        MSLAPPTS YP+     P   +HLSS TQ   SHP  F     FHLSFSTSFSTL SLK S F    +D GL DS  ADQ       PG SA KT LFT 
Subjt:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH

Query:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC
        GDKV T RPNFHFSYHISGIC+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSGKFCSPRAARELALSIVYAAC
Subjt:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC

Query:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVL
        LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPK+VYSKLILRFTRKLLVAV DGWDSR L
Subjt:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVL

Query:  KIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDK
        KIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDS  A +K
Subjt:  KIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDK

XP_008449897.1 PREDICTED: uncharacterized protein LOC103491638 isoform X1 [Cucumis melo]3.5e-16580.2Show/hide
Query:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG
        MSLAPPTS Y +     P   +HLSS TQ   SHP LFP R LFHLSFSTSFSTL S K S F     DIGL DS  A Q                  PG
Subjt:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG

Query:  DSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAAR
         SA KT LF  GDKV T RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAAR
Subjt:  DSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAAR

Query:  ELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLV
        ELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLILRFTRKLLV
Subjt:  ELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLV

Query:  AVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        AVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  AVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

XP_038901769.1 uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida]1.3e-17080.89Show/hide
Query:  MSLAPPTSPY-------PFPKFCLPNHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------
        MSLAPP SPY       P PK CLP+HLSS TQ   SH KLFP  SL H SFSTS STL SLK        + +GL D   ADQ                
Subjt:  MSLAPPTSPY-------PFPKFCLPNHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ----------------

Query:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP
          PG SAD TGLFT GDKV T RPNFHFS HISGICRFPFRASSIVPH KD M HLCPQASLRASTSF ENCVAE+R+SI+VSS+ETIPKVDKSGKFCSP
Subjt:  --PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSP

Query:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTR
        RAARELALSIVYAACLEGSDPVRLFEKRLN RRE GYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDS IEAEILAAPPKMVYSKLILRFTR
Subjt:  RAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTR

Query:  KLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQE
        KLLVAVVDGWDSRVLKIEKVIPPTWK+KPA RILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDS+HA +KQE
Subjt:  KLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQE

Query:  VRA
        VRA
Subjt:  VRA

TrEMBL top hitse value%identityAlignment
A0A0A0KWZ5 NusB domain-containing protein1.6e-16382.25Show/hide
Query:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH
        MSLAPPTS YP+     P   +HLSS TQ   SHP  F     FHLSFSTSFSTL SLK S F    +D GL DS  ADQ       PG SA KT LFT 
Subjt:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ-------PGDSADKTGLFTH

Query:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC
        GDKV T RPNFHFSYHISGIC+FPF ASSIVPH KD MP  C QASLRASTSFSEN VAEER+SI++SSIE IPKVDKSGKFCSPRAARELALSIVYAAC
Subjt:  GDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAAC

Query:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVL
        LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPK+VYSKLILRFTRKLLVAV DGWDSR L
Subjt:  LEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVL

Query:  KIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDK
        KIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDS  A +K
Subjt:  KIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDK

A0A1S3BNR2 uncharacterized protein LOC103491638 isoform X11.7e-16580.2Show/hide
Query:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG
        MSLAPPTS Y +     P   +HLSS TQ   SHP LFP R LFHLSFSTSFSTL S K S F     DIGL DS  A Q                  PG
Subjt:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG

Query:  DSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAAR
         SA KT LF  GDKV T RPNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAAR
Subjt:  DSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAAR

Query:  ELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLV
        ELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLILRFTRKLLV
Subjt:  ELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLV

Query:  AVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        AVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  AVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

A0A1S4DXQ5 uncharacterized protein LOC103491638 isoform X22.0e-14292.14Show/hide
Query:  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVR
        PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVR
Subjt:  PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVR

Query:  LFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPP
        LFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWD+R LKIEKVIPP
Subjt:  LFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPP

Query:  TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  TWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

A0A5A7TFT7 NusB/RsmB/TIM441.4e-15162.99Show/hide
Query:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG
        MSLAPPTS Y +     P   +HLSS TQ   SHP LFP R LFHLSFSTSFSTL S K S F     DIGL DS  A Q                  PG
Subjt:  MSLAPPTSPYPFPKFCLP---NHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PG

Query:  DSADKTGLFTHGDKVTTIR---------------------------------------------------------------------------------
         SA KT LF  GDKV T R                                                                                 
Subjt:  DSADKTGLFTHGDKVTTIR---------------------------------------------------------------------------------

Query:  ----------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSG
                                    PNFHFS HISGIC+FPF ASSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSG
Subjt:  ----------------------------PNFHFSYHISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSG

Query:  KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLI
        KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKTSLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLI
Subjt:  KFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLI

Query:  LRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHA
        LRFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A
Subjt:  LRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHA

Query:  GDKQEVRA
         +KQEVRA
Subjt:  GDKQEVRA

A0A5D3DDJ4 NusB/RsmB/TIM441.0e-15481.99Show/hide
Query:  PPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA
        PP  L    FSTSFSTL S K S F     DIGL DS  A Q                  PG SA KT LFT GDKV T RPNFHFS HISGIC+FPF A
Subjt:  PPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQ------------------PGDSADKTGLFTHGDKVTTIRPNFHFSYHISGICRFPFRA

Query:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT
        SSIVPH K+ MP LC QASLRASTSF EN VAEER+SI+VSSIETIPK+DKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLN+RRESGYEFDKT
Subjt:  SSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKT

Query:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLA
        SLMEYNHMSFGGPPVTVET+EEADELLRKDE+DSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWD+R LKIEKVIPPTWKNKPAGRILELCILHLA
Subjt:  SLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLA

Query:  MSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA
        MSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKE DST A +KQEVRA
Subjt:  MSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA

SwissProt top hitse value%identityAlignment
A6QBK6 Transcription antitermination protein NusB1.9e-0437.35Show/hide
Query:  FTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        F+  L    ++  +    +IEK +   W     GR+ E  IL L   EI V  T   I+INEAV+LAK   D  +P+ ING L
Subjt:  FTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

A7GWZ7 Transcription antitermination protein NusB2.3e-0535.37Show/hide
Query:  VAVVDGWDSRVLK---IEKVIPPTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        +  ++ +D+  LK   +++++ P  K K   R  I+EL IL L + E+   GT   ++INEA++LAK     +AP+ ING L
Subjt:  VAVVDGWDSRVLK---IEKVIPPTWKNKPAGR--ILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

B1WXY6 Transcription antitermination protein NusB6.6e-0543.75Show/hide
Query:  WKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIK
        W+ K   +I +  IL LA++EI  +    ++ INEAV+LAKR+ D    R ING LR F   IK
Subjt:  WKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIK

Q18B61 Transcription antitermination protein NusB3.5e-0639.19Show/hide
Query:  KIEKVIPPTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
        KI+++I    KN    R+  +++ IL L++ EI  + T +++ INEAV+LAK +CD  +P+ ING L + V +I
Subjt:  KIEKVIPPTWKNKPAGRI--LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI

Q8GIR7 Transcription antitermination protein NusB1.9e-0452.08Show/hide
Query:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
        L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Subjt:  LELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR

Arabidopsis top hitse value%identityAlignment
AT4G26370.1 antitermination NusB domain-containing protein9.8e-8974.42Show/hide
Query:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPP
        +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DEK+S IEAE+L+APP
Subjt:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPP

Query:  KMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
        K+VYSKL+LRF +KLL AVVD WDS V+ IEK+ PP WK+ PAGRILE  ILHLAMSE+ V+ TRH IVINEAVDLAKRFCDG+APRIINGCLRTFVKD 
Subjt:  KMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILHLAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI

Query:  KEIDSTHAGD-KQEV
            +  A + KQEV
Subjt:  KEIDSTHAGD-KQEV

AT4G26370.2 antitermination NusB domain-containing protein2.5e-3673.96Show/hide
Query:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEIL
        +PK+DKSG+  SPRAARELAL I+YAACLEGSDP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DEK+S I   ++
Subjt:  IPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRESGYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATTCTGCCTTCCTAACCATTTATCTTCCTCTACTCAATCTTCATCTTCCCATCCCAAACTCTTT
CCCCCACGCTCTCTCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATTGCCAGAGCGTCCGACATTGGACTCCGA
GATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGACTCTTCACCCATGGGGATAAAGTTACTACTATCAGGCCTAATTTTCATTTCTCCTAC
CACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCT
ACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCC
AGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCG
GGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGTTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGC
AAGGATGAAAAGGATTCTACAATTGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCA
GTTGTGGACGGATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTCCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCAC
CTGGCTATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATT
ATTAATGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTAGCTCCACCCACCTCCCCTTATCCGTTCCCCAAATTCTGCCTTCCTAACCATTTATCTTCCTCTACTCAATCTTCATCTTCCCATCCCAAACTCTTT
CCCCCACGCTCTCTCTTTCATCTTAGTTTCTCCACCTCCTTTTCAACCCTTCAGTCCCTTAAACCCTCACCTTTCATTGCCAGAGCGTCCGACATTGGACTCCGA
GATTCCGTTCATGCTGACCAACCTGGTGATTCTGCCGACAAGACTGGACTCTTCACCCATGGGGATAAAGTTACTACTATCAGGCCTAATTTTCATTTCTCCTAC
CACATTTCTGGAATATGTCGTTTCCCGTTTCGTGCTAGTTCAATCGTTCCCCACGCAAAAGATCCCATGCCACACCTCTGCCCTCAAGCCTCACTTCGGGCTTCT
ACTTCTTTTTCTGAAAATTGTGTGGCTGAAGAGAGGAATTCTATTGCAGTTTCTTCCATTGAAACAATACCAAAGGTCGACAAGAGCGGTAAATTTTGTAGCCCC
AGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCTGATCCCGTTCGACTCTTTGAGAAGCGGTTGAATGCCCGACGAGAATCG
GGATATGAATTTGACAAGACATCATTAATGGAATATAATCATATGAGTTTTGGAGGCCCGCCAGTTACCGTAGAAACAGTTGAAGAAGCAGATGAGCTTTTACGC
AAGGATGAAAAGGATTCTACAATTGAGGCAGAAATCCTCGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCA
GTTGTGGACGGATGGGACAGCCGTGTGCTTAAAATTGAAAAAGTCATTCCTCCAACTTGGAAGAACAAGCCAGCAGGACGGATTCTAGAGCTTTGTATTCTCCAC
CTGGCTATGTCTGAAATAACGGTTATTGGAACAAGGCATCAGATTGTCATTAATGAGGCCGTTGATCTTGCAAAACGATTCTGTGATGGAGCAGCACCCCGTATT
ATTAATGGGTGCCTTAGGACCTTTGTAAAGGACATCAAAGAAATTGATTCAACTCATGCTGGAGACAAGCAAGAAGTCCGGGCATGA
Protein sequenceShow/hide protein sequence
MSLAPPTSPYPFPKFCLPNHLSSSTQSSSSHPKLFPPRSLFHLSFSTSFSTLQSLKPSPFIARASDIGLRDSVHADQPGDSADKTGLFTHGDKVTTIRPNFHFSY
HISGICRFPFRASSIVPHAKDPMPHLCPQASLRASTSFSENCVAEERNSIAVSSIETIPKVDKSGKFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNARRES
GYEFDKTSLMEYNHMSFGGPPVTVETVEEADELLRKDEKDSTIEAEILAAPPKMVYSKLILRFTRKLLVAVVDGWDSRVLKIEKVIPPTWKNKPAGRILELCILH
LAMSEITVIGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIKEIDSTHAGDKQEVRA