; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014600 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014600
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNusB domain-containing protein
Genome locationtig00000892:118553..124108
RNA-Seq ExpressionSgr014600
SyntenySgr014600
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0031564 - transcription antitermination (biological process)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR006027 - NusB/RsmB/TIM44
IPR011605 - NusB antitermination factor
IPR035926 - NusB-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153810.1 uncharacterized protein LOC111021236 [Momordica charantia]1.2e-12979.05Show/hide
Query:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM
        MEG SLLFCLSC SSPC  FRPNF+    H+S IC RFP               SAA+ YVKDSVPH C+QASLRASTS  + S+  ++ S S SS+EI+
Subjt:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM

Query:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPK
        PKVDK+GR CSPRAARELALSIVYAACLEGSDPVRLFEKRLN RREPGYEFDK SLM YNHMSFGGPPVTVETAEEADELLRKDE D AIEA ILAAPPK
Subjt:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPK

Query:  MVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE
        MVYSKLILRF+RKLLVAVVDRWD+HVLKIE VIP  WK+KPAGRILELCILHLAMSEITV+GTR QIVINEA+DLAKRFCDGAAPRIINGCLRTFVKDI+
Subjt:  MVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE

Query:  EIDSTHAQEKQEVVG
        EIDST+A+EKQEV G
Subjt:  EIDSTHAQEKQEVVG

XP_022948088.1 uncharacterized protein LOC111451776 [Cucurbita moschata]4.5e-12180.2Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF    VS IC RFP              TSA + YVK+SVPH C QASLRASTS S+ S+ K++ S   SS E +PK+DKSGRFCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SI+YA+CLEGSDPVRLFEKRLN R EPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDE DSAIEAEILAAPPK+VYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
        RWDSHVLKI+KVIP  WK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+E DSTHA+ K+ V
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

XP_023007062.1 uncharacterized protein LOC111499665 [Cucurbita maxima]9.9e-12178.84Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF  H     C  FP              TSA + YVK+SVP  C QASLRASTS S+ S+ K++ S   SS E +PK+DKSGRFCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SI+YA+CLEGSDPVRLFEKRLN R EPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDE DSAIEAEILAAPPK+VYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
        RWDSHVLKI+KVIP  WK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+E DSTHA+ K+ V
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

XP_038901769.1 uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida]2.2e-12080.2Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF   H+S IC RFP               S+ + +VKDS+ H C QASLRASTS  +  +A+D  S S SS E +PKVDKSG+FCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SIVYAACLEGSDPVRLFEKRLNTRRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE DS IEAEILAAPPKMVYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
         WDS VLKIEKVIPP WK+KPA RILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+EIDS+HA+EKQEV
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

XP_038901770.1 uncharacterized protein LOC120088495 isoform X2 [Benincasa hispida]2.2e-12080.2Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF   H+S IC RFP               S+ + +VKDS+ H C QASLRASTS  +  +A+D  S S SS E +PKVDKSG+FCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SIVYAACLEGSDPVRLFEKRLNTRRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE DS IEAEILAAPPKMVYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
         WDS VLKIEKVIPP WK+KPA RILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+EIDS+HA+EKQEV
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

TrEMBL top hitse value%identityAlignment
A0A0A0KWZ5 NusB domain-containing protein2.2e-11878.97Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF Y H+S IC +FP               S+ + ++KDS+P FC QASLRASTS S+  +A++  S S SS E++PKVDKSG+FCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SIVYAACLEGSDPVRLFEKRLN RRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE DS IEAEILAAPPK+VYSKLILRFTRKLLVAV D
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEK
         WDS  LKIEKVIPP WKNKPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+EIDS  A+EK
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEK

A0A5D3DDJ4 NusB/RsmB/TIM442.2e-11878.16Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF  +H+S IC +FP               S+ + +VK+S+P  C QASLRASTS  +  +A++  S S SS E +PK+DKSG+FCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SIVYAACLEGSDPVRLFEKRLN+RRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE DS IEAEILAAPPKMVYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
         WD+  LKIEKVIPP WKNKPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+E DST A+EKQEV
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

A0A6J1DHW5 uncharacterized protein LOC1110212365.7e-13079.05Show/hide
Query:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM
        MEG SLLFCLSC SSPC  FRPNF+    H+S IC RFP               SAA+ YVKDSVPH C+QASLRASTS  + S+  ++ S S SS+EI+
Subjt:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM

Query:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPK
        PKVDK+GR CSPRAARELALSIVYAACLEGSDPVRLFEKRLN RREPGYEFDK SLM YNHMSFGGPPVTVETAEEADELLRKDE D AIEA ILAAPPK
Subjt:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPK

Query:  MVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE
        MVYSKLILRF+RKLLVAVVDRWD+HVLKIE VIP  WK+KPAGRILELCILHLAMSEITV+GTR QIVINEA+DLAKRFCDGAAPRIINGCLRTFVKDI+
Subjt:  MVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE

Query:  EIDSTHAQEKQEVVG
        EIDST+A+EKQEV G
Subjt:  EIDSTHAQEKQEVVG

A0A6J1G8E0 uncharacterized protein LOC1114517762.2e-12180.2Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF    VS IC RFP              TSA + YVK+SVPH C QASLRASTS S+ S+ K++ S   SS E +PK+DKSGRFCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SI+YA+CLEGSDPVRLFEKRLN R EPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDE DSAIEAEILAAPPK+VYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
        RWDSHVLKI+KVIP  WK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+E DSTHA+ K+ V
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

A0A6J1KXJ4 uncharacterized protein LOC1114996654.8e-12178.84Show/hide
Query:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL
        RPNFHF  H     C  FP              TSA + YVK+SVP  C QASLRASTS S+ S+ K++ S   SS E +PK+DKSGRFCSPRAARELAL
Subjt:  RPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFCSPRAARELAL

Query:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
        SI+YA+CLEGSDPVRLFEKRLN R EPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDE DSAIEAEILAAPPK+VYSKLILRFTRKLLVAVVD
Subjt:  SIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD

Query:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV
        RWDSHVLKI+KVIP  WK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI+E DSTHA+ K+ V
Subjt:  RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEV

SwissProt top hitse value%identityAlignment
A7GWZ7 Transcription antitermination protein NusB1.1e-0535.37Show/hide
Query:  VAVVDRWDSHVLK---IEKVIPPNWKNKPAGR--ILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        +  ++ +D+  LK   +++++ P  K K   R  I+EL IL L + E+   GT   ++INEA++LAK     +AP+ ING L
Subjt:  VAVVDRWDSHVLK---IEKVIPPNWKNKPAGR--ILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

B1WXY6 Transcription antitermination protein NusB1.4e-0534.88Show/hide
Query:  LVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE
        L+  ++R    + +  + +  +W+ K   +I +  IL LA++EI  L    ++ INEAV+LAKR+ D    R ING LR F   I+
Subjt:  LVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE

B5YE05 Transcription antitermination protein NusB2.5e-0538.37Show/hide
Query:  DRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEI-TVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDS
        D  DS + +  K    NWK +  G  LE  IL LA++E+ T       + +NEAV+LAK++    A R +NG LR  V++ EE+ S
Subjt:  DRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEI-TVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDS

Q18B61 Transcription antitermination protein NusB5.9e-0740.54Show/hide
Query:  KIEKVIPPNWKNKPAGRI--LELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI
        KI+++I  + KN    R+  +++ IL L++ EI  L T +++ INEAV+LAK +CD  +P+ ING L + V +I
Subjt:  KIEKVIPPNWKNKPAGRI--LELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDI

Q8GIR7 Transcription antitermination protein NusB7.2e-0554.17Show/hide
Query:  LELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
        L+  IL LA +EI  LGT  Q+ INEAV+LA R+ D    R ING LR
Subjt:  LELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR

Arabidopsis top hitse value%identityAlignment
AT4G26370.1 antitermination NusB domain-containing protein7.4e-9060.19Show/hide
Query:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM
        MEGT    CL  SS+ C  F  N   P  H S     F    +  PT   +  T    +          S +SLR   S ++ +L     S        M
Subjt:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM

Query:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPK
        PK+DKSGR  SPRAARELAL I+YAACLEGSDP+RLFEKR+N RREPGYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DE +S IEAE+L+APPK
Subjt:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPK

Query:  MVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE
        +VYSKL+LRF +KLL AVVD+WDSHV+ IEK+ PP+WK+ PAGRILE  ILHLAMSE+ VL TRH IVINEAVDLAKRFCDG+APRIINGCLRTFVKD  
Subjt:  MVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIE

Query:  EIDSTHAQE-KQEV
           +  A E KQEV
Subjt:  EIDSTHAQE-KQEV

AT4G26370.2 antitermination NusB domain-containing protein3.1e-3549.74Show/hide
Query:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM
        MEGT    CL  SS+ C  F  N   P  H S     F    +  PT   +  T    +          S +SLR   S ++ +L     S        M
Subjt:  MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIM

Query:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEIL
        PK+DKSGR  SPRAARELAL I+YAACLEGSDP+RLFEKR+N RREPGYEFDK+SL+EYNHMSFGGPPV  ET EE DEL+R DE +S I   ++
Subjt:  PKVDKSGRFCSPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAACCTCCCTTCTATTCTGCCTTTCCTGCTCTTCCTCTCCGTGCGTCAGGTTCAGGCCTAATTTTCATTTCCCTTACCACCACGTTTCTGAAATATGCCGCCG
GTTCCCAACTACAACTACACCTACACCTACACCTACAACTACAACTACAACTAGTGCAGCCCTTGCCTACGTTAAAGATTCCGTGCCTCACTTCTGCTCTCAAGCCTCGC
TTCGCGCTTCTACTTCTCTTTCCCAAATTTCGCTGGCTAAAGACGCCCATTCTACTTCACCTTCTTCCAGAGAAATAATGCCCAAGGTCGACAAGAGCGGGAGGTTTTGC
AGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCCGACCCAGTTCGGCTCTTCGAGAAGCGGTTGAATACCCGGCGAGAACC
GGGATATGAATTTGACAAGACATCATTGATGGAATATAATCATATGAGCTTTGGAGGCCCGCCAGTTACTGTGGAAACAGCTGAAGAAGCAGATGAGCTTTTACGTAAAG
ACGAAAATGATTCTGCAATTGAAGCAGAAATCCTTGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATTTTACGGTTTACACGCAAACTTTTGGTTGCAGTTGTGGAC
AGATGGGACAGTCACGTGCTTAAAATCGAAAAAGTAATTCCTCCAAATTGGAAGAACAAGCCAGCAGGACGGATTCTGGAACTTTGTATTCTTCACCTGGCTATGTCTGA
AATAACGGTTCTTGGAACAAGGCATCAGATTGTCATTAACGAGGCCGTTGATCTTGCAAAACGATTCTGTGACGGAGCAGCACCTCGTATTATTAATGGGTGCCTTAGGA
CCTTTGTGAAGGACATCGAAGAAATTGATTCAACTCATGCTCAAGAGAAGCAAGAAGTAGTCGGTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAACCTCCCTTCTATTCTGCCTTTCCTGCTCTTCCTCTCCGTGCGTCAGGTTCAGGCCTAATTTTCATTTCCCTTACCACCACGTTTCTGAAATATGCCGCCG
GTTCCCAACTACAACTACACCTACACCTACACCTACAACTACAACTACAACTAGTGCAGCCCTTGCCTACGTTAAAGATTCCGTGCCTCACTTCTGCTCTCAAGCCTCGC
TTCGCGCTTCTACTTCTCTTTCCCAAATTTCGCTGGCTAAAGACGCCCATTCTACTTCACCTTCTTCCAGAGAAATAATGCCCAAGGTCGACAAGAGCGGGAGGTTTTGC
AGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGCTTGTTTAGAAGGCTCCGACCCAGTTCGGCTCTTCGAGAAGCGGTTGAATACCCGGCGAGAACC
GGGATATGAATTTGACAAGACATCATTGATGGAATATAATCATATGAGCTTTGGAGGCCCGCCAGTTACTGTGGAAACAGCTGAAGAAGCAGATGAGCTTTTACGTAAAG
ACGAAAATGATTCTGCAATTGAAGCAGAAATCCTTGCAGCCCCACCAAAGATGGTCTACAGCAAACTGATTTTACGGTTTACACGCAAACTTTTGGTTGCAGTTGTGGAC
AGATGGGACAGTCACGTGCTTAAAATCGAAAAAGTAATTCCTCCAAATTGGAAGAACAAGCCAGCAGGACGGATTCTGGAACTTTGTATTCTTCACCTGGCTATGTCTGA
AATAACGGTTCTTGGAACAAGGCATCAGATTGTCATTAACGAGGCCGTTGATCTTGCAAAACGATTCTGTGACGGAGCAGCACCTCGTATTATTAATGGGTGCCTTAGGA
CCTTTGTGAAGGACATCGAAGAAATTGATTCAACTCATGCTCAAGAGAAGCAAGAAGTAGTCGGTGAATGA
Protein sequenceShow/hide protein sequence
MEGTSLLFCLSCSSSPCVRFRPNFHFPYHHVSEICRRFPTTTTPTPTPTTTTTTSAALAYVKDSVPHFCSQASLRASTSLSQISLAKDAHSTSPSSREIMPKVDKSGRFC
SPRAARELALSIVYAACLEGSDPVRLFEKRLNTRREPGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDENDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVD
RWDSHVLKIEKVIPPNWKNKPAGRILELCILHLAMSEITVLGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKDIEEIDSTHAQEKQEVVGE