CuGenDBv2

Tan0003598 (gene) of Snake gourd v1 genome

Gene ID: Tan0003598
Organism: Trichosanthes anguina (Snake gourd v1)
Description: NusB domain-containing protein
Genome location: LG05:2236788..2242901
RNA-Seq Expression: Tan0003598
Synteny: Tan0003598
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0031564 - transcription antitermination (biological process)
  GO:0003723 - RNA binding (molecular function)
InterPro domains:
  IPR006027 - NusB/RsmB/TIM44
  IPR011605 - NusB antitermination factor
  IPR035926 - NusB-like superfamily
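
The genome location field follows a `chromosome:start..end` pattern. A minimal sketch of how it could be split and the locus span computed is given here; the function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both
    # endpoints count toward the length.
    return end - start + 1


chrom, start, end = parse_location("LG05:2236788..2242901")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 6114 bp, which is longer than the mRNA listed further down and would be consistent with the gene containing introns.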


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6605513.1 hypothetical protein SDJN03_02830, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.7e-144 | % identity: 88.33
Query:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA
        MVELVSMVVLPTRLD V  TR N HFS  +SGICRF FPTSAI+PYVK+SVP  C QASLR STSFS+NSV KE  S+LVSSIE IPK+DKSGRFCSPRA
Subjt:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA

Query:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL
        ARELALSI+YA+CLEG+DPVRLFEKRLNAR E GYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPK+VYSKLILRFTRKL
Subjt:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL

Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS
        LVAVVDRWDSHVLKI+KVIP TWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+IQE DSTHA+ K+VVS
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS

XP_022948088.1 uncharacterized protein LOC111451776 [Cucurbita moschata] | e value: 9.4e-145 | % identity: 88.33
Query:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA
        MVELVS+VVLPTRLD V  TR N HFS  +SGICRF FPTSAI+PYVK+SVPH C QASLR STSFS+NSV KE  S+LVSSIE IPK+DKSGRFCSPRA
Subjt:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA

Query:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL
        ARELALSI+YA+CLEG+DPVRLFEKRLNAR E GYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPK+VYSKLILRFTRKL
Subjt:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL

Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS
        LVAVVDRWDSHVLKI+KVIP TWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+IQE DSTHA+ K+VVS
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS

XP_023007062.1 uncharacterized protein LOC111499665 [Cucurbita maxima] | e value: 3.6e-144 | % identity: 88.33
Query:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA
        MVELVSMVVLPTRLD V  TR N HFS H+SGIC F FPTSAI+PYVK+SVP  C QASLR STSFS+NSV KE  S+LVSSIE IPK+DKSGRFCSPRA
Subjt:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA

Query:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL
        ARELALSI+YA+CLEG+DPVRLFEKRLNAR E GYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPK+VYSKLILRFTRKL
Subjt:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL

Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS
        LVAVVDRWDSHVLKI+KVIP TWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+IQE DSTHA+ K+VVS
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS

XP_023532267.1 uncharacterized protein LOC111794466 [Cucurbita pepo subsp. pepo] | e value: 3.6e-144 | % identity: 88
Query:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA
        MVELVSMVVLPTRLD V  TR N HFS H+SGICRF FPTSAI+PYVK+SVP  C QASLR STSFS+NSV KE  S+LVSSIE IPK+DKSGRFCSPRA
Subjt:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA

Query:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL
        ARELALSI+YA+CLEG+DPVRLFEKRLNAR E GYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAE+LAAPPK+VYSKLILRFTRKL
Subjt:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL

Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS
        LVAVVDRWDSHVLKI+KVIP TWKDKPAGRILEL ILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+IQE DSTHA+ K+VVS
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS

XP_038901769.1 uncharacterized protein LOC120088495 isoform X1 [Benincasa hispida] | e value: 1.6e-136 | % identity: 87.11
Query:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL
        DKV TTR N HFS HISGICRF F  S+I+P+VKDS+ H C QASLR STSF EN VA++R SI VSS+ETIPKVDKSG+FCSPRAARELALSIVYAACL
Subjt:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL

Query:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK
        EG+DPVRLFEKRLN RRELGYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDEKDS IEAEILAAPPKMVYSKLILRFTRKLLVAVVD WDS VLK
Subjt:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK

Query:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA
        IEKVIPPTWKDKPA RILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+I+EIDS+HA+EKQ V A
Subjt:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KWZ5 NusB domain-containing protein | e value: 1.1e-133 | % identity: 86.17
Query:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL
        DKV TTR N HFSYHISGIC+F F  S+I+P++KDS+P FC QASLR STSFSEN VA+ER SI +SSIE IPKVDKSG+FCSPRAARELALSIVYAACL
Subjt:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL

Query:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK
        EG+DPVRLFEKRLNARRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE+DS IEAEILAAPPK+VYSKLILRFTRKLLVAV D WDS  LK
Subjt:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK

Query:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEK
        IEKVIPPTWK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+I+EIDS  A+EK
Subjt:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEK

A0A1S3BNR2 uncharacterized protein LOC103491638 isoform X1 | e value: 2.3e-133 | % identity: 85.02
Query:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL
        DKV TTR N HFS HISGIC+F F  S+I+P+VK+S+P  C QASLR STSF EN VA+ER SI VSSIETIPK+DKSG+FCSPRAARELALSIVYAACL
Subjt:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL

Query:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK
        EG+DPVRLFEKRLN+RRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE+DS IEAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+  LK
Subjt:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK

Query:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA
        IEKVIPPTWK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+I+E DST A+EKQ V A
Subjt:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA

A0A5D3DDJ4 NusB/RsmB/TIM44 | e value: 2.3e-133 | % identity: 85.02
Query:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL
        DKV TTR N HFS HISGIC+F F  S+I+P+VK+S+P  C QASLR STSF EN VA+ER SI VSSIETIPK+DKSG+FCSPRAARELALSIVYAACL
Subjt:  DKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACL

Query:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK
        EG+DPVRLFEKRLN+RRE GYEFDKTSLMEYNHMSFGGPPVTVET EEADELLRKDE+DS IEAEILAAPPKMVYSKLILRFTRKLLVAVVD WD+  LK
Subjt:  EGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLK

Query:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA
        IEKVIPPTWK+KPAGRILELCILHLAMSEITV+GTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+I+E DST A+EKQ V A
Subjt:  IEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA

A0A6J1G8E0 uncharacterized protein LOC111451776 | e value: 4.6e-145 | % identity: 88.33
Query:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA
        MVELVS+VVLPTRLD V  TR N HFS  +SGICRF FPTSAI+PYVK+SVPH C QASLR STSFS+NSV KE  S+LVSSIE IPK+DKSGRFCSPRA
Subjt:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA

Query:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL
        ARELALSI+YA+CLEG+DPVRLFEKRLNAR E GYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPK+VYSKLILRFTRKL
Subjt:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL

Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS
        LVAVVDRWDSHVLKI+KVIP TWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+IQE DSTHA+ K+VVS
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS

A0A6J1KXJ4 uncharacterized protein LOC111499665 | e value: 1.7e-144 | % identity: 88.33
Query:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA
        MVELVSMVVLPTRLD V  TR N HFS H+SGIC F FPTSAI+PYVK+SVP  C QASLR STSFS+NSV KE  S+LVSSIE IPK+DKSGRFCSPRA
Subjt:  MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRA

Query:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL
        ARELALSI+YA+CLEG+DPVRLFEKRLNAR E GYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPK+VYSKLILRFTRKL
Subjt:  ARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKL

Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS
        LVAVVDRWDSHVLKI+KVIP TWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVK+IQE DSTHA+ K+VVS
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVS

SwissProt top hits (columns: hit, e value, % identity, alignment)
A7GWZ7 Transcription antitermination protein NusB | e value: 6.2e-06 | % identity: 35.37
Query:  VAVVDRWDSHVLK---IEKVIPPTWKDKPAGR--ILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCL
        +  ++ +D+  LK   +++++ P  K+K   R  I+EL IL L + E+   GT   ++INEA++LAK     +AP+ ING L
Subjt:  VAVVDRWDSHVLK---IEKVIPPTWKDKPAGR--ILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCL

B1WXY6 Transcription antitermination protein NusB | e value: 5.2e-05 | % identity: 33.72
Query:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQ
        L+  ++R    + +  + +   W+ K   +I +  IL LA++EI  +    ++ INEAV+LAKR+ D    R ING LR F  +I+
Subjt:  LVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQ

B1XIZ3 Transcription antitermination protein NusB | e value: 1.2e-04 | % identity: 37.97
Query:  KIEKVIPPTWKDKPAGRI--LELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDS
        +I++VI  +  D    R+  L+  IL +A++EI  + T +++ INEAV+LAKR+ D    R ING LR     ++  DS
Subjt:  KIEKVIPPTWKDKPAGRI--LELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDS

Q18B61 Transcription antitermination protein NusB | e value: 2.3e-05 | % identity: 37.84
Query:  KIEKVIPPTWKDKPAGRI--LELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNI
        KI+++I    K+    R+  +++ IL L++ EI  + T +++ INEAV+LAK +CD  +P+ ING L + V  I
Subjt:  KIEKVIPPTWKDKPAGRI--LELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNI

Q8GIR7 Transcription antitermination protein NusB | e value: 2.0e-04 | % identity: 52.08
Query:  LELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR
        L+  IL LA +EI  +GT  Q+ INEAV+LA R+ D    R ING LR
Subjt:  LELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLR

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT4G26370.1 antitermination NusB domain-containing protein | e value: 4.6e-89 | % identity: 68.29
Query:  TQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPV
        T+++LRT T  +E           V  +  +PK+DKSGR  SPRAARELAL I+YAACLEG+DP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV
Subjt:  TQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPV

Query:  TVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVI
          ET EE DEL+R DEK+S IEAE+L+APPK+VYSKL+LRF +KLL AVVD+WDSHV+ IEK+ PP WK  PAGRILE  ILHLAMSE+ V+ TRH IVI
Subjt:  TVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIPPTWKDKPAGRILELCILHLAMSEITVVGTRHQIVI

Query:  NEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQE-KQVVS
        NEAVDLAKRFCDG+APRIINGCLRTFVK+     +  A E KQ VS
Subjt:  NEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQE-KQVVS

AT4G26370.2 antitermination NusB domain-containing protein | e value: 4.5e-36 | % identity: 61.9
Query:  TQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPV
        T+++LRT T  +E           V  +  +PK+DKSGR  SPRAARELAL I+YAACLEG+DP+RLFEKR+NARRE GYEFDK+SL+EYNHMSFGGPPV
Subjt:  TQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVYAACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPV

Query:  TVETAEEADELLRKDEKDSAIEAEIL
          ET EE DEL+R DEK+S I   ++
Subjt:  TVETAEEADELLRKDEKDSAIEAEIL


Sequences
CDS sequence
ATGGTGGAACTTGTATCAATGGTGGTTTTGCCCACAAGACTGGATAAAGTTTTCACTACCAGGCTTAATCTTCATTTCTCCTACCACATTTCTGGAATATGCCGGTTCTC
ATTCCCCACTAGTGCAATCCTTCCTTATGTAAAAGATTCTGTGCCACACTTTTGCACTCAAGCCTCGCTTCGCACTTCTACTTCCTTCTCTGAAAATTCCGTGGCTAAAG
AGCGTGATTCTATTTTAGTTTCTTCCATAGAAACAATACCAAAGGTCGACAAGAGCGGGAGATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTAT
GCAGCTTGTTTAGAAGGCGCTGATCCAGTTCGACTCTTCGAGAAGCGGTTGAATGCCCGTCGTGAACTGGGATACGAATTTGACAAGACATCATTGATGGAATATAATCA
TATGAGCTTTGGAGGCCCACCAGTTACTGTGGAAACGGCTGAAGAAGCAGATGAGCTTTTACGTAAGGATGAAAAGGATTCTGCAATTGAGGCAGAAATCCTTGCGGCCC
CACCAAAGATGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACAGATGGGACAGTCACGTGCTTAAAATTGAAAAAGTCATTCCT
CCAACTTGGAAGGACAAGCCCGCAGGACGAATTCTGGAACTTTGTATTCTTCACCTGGCTATGTCTGAAATAACCGTTGTCGGAACAAGGCATCAGATTGTCATTAACGA
GGCTGTTGATCTCGCAAAACGCTTCTGCGATGGAGCAGCACCTCGAATTATTAATGGGTGCCTTAGGACCTTTGTGAAGAACATCCAAGAAATTGATTCAACTCATGCTC
AAGAGAAGCAAGTAGTCAGTGCATGA
mRNA sequence
GAATTTTGGGGGCATGTCCTTAGCTCCACGACCCGCCTTCCCTTATCTGTATTGAGTATGCTCCCCATCCCAAACTGTTTCCCTTCAACGCTCTCTTTCATCTCACTTTC
TCCATTCCCGGCTTAAACCCTCCTCTTTCATCATCGTCAGAGGTGAACTGGAACTGGAACTGGAACTGGAGCAGCCATATGGTTTGAAATTTTCCAAGCCCCAAGATGGT
GGAACTTGTATCAATGGTGGTTTTGCCCACAAGACTGGATAAAGTTTTCACTACCAGGCTTAATCTTCATTTCTCCTACCACATTTCTGGAATATGCCGGTTCTCATTCC
CCACTAGTGCAATCCTTCCTTATGTAAAAGATTCTGTGCCACACTTTTGCACTCAAGCCTCGCTTCGCACTTCTACTTCCTTCTCTGAAAATTCCGTGGCTAAAGAGCGT
GATTCTATTTTAGTTTCTTCCATAGAAACAATACCAAAGGTCGACAAGAGCGGGAGATTTTGTAGCCCCAGAGCTGCTAGAGAGCTCGCTTTGTCAATTGTTTATGCAGC
TTGTTTAGAAGGCGCTGATCCAGTTCGACTCTTCGAGAAGCGGTTGAATGCCCGTCGTGAACTGGGATACGAATTTGACAAGACATCATTGATGGAATATAATCATATGA
GCTTTGGAGGCCCACCAGTTACTGTGGAAACGGCTGAAGAAGCAGATGAGCTTTTACGTAAGGATGAAAAGGATTCTGCAATTGAGGCAGAAATCCTTGCGGCCCCACCA
AAGATGGTCTACAGCAAACTGATCTTACGGTTTACACGAAAACTTTTGGTTGCAGTTGTGGACAGATGGGACAGTCACGTGCTTAAAATTGAAAAAGTCATTCCTCCAAC
TTGGAAGGACAAGCCCGCAGGACGAATTCTGGAACTTTGTATTCTTCACCTGGCTATGTCTGAAATAACCGTTGTCGGAACAAGGCATCAGATTGTCATTAACGAGGCTG
TTGATCTCGCAAAACGCTTCTGCGATGGAGCAGCACCTCGAATTATTAATGGGTGCCTTAGGACCTTTGTGAAGAACATCCAAGAAATTGATTCAACTCATGCTCAAGAG
AAGCAAGTAGTCAGTGCATGACTCTGTGAGGATATCAAAAAATTTGACTAAACTCATGCTCCAGAGAAGCAAGAAGTGACTCTGAAATTGGAGGGCACTTCTCTTCCATG
GAAAAGCTAATTGGAGTATGGAGTTTTAGATGTGTAATTTATCTAGCGAGGTTGTTTATACAAAAGATTTTTGGTAATTGAAGTGTATCAATTTTTACGAAAAATTTTCA
ATTTCATCAAATTAAACTCTAGACTTACATAATTGTTGCAA
Protein sequence
MVELVSMVVLPTRLDKVFTTRLNLHFSYHISGICRFSFPTSAILPYVKDSVPHFCTQASLRTSTSFSENSVAKERDSILVSSIETIPKVDKSGRFCSPRAARELALSIVY
AACLEGADPVRLFEKRLNARRELGYEFDKTSLMEYNHMSFGGPPVTVETAEEADELLRKDEKDSAIEAEILAAPPKMVYSKLILRFTRKLLVAVVDRWDSHVLKIEKVIP
PTWKDKPAGRILELCILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFVKNIQEIDSTHAQEKQVVSA
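
The CDS and protein sequences listed for this gene can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper name `translate` is hypothetical and not part of any CuGenDB tooling. Only the first and last few codons of the CDS are used here; the full CDS text can be passed in the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string to protein; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with ambiguous bases (not in the table) become 'X'.
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGTGGAACTTGTATCAATGGTG"))
print(translate("GTCAGTGCATGA"))
```

The first call yields the start of the protein sequence (MVELVSMV) and the second yields its final residues followed by the stop codon (VSA*), which agrees with the protein sequence given above.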