CuGenDBv2

Lsi07G001890 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi07G001890
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Transcription termination factor MTERF5
Genome location: chr07:2061543..2062718
RNA-Seq Expression: Lsi07G001890
Synteny: Lsi07G001890
Gene Ontology terms:
GO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domains:
IPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic
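
The genome location above is written in the form chrom:start..end. As a minimal sketch, the span it covers can be computed as shown below. The record does not state its coordinate convention, so 1-based inclusive coordinates are assumed here, and `parse_location` is a hypothetical helper, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr07:2061543..2062718")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, span, span % 3)
```

Under that assumption the span is 1176 bp, a multiple of three. That is the length expected for a 391-residue protein plus a stop codon, which agrees with the CDS and protein listed under Sequences.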


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAG7015351.1 Transcription termination factor MTERF8, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.5e-174 | 81.07
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N  SLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIM SRRGI+STEKPQSVYKYLSELGFS AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IE+TLKPKI+FFQNLGLVGSDLG+FIS+HS+LLTVSLEK LMPSVEILK+ FP+D+CNSDLL  ++RCS  LM +PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFGKRESQ+RDIVSMVV+TGFSTNTKMF HGLHAI SV+NATF KKVELICSFGI+EKECMRM TSAPVLIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIVRKP+CLMHAMQGRVLPRYRVL+VVKSK LLKK P+ + ILGMSD+DFLDKFV RFPD++ +LL AFRG  + ELQPKEL T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

XP_022929526.1 uncharacterized protein LOC111436067 [Cucurbita moschata] | 4.8e-173 | 80.56
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N  SLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIM SRRGIESTEKPQSVYKYLSELGFS AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IEKTLKPKI+FFQN GLVGSDLG+FIS+HS+LLTVSLEK LMPSVEILK+ FP+D+CNSDLL  ++RC   LM +PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFGKRESQ+R+IVSMVV+TGFSTNTKMF HGLHAI SV+NATF KKVELICSFGI+EKECMRM TSAPVLIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIVRKP+CLMHAMQGRVLPRYRVL+VVKSK LLKK P+ + ILGMSD+DFLDKFV RFPD++ +LL AFRG  + ELQPK+L T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

XP_022984673.1 uncharacterized protein LOC111482884 isoform X1 [Cucurbita maxima] | 7.3e-174 | 80.82
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N PSLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIMSSRRGI STEKPQSVYKYLSELGFS+AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IEKTLKPKI+FFQNLGLVGSDLG+FIS+HS+LLT+SLEK LMPSVEILK+ FP+D+CNSDLL  +RRCS  LM +PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFG RESQ+RD+VSMVV+TGFSTNT+MF HGLHAI SVSNATF KKVELICSFGI+EKECMRM TSAPVLIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIVRKP+CLMHAMQGRVLPRYRVL+VV SKRL KK  + I ILGMSD++FLDKFV RFPD++ +LL AFRG  + ELQPKEL T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

XP_023552362.1 uncharacterized protein LOC111810052 isoform X3 [Cucurbita pepo subsp. pepo] | 4.8e-173 | 81.07
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N  SLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIM SRRGIESTEKPQSVYKYLSELGFS AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IEKTLKPKI+FFQNLGLVGSDLG+FIS+HS+LLTVSLEK LMPSVEIL++ FP+D+CNSDLL  ++RC   LM  PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFGKRESQ+RDIVSMVV+TGFSTNTKMF HGLHAI SV+NATF KKVELICSFGI+EKECMRM TSAPVLIRTS GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIVRKP+CLMHAM+GRVLPRYRVL+VVKSKRLL K P+ I ILGMSD+DFLDKFV RFPD++ +LL AFRG  + ELQPKEL T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

XP_038888519.1 transcription termination factor MTERF5, chloroplastic-like [Benincasa hispida] | 1.6e-173 | 82.29
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFF---SSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAI
        MYALRPSFSVSSRLN PS LRIHPFLYYFF   SSSSHTQ+SASN IVVQYLID FQLSPARA SIMS RRG++STEKPQSVYKYLS+LGFSDAHI+SAI
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFF---SSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAI

Query:  RIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSC
        R+APQIAFS+IEKTLKPKI+FFQNLGLVGSDL KFISRHS+LLTVSLEK LMPSVEILKN  P+D CN DLL  IRR S  LM  P KRLS+NINYL+SC
Subjt:  RIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSC

Query:  GIVDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEF
        GIVDSQ+ MLLKRQ  LFG+ ES+L+DIVS+VV+ GFSTNT+MF HGLHAI SVS  T  KKVELICSFGI+EKECMRM TSAPVLIRTSIGKLK GLEF
Subjt:  GIVDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEF

Query:  FMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSV
        FMNEAKVSKSDIV  P+CLMHAMQGRVLPRYRVL++VKSKRL+KK PKFI  LGMSD+DFLD+FVCRFPDNMKDLLVA+RGNSV
Subjt:  FMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSV

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A6J1C8W1 uncharacterized protein LOC111008506 isoform X1 | 3.1e-154 | 73.91
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFF-SSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRI
        MYALR  FS+S+R   P    I P L YFF SSSS  +ASA+N IVVQYL+DNF LS ARA++IMS R+G+ESTEKP+SV KYLSELGFSDAHIQSAIRI
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFF-SSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRI

Query:  APQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGI
        +PQIAFSS+EKTLKPKI+FFQNLGLVGSDLGKFIS HSSLLTVSL+  L PSVEILKN FP+D+ NS+LL  +RRCS  LM  P  RL VNINY RSCGI
Subjt:  APQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGI

Query:  VDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFM
        VDSQ+SMLLKRQP LFG+RES++RD+VSM V+TGFSTNTKMF HGLHAI SVSNATF KKVELICSFG +EKECM+M TSAPVLIRTSI KLK+G+EFFM
Subjt:  VDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFM

Query:  NEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELA
        NEAKVS+S IVR+P+ LMH+MQGRVLPRYRVL+VVKSKRL +K P+ +  LG+S++DF DKFV RFPDN++DLLVA+ G  V  L+ KELA
Subjt:  NEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELA

A0A6J1EN21 uncharacterized protein LOC111435860 | 4.3e-172 | 80.05
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N  SLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIM SRRG+ESTEKPQSVYKYLSELGFS AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IE+TLKPKI+FFQNLGLVGSDLG+FIS+HS+LLTVSLEK LMPSVEILK+ FP+D+CNSDLL  ++RCS  LM +PY RL VNI+YL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFG RESQ+RDIVSMVV+TGFSTNTKMF HGLH+I SVSNATF KKVELICSFGI+EKECMRM TSAP LIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIV KP+CLMHAMQGRVLPRYRVL+VVKSKRLL+K P+ + ILGMSD+DFLDKFV RFPD++ +LL AFRG  + ELQPKEL T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

A0A6J1END2 uncharacterized protein LOC111436067 | 2.3e-173 | 80.56
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N  SLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIM SRRGIESTEKPQSVYKYLSELGFS AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IEKTLKPKI+FFQN GLVGSDLG+FIS+HS+LLTVSLEK LMPSVEILK+ FP+D+CNSDLL  ++RC   LM +PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFGKRESQ+R+IVSMVV+TGFSTNTKMF HGLHAI SV+NATF KKVELICSFGI+EKECMRM TSAPVLIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIVRKP+CLMHAMQGRVLPRYRVL+VVKSK LLKK P+ + ILGMSD+DFLDKFV RFPD++ +LL AFRG  + ELQPK+L T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

A0A6J1J5Y4 uncharacterized protein LOC111482884 isoform X4 | 4.8e-171 | 80.88
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N PSLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIMSSRRGI STEKPQSVYKYLSELGFS+AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IEKTLKPKI+FFQNLGLVGSDLG FIS+HS+LLT+SLEK LMPSVEILK+ FP+D+CNSD L  +RRCS  L  +PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFG RESQ+RDIVSMVV+TGFSTNTKMF HGLHAI SVSNATF KKVELICSFGI+EKECMRM TSAPVLIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPK
        EA+VS+SDIVRKP+CLMHAMQGRVLPRYRVL+VV SKRL KK  + I ILGMSD++FLDKFV RFPD + +LL AFRG  + ELQPK
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPK

A0A6J1JB80 uncharacterized protein LOC111482884 isoform X1 | 3.6e-174 | 80.82
Query:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA
        MYALR SFS+SSR N PSLLRI P LY FFSSSSH+QASAS GIVVQYLID F+LSPARAVSIMSSRRGI STEKPQSVYKYLSELGFS+AHIQS IR+A
Subjt:  MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIA

Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV
        PQIAFS+IEKTLKPKI+FFQNLGLVGSDLG+FIS+HS+LLT+SLEK LMPSVEILK+ FP+D+CNSDLL  +RRCS  LM +PY RL VNINYL+SCGIV
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIV

Query:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN
         SQ+SMLLKRQP LFG RESQ+RD+VSMVV+TGFSTNT+MF HGLHAI SVSNATF KKVELICSFGI+EKECMRM TSAPVLIRTS+GKLK GLEFFMN
Subjt:  DSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMN

Query:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
        EA+VS+SDIVRKP+CLMHAMQGRVLPRYRVL+VV SKRL KK  + I ILGMSD++FLDKFV RFPD++ +LL AFRG  + ELQPKEL T
Subjt:  EAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT

SwissProt top hits | e value | % identity (alignment follows each hit)
F4IHL3 Transcription termination factor MTERF2, chloroplastic | 1.5e-04 | 25.81
Query:  GIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDL
        G    E+ + + KY   LG     ++  + + P +    +EKT+ PK+ F Q +G+    +G  + +  SLLT SL K + P V  L           D+
Subjt:  GIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDL

Query:  LFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRESQLR
           I      L  +   +L  N+ Y  S GI   Q+  ++   P L       LR
Subjt:  LFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRESQLR

Q84X53 Transcription termination factor MTEF1, chloroplastic | 6.8e-05 | 33.71
Query:  TEKPQS----VYKYLS-ELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILK
        T  P+S    V ++LS E+  S+  I  +I   P++  SS++  L+P + F + LG VG D     SR++ LL  ++E+ L+P +E L+
Subjt:  TEKPQS----VYKYLS-ELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILK

Q9FK23 Transcription termination factor MTERF8, chloroplastic | 7.5e-12 | 24.07
Query:  PQIAFSSIEKTLKPKIDFFQNLGLVGSD---LGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRC-SYTLMIAPYKRLSVNINYLRS
        P I  S ++  L P++DF +NL   G D    G  + R  ++L+ S+E ++   VE LK+F       S+ +F I       +  +  ++L   I +L+ 
Subjt:  PQIAFSSIEKTLKPKIDFFQNLGLVGSD---LGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRC-SYTLMIAPYKRLSVNINYLRS

Query:  CGIVDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLE
        CG     +   L + P +    E+ L   +  +V  G+   TK  A  + A+   S+    + + L  S+G+S ++ + M T  P +++ +   L+  LE
Subjt:  CGIVDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLE

Query:  FFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSK
        + +        +++  P+ L + +  R+  RY   E +KS+
Subjt:  FFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSK

Q9FM80 Transcription termination factor MTERF9, chloroplastic | 2.1e-06 | 20.2
Query:  YLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMI
        YL  +G     I+  +   PQI   ++E  LK  I F   LG+  S +G+ ++   SL + S+E +L P++  L     E       +  + + S  +++
Subjt:  YLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMI

Query:  APYKRLSV--NINYL---RSCGIVDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRM
           +RL +  N  Y+   +  G     +  ++K+ P            ++   +D GF                        ++  + S G+   + +++
Subjt:  APYKRLSV--NINYL---RSCGIVDSQISMLLKRQPGLFGKRESQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRM

Query:  ITSAPVLIRTSI-GKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVA
        +TS   ++  S+   LK    + +NE       + + P  L  ++  R+ PR+R L  +K    ++K P  +  L  +D+ F  ++         D  +A
Subjt:  ITSAPVLIRTSI-GKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVA

Query:  FR
        FR
Subjt:  FR

Q9SZL6 Transcription termination factor MTERF6, chloroplastic/mitochondrial | 5.2e-05 | 27.21
Query:  EKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGL-VGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNF--FPEDKCNSDLLF
        EK   +  +   LG  +  +   I   P++   SI+  L   + F  +LGL     +GK + ++  L+  S++K L P+ E LK+     ED   S    
Subjt:  EKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGL-VGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNF--FPEDKCNSDLLF

Query:  AIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGK
         +      L     K L  N +YL+ CG  DSQI+ ++   P +  K
Subjt:  AIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGK
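
The % identity values in these tables can be related to the alignment rows themselves. The sketch below assumes the usual BLAST-style convention that a letter on the middle line marks an identical residue, "+" marks a conservative substitution, and identity is taken over the full alignment length including gaps. `percent_identity` is a hypothetical helper, and the two rows are copied from the Q84X53 hit above.

```python
def percent_identity(query_row, midline):
    """Percent identity of one pairwise alignment block.

    Letters on the midline are counted as identities; '+' and spaces are not.
    The denominator is the alignment length, i.e. the query row including gaps.
    """
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(query_row)

# Rows taken from the Q84X53 alignment in this record.
query_row = "TEKPQS----VYKYLS-ELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILK"
midline = "T  P+S    V ++LS E+  S+  I  +I   P++  SS++  L+P + F + LG VG D     SR++ LL  ++E+ L+P +E L+"

print(round(percent_identity(query_row, midline), 2))  # 33.71
```

Here 30 identities over 89 alignment columns give 33.71, matching the value listed for that hit. For hits whose alignment spans several blocks, the counts would need to be summed over all blocks before dividing.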

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT1G21150.1 Mitochondrial transcription termination factor family protein | 4.8e-38 | 29.71
Query:  SSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLG
        S +   Q        V YL+D+  LS   A S  S    + S++KP SV     + GF++  I S I+  P++   S E  + PK+ FF ++G   SD  
Subjt:  SSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLVGSDLG

Query:  KFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRESQLRDIVSMVV
        K IS    +L+ SL K L+P  + LK+   E++     L    RC ++L I     +S+ ++  R  G+ D  I  L++  P  F  RE +  ++++ V 
Subjt:  KFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRESQLRDIVSMVV

Query:  DTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRV
          GF      F H + A    S +   +K +L   FG S+++ +  I   P  +  S  K+   LE+ +N   +   DIV +P  L  +M+ R+ PR +V
Subjt:  DTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRV

Query:  LEVVKSKRLLKKQP-KFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGN
        + ++ SK L+KK+   +  IL +   +F+DKFV ++ D M  L+  F  N
Subjt:  LEVVKSKRLLKKQP-KFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGN

AT1G61990.1 Mitochondrial transcription termination factor family protein | 1.5e-20 | 24.69
Query:  FSSSSHTQASASNG-----IVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGL
        FSS+     S  +G       V YL+D+  LS   A SI S +   E    P SV       GF+D+ I + I   P +  +  +K L  K+   Q+ G 
Subjt:  FSSSSHTQASASNG-----IVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGL

Query:  VGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLL--KRQPGLFGKRESQL
          S++ + +S    +L    +K++    + +K          D++ A    SY L          N++ LR  G+    +  LL  K QP + GK     
Subjt:  VGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLL--KRQPGLFGKRESQL

Query:  RDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKV-----------------------------------ELICSFGISEKECMRMITSAPVLIRTS
           +  VV+ GF   T  F   L  +  +S  T  +KV                                   E     G S  E + M+   P  I  S
Subjt:  RDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKV-----------------------------------ELICSFGISEKECMRMITSAPVLIRTS

Query:  IGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKK---QPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSV
        +  +K   EF + + K  ++ +V  P    ++M+ R++PR  +LE + SK LL+K    P    +L  +D+ FLD++V +  + +  L+  F   SV
Subjt:  IGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKK---QPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSV

AT1G62120.1 Mitochondrial transcription termination factor family protein | 5.9e-20 | 23.87
Query:  FFSSSSHTQASASNG-----IVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLG
        F SS+     S+ +G       V YL+D+  L+   A SI S +   ++   P SV   L   GF+D+ I + IR  P++     EK+L PK+ F Q++G
Subjt:  FFSSSSHTQASASNG-----IVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLG

Query:  LVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSV-NINYLRSCGIVDSQI-SMLLKRQPGLFGKRESQ
           S+L + +S    +L     K+L    + +K     DK +      + +  ++L     +   + N+  LR  G+    + S+L+     + GK   +
Subjt:  LVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSV-NINYLRSCGIVDSQI-SMLLKRQPGLFGKRESQ

Query:  LRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSN---------------------ATFNK--------------KVELICSFGISEKECMRMITSAPVLIRT
         ++ +   V+ GF   T  F   L+ +  +S+                     A F K               VE     G S  E + M+   P  I  
Subjt:  LRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSN---------------------ATFNK--------------KVELICSFGISEKECMRMITSAPVLIRT

Query:  SIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQ-PKFIHILGMSDQDFLDKFVCRFPDN--MKDLLVAFRGNSV
        S   +K+  EF + E       +   P  L ++++ R +PR  V++V+ SK LL+ + P    +L  + + FL  +V +  D   + +L+  F G+ V
Subjt:  SIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRVLEVVKSKRLLKKQ-PKFIHILGMSDQDFLDKFVCRFPDN--MKDLLVAFRGNSV

AT3G46950.1 Mitochondrial transcription termination factor family protein | 1.1e-18 | 22.17
Query:  PFLYYFFSSSSHTQASASNG-IVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNL
        PF    FSS+    +S       V YL+++  L+   A +I S +   E    P SV   L   GF D+ I   IR  P++  +  EK+L+PK+ F ++ 
Subjt:  PFLYYFFSSSSHTQASASNG-IVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNL

Query:  GLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLS--VNINYLRSCGIVDSQI-SMLLKRQPGLFGKRE
        G   S++ + +S   ++L    E+++    + +K+   + K            S  +     K+ +   NI+ LR  G+    + S+L+ R   + GK  
Subjt:  GLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLS--VNINYLRSCGIVDSQI-SMLLKRQPGLFGKRE

Query:  SQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKV-----------------------------------ELICSFGISEKECMRMITSAPVLI
         +  + +  VVD GF      F   LH +  +S  T  +KV                                   E +   G+ E+E + ++ S P  I
Subjt:  SQLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKV-----------------------------------ELICSFGISEKECMRMITSAPVLI

Query:  RTSIGKLKSGLEFFM-----------------------NEAKVSKSDIVRK------------PSCLMHAMQGRVLPRYRVLEVVKSKRLL-KKQPKFIH
        R+S  K+   +E F+                        E    K +++ K            P+ L ++++ R++PR  V++ + SK L+  + P    
Subjt:  RTSIGKLKSGLEFFM-----------------------NEAKVSKSDIVRK------------PSCLMHAMQGRVLPRYRVLEVVKSKRLL-KKQPKFIH

Query:  ILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSV
        +L  +DQ+FL ++V +    +  L+  F    V
Subjt:  ILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSV

AT5G64950.1 Mitochondrial transcription termination factor family protein | 3.5e-65 | 39.39
Query:  LYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLV
        L++F  SS+ T AS SN   V++L DN    P  A++I      ++S E+P+SV + L    FSD  IQ +IR+ P++ F ++EK L+PK+ FF+++G  
Subjt:  LYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEKTLKPKIDFFQNLGLV

Query:  GSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIA-PYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRESQLRD
        GS LGKF+S++SS++ VSL K L+P+VEILK+       + DL   + RC + L+   P   L  NI+YL +CGIV SQ++ LL+RQP +F   E +LR 
Subjt:  GSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIA-PYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRESQLRD

Query:  IVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRV
         VS  +D GF+ N++M  H + ++ S+S  TF++KV+L  + G SE E   +I  +P LIR S  KL  G EF++    + +  + ++P  L + ++ RV
Subjt:  IVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRV

Query:  LPRYRVLEVVKSKRLL----KKQPKFIHILGMSDQDFLDKFVCRFPDNM-KDLLVAFR
        +PR +VL++++ K LL    KK+   + I+ M+++ FL+K+V RF D + ++LLVA++
Subjt:  LPRYRVLEVVKSKRLL----KKQPKFIHILGMSDQDFLDKFVCRFPDNM-KDLLVAFR


Sequences
CDS sequence
ATGTACGCTCTGAGACCATCATTCTCTGTTTCCTCTCGCCTAAATCGTCCTTCCCTCCTCCGAATTCATCCGTTCTTGTATTATTTCTTCTCTTCTTCTTCCCACACCCA
AGCCTCTGCTTCAAATGGAATCGTCGTCCAATACCTCATCGATAACTTCCAACTGTCCCCCGCCAGAGCGGTGTCGATTATGAGCAGCCGTAGAGGCATTGAATCAACGG
AAAAGCCTCAATCCGTTTACAAATATCTTTCAGAGCTCGGATTCTCCGACGCTCACATTCAATCGGCCATTCGAATCGCGCCGCAGATCGCGTTTTCCAGTATTGAAAAG
ACTCTGAAGCCGAAGATCGACTTCTTCCAGAATCTTGGCTTGGTCGGCTCCGATTTGGGTAAGTTCATTTCCAGGCATTCTTCGCTTTTGACTGTTAGTTTGGAGAAGAA
CTTGATGCCCAGTGTTGAGATTCTTAAGAATTTTTTCCCCGAGGATAAGTGTAATAGTGATCTCCTGTTTGCTATCCGGCGATGTTCTTATACGCTTATGATAGCCCCGT
ATAAAAGGTTGTCGGTAAATATTAATTACTTACGAAGTTGTGGGATTGTTGATTCTCAAATCTCTATGTTACTGAAGAGGCAACCTGGACTTTTTGGTAAGCGTGAATCT
CAACTTAGAGATATTGTTTCCATGGTTGTAGACACTGGTTTTTCTACGAATACTAAAATGTTTGCTCATGGACTTCATGCTATCGGTTCTGTAAGTAATGCGACCTTTAA
CAAGAAAGTGGAGTTGATTTGTAGCTTTGGAATAAGTGAGAAAGAATGTATGAGAATGATTACTTCTGCTCCTGTTTTGATAAGGACTTCCATTGGTAAACTTAAGTCTG
GTCTAGAATTCTTCATGAATGAGGCAAAAGTTAGCAAATCAGATATTGTTCGTAAACCTTCTTGTTTGATGCACGCCATGCAGGGGAGGGTGCTCCCTCGTTATAGAGTT
CTAGAGGTTGTGAAGTCGAAGAGGCTATTGAAGAAGCAACCGAAATTTATCCATATATTGGGGATGTCTGATCAGGATTTCTTGGATAAATTCGTGTGTAGGTTTCCTGA
TAATATGAAAGATTTGTTGGTGGCCTTTAGAGGTAATTCTGTAGGTGAATTGCAGCCTAAGGAGTTAGCAACATGA
mRNA sequence
ATGTACGCTCTGAGACCATCATTCTCTGTTTCCTCTCGCCTAAATCGTCCTTCCCTCCTCCGAATTCATCCGTTCTTGTATTATTTCTTCTCTTCTTCTTCCCACACCCA
AGCCTCTGCTTCAAATGGAATCGTCGTCCAATACCTCATCGATAACTTCCAACTGTCCCCCGCCAGAGCGGTGTCGATTATGAGCAGCCGTAGAGGCATTGAATCAACGG
AAAAGCCTCAATCCGTTTACAAATATCTTTCAGAGCTCGGATTCTCCGACGCTCACATTCAATCGGCCATTCGAATCGCGCCGCAGATCGCGTTTTCCAGTATTGAAAAG
ACTCTGAAGCCGAAGATCGACTTCTTCCAGAATCTTGGCTTGGTCGGCTCCGATTTGGGTAAGTTCATTTCCAGGCATTCTTCGCTTTTGACTGTTAGTTTGGAGAAGAA
CTTGATGCCCAGTGTTGAGATTCTTAAGAATTTTTTCCCCGAGGATAAGTGTAATAGTGATCTCCTGTTTGCTATCCGGCGATGTTCTTATACGCTTATGATAGCCCCGT
ATAAAAGGTTGTCGGTAAATATTAATTACTTACGAAGTTGTGGGATTGTTGATTCTCAAATCTCTATGTTACTGAAGAGGCAACCTGGACTTTTTGGTAAGCGTGAATCT
CAACTTAGAGATATTGTTTCCATGGTTGTAGACACTGGTTTTTCTACGAATACTAAAATGTTTGCTCATGGACTTCATGCTATCGGTTCTGTAAGTAATGCGACCTTTAA
CAAGAAAGTGGAGTTGATTTGTAGCTTTGGAATAAGTGAGAAAGAATGTATGAGAATGATTACTTCTGCTCCTGTTTTGATAAGGACTTCCATTGGTAAACTTAAGTCTG
GTCTAGAATTCTTCATGAATGAGGCAAAAGTTAGCAAATCAGATATTGTTCGTAAACCTTCTTGTTTGATGCACGCCATGCAGGGGAGGGTGCTCCCTCGTTATAGAGTT
CTAGAGGTTGTGAAGTCGAAGAGGCTATTGAAGAAGCAACCGAAATTTATCCATATATTGGGGATGTCTGATCAGGATTTCTTGGATAAATTCGTGTGTAGGTTTCCTGA
TAATATGAAAGATTTGTTGGTGGCCTTTAGAGGTAATTCTGTAGGTGAATTGCAGCCTAAGGAGTTAGCAACATGA
Protein sequence
MYALRPSFSVSSRLNRPSLLRIHPFLYYFFSSSSHTQASASNGIVVQYLIDNFQLSPARAVSIMSSRRGIESTEKPQSVYKYLSELGFSDAHIQSAIRIAPQIAFSSIEK
TLKPKIDFFQNLGLVGSDLGKFISRHSSLLTVSLEKNLMPSVEILKNFFPEDKCNSDLLFAIRRCSYTLMIAPYKRLSVNINYLRSCGIVDSQISMLLKRQPGLFGKRES
QLRDIVSMVVDTGFSTNTKMFAHGLHAIGSVSNATFNKKVELICSFGISEKECMRMITSAPVLIRTSIGKLKSGLEFFMNEAKVSKSDIVRKPSCLMHAMQGRVLPRYRV
LEVVKSKRLLKKQPKFIHILGMSDQDFLDKFVCRFPDNMKDLLVAFRGNSVGELQPKELAT
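
The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (the record does not name a translation table). `translate`, `cds_start` and `cds_end` are illustrative names, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTACGCTCTGAGACCATCATTCTCTGTT"  # first 30 nt of the CDS
cds_end = "GAGTTAGCAACATGA"                   # last 15 nt, in frame, ending in TGA

print(translate(cds_start))  # MYALRPSFSV
print(translate(cds_end))    # ELAT
```

The two outputs match the first ten and last four residues of the protein sequence listed above.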