; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029766 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029766
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationscaffold6:10241981..10243198
RNA-Seq ExpressionSpg029766
SyntenySpg029766
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015351.1 Transcription termination factor MTERF8, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-17481.41Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN  S LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS M  RRG++STEK QSVYKYLSELG S AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IE+TLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEKKL+PSVEILKS+FPK+ECN  LLQVM+RC D L+RSPYT LLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VRDIVSMVVE GFSTNTKMFVHGLHAISSV+N TF+KKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT
        LEFF+NEA+ SRSDIVR+PTCLMH MQGRVLPRYRVLQ +KSK LLKK+P+L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D  LQPKEL T
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT

XP_022929189.1 uncharacterized protein LOC111435860 [Cucurbita moschata]5.3e-17580.94Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN  S LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS M  RRG+ESTEK QSVYKYLSELG S AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IE+TLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEK+L+PSVEILKS+FPK+ECN  LLQVM+RC D L+RSPYT LLVNI Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRDIVSMVVE GFSTNTKMFVHGLH+ISSVSN TFKKKVELICSFGITEKECMRMFT AP LIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRR
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ +KSKRLL+K+P+L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D  LQPKEL TRR
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRR

Query:  RGQN
         GQN
Subjt:  RGQN

XP_022929526.1 uncharacterized protein LOC111436067 [Cucurbita moschata]2.9e-17381.16Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN  S LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS M  RRG+ESTEK QSVYKYLSELG S AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQN GLVGSDLG+FISKHS+LLTVSLEKKL+PSVEILKS+FPK+ECN  LLQ M+RC D L+RSPYT LLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VR+IVSMVVE GFSTNTKMFVHGLHAISSV+N TFKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT
        LEFF+NEA+ SRSDIVR+PTCLMH MQGRVLPRYRVLQ +KSK LLKK+P+L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D  LQPK+L T
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT

XP_022984673.1 uncharacterized protein LOC111482884 isoform X1 [Cucurbita maxima]1.4e-17280.65Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN PS LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS MS RRG+ STEK QSVYKYLSELG S+AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLL-VNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG+FISKHS+LLT+SLEK+L+PSVEILKS+FPK+ECN  LLQV+RRC D L+RSPYT L VNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLL-VNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRD+VSMVVE GFSTNT+MFVHGLHAISSVSN TFKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT
        LEFF+NEA+ SRSDIVR+PTCLMH MQGRVLPRYRVLQ + SKRL KK+ +LID LGMSDE+FLDKFV+RFPD+V  LL   RGQ +D  LQPKEL T
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT

XP_023552362.1 uncharacterized protein LOC111810052 isoform X3 [Cucurbita pepo subsp. pepo]5.5e-17281.16Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSR+N  S LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS M  RRG+ESTEK QSVYKYLSELG S AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEKKL+PSVEIL+S+FPK+ECN  LLQ M+RC D L+R PYT LLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VRDIVSMVVE GFSTNTKMFVHGLHAISSV+N TFKKKVELICSFGITEKECMRMFT APVLIRTS GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT
        LEFF+NEA+ SRSDIVR+PTCLMH M+GRVLPRYRVLQ +KSKRLL K+P+LID LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D  LQPKEL T
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT

TrEMBL top hitse value%identityAlignment
A0A6J1C8W1 uncharacterized protein LOC111008506 isoform X12.3e-16076.18Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRS FSLS+RH  P    IDP L  FFSSS     S+   ASA+N IVVQYLVD F LS ARA++ MSCR+GVESTEK +SV KYLSELG SDAHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        SAIR++PQIAFSS+EKTLKPKIEFFQNLGLVGSDLGKFIS HSSLLTVSL+ KL PSVEILK++FPK+E N  LLQVMRRC D L+R P + LLVNI+YF
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        RSCGI DSQLSMLLKRQP LFG+RESRVRD+VSM VE GFSTNTKMFVHGLHAISSVSN TFKKKVELICSFG TEKECM+MFT APVLIRTSI KLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRR
        +EFF+NEAK SRS IVRRPT LMH MQGRVLPRYRVLQ +KSKRL +KNP+L+DTLG+S+EDF DKFVYRFPDNV+ LL+   GQ VD +L+ KELA +R
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRR

Query:  RGQ
        R +
Subjt:  RGQ

A0A6J1EN21 uncharacterized protein LOC1114358602.6e-17580.94Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN  S LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS M  RRG+ESTEK QSVYKYLSELG S AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IE+TLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEK+L+PSVEILKS+FPK+ECN  LLQVM+RC D L+RSPYT LLVNI Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRDIVSMVVE GFSTNTKMFVHGLH+ISSVSN TFKKKVELICSFGITEKECMRMFT AP LIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRR
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ +KSKRLL+K+P+L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D  LQPKEL TRR
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRR

Query:  RGQN
         GQN
Subjt:  RGQN

A0A6J1END2 uncharacterized protein LOC1114360671.4e-17381.16Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN  S LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS M  RRG+ESTEK QSVYKYLSELG S AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQN GLVGSDLG+FISKHS+LLTVSLEKKL+PSVEILKS+FPK+ECN  LLQ M+RC D L+RSPYT LLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VR+IVSMVVE GFSTNTKMFVHGLHAISSV+N TFKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT
        LEFF+NEA+ SRSDIVR+PTCLMH MQGRVLPRYRVLQ +KSK LLKK+P+L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D  LQPK+L T
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT

A0A6J1J5Y4 uncharacterized protein LOC111482884 isoform X42.2e-17181.47Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN PS LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS MS RRG+ STEK QSVYKYLSELG S+AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG FISKHS+LLT+SLEK+L+PSVEILKS+FPK+ECN   LQVMRRC D L RSPYT LLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYT-LLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRDIVSMVVE GFSTNTKMFVHGLHAISSVSN TFKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPK
        LEFF+NEA+ SRSDIVR+PTCLMH MQGRVLPRYRVLQ + SKRL KK+ +LID LGMSDE+FLDKFV+RFPD V  LL   RGQ +D  LQPK
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPK

A0A6J1JB80 uncharacterized protein LOC111482884 isoform X16.9e-17380.65Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ
        MYALRSSFS+SSRHN PS LRIDPLLYCFFSSSS SQ      ASAS GIVVQYL+DTFELSPARAVS MS RRG+ STEK QSVYKYLSELG S+AHIQ
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQ

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLL-VNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG+FISKHS+LLT+SLEK+L+PSVEILKS+FPK+ECN  LLQV+RRC D L+RSPYT L VNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLL-VNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRD+VSMVVE GFSTNT+MFVHGLHAISSVSN TFKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT
        LEFF+NEA+ SRSDIVR+PTCLMH MQGRVLPRYRVLQ + SKRL KK+ +LID LGMSDE+FLDKFV+RFPD+V  LL   RGQ +D  LQPKEL T
Subjt:  LEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELAT

SwissProt top hitse value%identityAlignment
F4IHL3 Transcription termination factor MTERF2, chloroplastic5.7e-0725.79Show/hide
Query:  RAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSI
        R +++     G    E+ + + KY   LGI    ++  + + P +    +EKT+ PK+ F Q +G+    +G  + K  SLLT SL KK+ P V  L + 
Subjt:  RAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSI

Query:  FPKNECNIGLLQVMRRCWDALLRSPY--TLLVNIDYFRSCGIADSQLSMLLKRQPALFG------------KRESRVRDIVSMV-VEVGFSTNTKMFVHG
            + +IG +  M     ALL       L  N+ Y+ S GI   QL  ++   P L               R + +R +  ++     FS + +  +  
Subjt:  FPKNECNIGLLQVMRRCWDALLRSPY--TLLVNIDYFRSCGIADSQLSMLLKRQPALFG------------KRESRVRDIVSMV-VEVGFSTNTKMFVHG

Query:  LHAISSVSNVTFKKKVELICS
         H I   + V FK +  L C+
Subjt:  LHAISSVSNVTFKKKVELICS

F4JVI3 Transcription termination factor MTERF5, chloroplastic6.1e-0931.68Show/hide
Query:  KIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRR
        KI+ V ++L +LGI  + I + +   PQI   S+   LKP + F + LG+  +   K IS+  ++LT S  +KL  +VE L       E  IG  +++ R
Subjt:  KIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRR

Query:  CWDALLRS-PYTLLVNIDYFRSCGIADSQLSMLLKRQPALFG-KRESRVRDIVSMVVEVGF
        C + +  S    L   ++YFRS  +    +++LL R P  FG   ES ++ +    +E GF
Subjt:  CWDALLRS-PYTLLVNIDYFRSCGIADSQLSMLLKRQPALFG-KRESRVRDIVSMVVEVGF

Q9FM80 Transcription termination factor MTERF9, chloroplastic2.0e-0723.21Show/hide
Query:  YLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLR
        YL  +G+    I+  +   PQI   ++E  LK  I F   LG+  S +G+ ++   SL + S+E  L P++  L       E ++G +  +         
Subjt:  YLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLR

Query:  SPYTLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPV
        SP  L+  +D      I  +   M L ++  L   R+S    +V MV         K     LH   S+ +  F  ++  + S G+   + +++ T    
Subjt:  SPYTLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPV

Query:  LIRTSI-GKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKF
        ++  S+   LK    + +NE       + + P  L   +  R+ PR+R L  +K    ++K P  + +L  +DE F  ++
Subjt:  LIRTSI-GKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKF

Q9SZL6 Transcription termination factor MTERF6, chloroplastic/mitochondrial8.9e-0827.04Show/hide
Query:  EKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGL-VGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVM
        EK+  +  +   LG+ +  +   I   P++   SI+  L   + F  +LGL     +GK + K+  L+  S++K+L P+ E LKS    +E   G+  V+
Subjt:  EKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGL-VGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVM

Query:  RRCWDALLRSPYTLL-VNIDYFRSCGIADSQLSMLLKRQPALFGKR-----ESRVRDIVSMVVE--VGFSTNTKMFVHGLHAISSVSNVTFKKKVE
              L R    +L  N DY + CG  DSQ++ ++   P +  K      + R+R +V ++       ++  + F HGL           KKKVE
Subjt:  RRCWDALLRSPYTLL-VNIDYFRSCGIADSQLSMLLKRQPALFGKR-----ESRVRDIVSMVVE--VGFSTNTKMFVHGLHAISSVSNVTFKKKVE

Arabidopsis top hitse value%identityAlignment
AT1G21150.1 Mitochondrial transcription termination factor family protein7.9e-3629.88Show/hide
Query:  VQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLE
        V YLVD+  LS   A S     + V S++K  SV     + G ++  I S I+  P++   S E  + PK+ FF ++G   SD  K IS    +L+ SL 
Subjt:  VQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLE

Query:  KKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLH
        K+LIP  + LKSI  + E  +  L+   RC+   L+  + + + +   R  G+ D  +  L++  P  F  RE R  ++++ V   GF      FVH + 
Subjt:  KKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFSTNTKMFVHGLH

Query:  AISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNP-K
        A    S    ++K +L   FG ++++ +      P  +  S  K+   LE+ +N       DIV RP  L   M+ R+ PR +V+  + SK L+KK    
Subjt:  AISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKNP-K

Query:  LIDTLGMSDEDFLDKFVYRFPDNVKHLL
            L +   +F+DKFV ++ D +  L+
Subjt:  LIDTLGMSDEDFLDKFVYRFPDNVKHLL

AT1G61990.1 Mitochondrial transcription termination factor family protein1.1e-1624.43Show/hide
Query:  FSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLG
        FSS+  +  S        N   V YLVD+  LS   A S +S +   E      SV       G +D+ I + I   P +  +  +K L  K++  Q+ G
Subjt:  FSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLG

Query:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQLSMLL--KRQPALFGKRESRV
           S++ + +S    +L    +K +    + +K I   +  +          ++    S    + N+   R  G+    L  LL  K QP + GK     
Subjt:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQLSMLL--KRQPALFGKRESRV

Query:  RDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGK--------LKLGL-------------------
           +  VVE+GF   T  FV  L  +  +S  T ++KV +  S G T  +   +F   P +++ S  K        L LG                    
Subjt:  RDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGK--------LKLGL-------------------

Query:  --------EFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKN---PKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSV
                EF + + K  R+ +V  P    + M+ R++PR  +L+ + SK LL+K    P +   L  +DE FLD++V +  + V  L+ +    SV
Subjt:  --------EFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKN---PKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSV

AT1G62120.1 Mitochondrial transcription termination factor family protein1.1e-1623.65Show/hide
Query:  FSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLG
        FSSS+ +   +S      +   V YLVD+  L+   A S +S +   ++     SV   L   G +D+ I + IR  P++     EK+L PK++F Q++G
Subjt:  FSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLG

Query:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPY-TLLVNIDYFRSCGIADSQL-SMLLKRQPALFGKRESRV
           S+L + +S    +L     K L    + +K I   ++ +    ++ + C      S     + N+   R  G+    L S+L+     + GK   + 
Subjt:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPY-TLLVNIDYFRSCGIADSQL-SMLLKRQPALFGKRESRV

Query:  RDIVSMVVEVGFSTNTKMFVHGLHAISSVSN---------------------VTFKK--------------KVELICSFGITEKECMRMFTCAPVLIRTS
        ++ +   VE+GF   T  FV  L+ +  +S+                       FKK               VE     G +  E + M    P  I  S
Subjt:  RDIVSMVVEVGFSTNTKMFVHGLHAISSVSN---------------------VTFKK--------------KVELICSFGITEKECMRMFTCAPVLIRTS

Query:  IGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKN-PKLIDTLGMSDEDFLDKFVYRFPDN--VKHLLLVLRGQSVDSS
           +K   EF + E       +   P  L + ++ R +PR  V++ + SK LL+   P +   L  + E FL  +V +  D   V  L+ +  G  V  +
Subjt:  IGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLLKKN-PKLIDTLGMSDEDFLDKFVYRFPDN--VKHLLLVLRGQSVDSS

Query:  LQPKEL
         Q   L
Subjt:  LQPKEL

AT3G46950.1 Mitochondrial transcription termination factor family protein1.4e-1621.93Show/hide
Query:  FSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLG
        F++ S S   A  S+   +   V YLV++  L+   A + +S +   E      SV   L   G  D+ I   IR  P++  +  EK+L+PK++F ++ G
Subjt:  FSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLG

Query:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQL-SMLLKRQPALFGKRESRVR
           S++ + +S   ++L    E+ +    + +K I    +           C           + NI   R  G+    L S+L+ R   + GK   +  
Subjt:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQL-SMLLKRQPALFGKRESRVR

Query:  DIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKV-----------------------------------ELICSFGITEKECMRMFTCAPVLIRTSI
        + +  VV++GF      FV  LH +  +S  T ++KV                                   E +   G+ E+E + +    P  IR+S 
Subjt:  DIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKV-----------------------------------ELICSFGITEKECMRMFTCAPVLIRTSI

Query:  GKLKLGLEFFLNEAKASRSD------------------------------------IVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLL-KKNPKLIDTLG
         K+   +E FL     SR D                                    +V  PT L + ++ R++PR  V++ + SK L+  +NP +   L 
Subjt:  GKLKLGLEFFLNEAKASRSD------------------------------------IVRRPTCLMHGMQGRVLPRYRVLQFIKSKRLL-KKNPKLIDTLG

Query:  MSDEDFLDKFVYRFPDNVKHLLLV
         +D++FL ++V +    V  L+ +
Subjt:  MSDEDFLDKFVYRFPDNVKHLLLV

AT5G64950.1 Mitochondrial transcription termination factor family protein2.6e-6340.86Show/hide
Query:  ASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGK
        +S +  AS SN   V++L D     P  A++       ++S E+ +SV + L     SD  IQ +IR+ P++ F ++EK L+PK+ FF+++G  GS LGK
Subjt:  ASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGK

Query:  FISKHSSLLTVSLEKKLIPSVEILKSIF-PKNECNIGLLQVMRRC-WDALLRSP-YTLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMV
        F+S++SS++ VSL KKLIP+VEILKSI  PK+E    L  ++ RC W  L R P   LL NI Y  +CGI  SQL+ LL+RQP +F   E ++R  VS  
Subjt:  FISKHSSLLTVSLEKKLIPSVEILKSIF-PKNECNIGLLQVMRRC-WDALLRSP-YTLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMV

Query:  VEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYR
        +++GF+ N++M VH + ++SS+S  TF +KV+L  + G +E E   +   +P LIR S  KL LG EF+L      R  + +RP  L + ++ RV+PR +
Subjt:  VEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVLPRYR

Query:  VLQFIKSKRLL----KKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLV
        VLQ ++ K LL    KK   ++  + M++E FL+K+V RF D +   LLV
Subjt:  VLQFIKSKRLL----KKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGCTCTGAGATCATCATTCTCTCTTTCCTCTCGCCATAATCGTCCCTCCTTCCTCCGAATTGATCCACTCTTGTATTGTTTCTTCTCTTCTTCTTCCCGCAGCCA
AGCCTCTGCTTCTGCTTCTGCTTCTGCTTCAAATGGAATCGTCGTCCAATACCTCGTCGACACCTTCGAACTGTCCCCCGCCAGAGCGGTGTCGTTTATGAGCTGCCGCA
GAGGCGTTGAATCAACGGAAAAGATTCAATCCGTCTACAAATATCTTTCAGAGCTCGGAATCTCCGACGCCCACATTCAGTCGGCGATTCGCCTCACGCCGCAGATCGCC
TTTTCCAGCATCGAAAAGACTCTGAAGCCGAAGATCGAGTTCTTCCAGAATCTTGGCTTGGTCGGCTCCGATTTGGGTAAGTTCATTTCCAAGCATTCTTCGCTTTTGAC
TGTTAGTTTGGAGAAGAAATTGATCCCCAGTGTCGAGATTCTTAAGAGTATTTTCCCCAAGAATGAGTGTAATATCGGTCTCCTGCAAGTTATGCGTCGATGTTGGGATG
CGCTTTTGAGATCCCCGTATACCTTGCTGGTAAATATTGATTACTTTCGAAGCTGTGGGATTGCTGATTCTCAACTCTCTATGTTGCTGAAGAGGCAACCTGCACTTTTT
GGTAAGCGTGAATCTCGAGTTAGAGATATTGTTTCCATGGTTGTAGAGGTTGGTTTTTCTACTAATACTAAAATGTTTGTTCATGGACTTCATGCTATCAGTAGTGTAAG
TAATGTGACCTTTAAGAAAAAAGTGGAGCTGATTTGCAGCTTTGGAATAACTGAGAAAGAATGTATGAGAATGTTTACTTGTGCTCCTGTTTTGATAAGGACCTCCATTG
GTAAGCTTAAGCTTGGTCTAGAATTCTTCTTGAATGAGGCAAAAGCTAGCAGATCAGATATTGTCCGCAGACCTACTTGTTTGATGCACGGCATGCAGGGGAGGGTGCTC
CCTCGGTATAGAGTACTACAGTTCATTAAATCGAAGAGGCTATTGAAGAAGAACCCAAAATTGATCGACACATTGGGGATGTCTGATGAGGATTTCTTGGATAAATTCGT
GTATAGGTTTCCTGATAATGTGAAACATTTGTTGCTGGTCTTAAGAGGTCAATCTGTAGATTCTTCATTGCAGCCTAAGGAGTTAGCAACGCGAAGACGCGGCCAGAATC
CGAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGCTCTGAGATCATCATTCTCTCTTTCCTCTCGCCATAATCGTCCCTCCTTCCTCCGAATTGATCCACTCTTGTATTGTTTCTTCTCTTCTTCTTCCCGCAGCCA
AGCCTCTGCTTCTGCTTCTGCTTCTGCTTCAAATGGAATCGTCGTCCAATACCTCGTCGACACCTTCGAACTGTCCCCCGCCAGAGCGGTGTCGTTTATGAGCTGCCGCA
GAGGCGTTGAATCAACGGAAAAGATTCAATCCGTCTACAAATATCTTTCAGAGCTCGGAATCTCCGACGCCCACATTCAGTCGGCGATTCGCCTCACGCCGCAGATCGCC
TTTTCCAGCATCGAAAAGACTCTGAAGCCGAAGATCGAGTTCTTCCAGAATCTTGGCTTGGTCGGCTCCGATTTGGGTAAGTTCATTTCCAAGCATTCTTCGCTTTTGAC
TGTTAGTTTGGAGAAGAAATTGATCCCCAGTGTCGAGATTCTTAAGAGTATTTTCCCCAAGAATGAGTGTAATATCGGTCTCCTGCAAGTTATGCGTCGATGTTGGGATG
CGCTTTTGAGATCCCCGTATACCTTGCTGGTAAATATTGATTACTTTCGAAGCTGTGGGATTGCTGATTCTCAACTCTCTATGTTGCTGAAGAGGCAACCTGCACTTTTT
GGTAAGCGTGAATCTCGAGTTAGAGATATTGTTTCCATGGTTGTAGAGGTTGGTTTTTCTACTAATACTAAAATGTTTGTTCATGGACTTCATGCTATCAGTAGTGTAAG
TAATGTGACCTTTAAGAAAAAAGTGGAGCTGATTTGCAGCTTTGGAATAACTGAGAAAGAATGTATGAGAATGTTTACTTGTGCTCCTGTTTTGATAAGGACCTCCATTG
GTAAGCTTAAGCTTGGTCTAGAATTCTTCTTGAATGAGGCAAAAGCTAGCAGATCAGATATTGTCCGCAGACCTACTTGTTTGATGCACGGCATGCAGGGGAGGGTGCTC
CCTCGGTATAGAGTACTACAGTTCATTAAATCGAAGAGGCTATTGAAGAAGAACCCAAAATTGATCGACACATTGGGGATGTCTGATGAGGATTTCTTGGATAAATTCGT
GTATAGGTTTCCTGATAATGTGAAACATTTGTTGCTGGTCTTAAGAGGTCAATCTGTAGATTCTTCATTGCAGCCTAAGGAGTTAGCAACGCGAAGACGCGGCCAGAATC
CGAGCTAA
Protein sequenceShow/hide protein sequence
MYALRSSFSLSSRHNRPSFLRIDPLLYCFFSSSSRSQASASASASASNGIVVQYLVDTFELSPARAVSFMSCRRGVESTEKIQSVYKYLSELGISDAHIQSAIRLTPQIA
FSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSIFPKNECNIGLLQVMRRCWDALLRSPYTLLVNIDYFRSCGIADSQLSMLLKRQPALF
GKRESRVRDIVSMVVEVGFSTNTKMFVHGLHAISSVSNVTFKKKVELICSFGITEKECMRMFTCAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVRRPTCLMHGMQGRVL
PRYRVLQFIKSKRLLKKNPKLIDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSSLQPKELATRRRGQNPS