; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019107 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019107
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationchr5:38671573..38672790
RNA-Seq ExpressionLag0019107
SyntenyLag0019107
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015351.1 Transcription termination factor MTERF8, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]7.9e-17179.35Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN  S LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+M  RR ++STEK QSVYKYLSELG S AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IE+TLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEKKL+PSVEILKSVFPK+ECN  LLQVM+RC D L+ SPYTRLLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VRDIVSMVVE GF+TNTKMFVHGLHAISSV+N +F+KKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++KSK LLKK+ +L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D LQPKEL T
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT

XP_022929189.1 uncharacterized protein LOC111435860 [Cucurbita moschata]3.2e-17278.91Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN  S LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+M  RR +ESTEK QSVYKYLSELG S AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IE+TLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEK+L+PSVEILKSVFPK+ECN  LLQVM+RC D+L+ SPYTRLLVNI Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRDIVSMVVE GF+TNTKMFVHGLH+ISSVSN +FKKKVELICSFGITEKECMRMFT AP LIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRR
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++KSKRLL+K+ +L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D LQPKEL TRR 
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRR

Query:  SQN
         QN
Subjt:  SQN

XP_022929526.1 uncharacterized protein LOC111436067 [Cucurbita moschata]6.0e-17179.35Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN  S LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+M  RR +ESTEK QSVYKYLSELG S AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQN GLVGSDLG+FISKHS+LLTVSLEKKL+PSVEILKSVFPK+ECN  LLQ M+RCCD L+ SPYTRLLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VR+IVSMVVE GF+TNTKMFVHGLHAISSV+N +FKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++KSK LLKK+ +L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D LQPK+L T
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT

XP_022984673.1 uncharacterized protein LOC111482884 isoform X1 [Cucurbita maxima]4.6e-17178.59Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN PS LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+MS RR + STEK QSVYKYLSELG S+AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG+FISKHS+LLT+SLEK+L+PSVEILKSVFPK+ECN  LLQV+RRC D+L+ SPYTRL VNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRD+VSMVVE GF+TNT+MFVHGLHAISSVSN +FKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++ SKRL KK+++L+D LGMSDE+FLDKFV+RFPD+V  LL   RGQ +D LQPKEL T
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT

XP_023552362.1 uncharacterized protein LOC111810052 isoform X3 [Cucurbita pepo subsp. pepo]1.1e-16979.09Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSR+N  S LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+M  RR +ESTEK QSVYKYLSELG S AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEKKL+PSVEIL+SVFPK+ECN  LLQ M+RCCD L+  PYTRLLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VRDIVSMVVE GF+TNTKMFVHGLHAISSV+N +FKKKVELICSFGITEKECMRMFT APVLIRTS GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT
        LEFF+NEA+ SRSDIV +PTCLMH M+GRVLPRYRVLQ++KSKRLL K+ +L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D LQPKEL T
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT

TrEMBL top hitse value%identityAlignment
A0A6J1C8W1 uncharacterized protein LOC111008506 isoform X11.0e-16075.12Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRS FSLS+RH  P    IDP L YFFSSSS  +      ASA+N +V QYLVD F LS ARA+++MSCR+ VESTEK +SV KYLSELG SDAHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        SAIR++PQIAFSS+EKTLKPKIEFFQNLGLVGSDLGKFIS HSSLLTVSL+ KL PSVEILK+VFPK+E N  LLQVMRRC D+L+  P +RLLVNI+YF
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        RSCGI DSQLSMLLKRQP LFG+RESRVRD+VSM VE GF+TNTKMFVHGLHAISSVSN +FKKKVELICSFG TEKECM+MFT APVLIRTSI KLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRR
        +EFF+NEAK SRS IV RPT LMH MQGRVLPRYRVLQ++KSKRL +KN +LVDTLG+S+EDF DKFVYRFPDNV+ LL+   GQ VD+L+ KELA +RR
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRR

Query:  SQ
        ++
Subjt:  SQ

A0A6J1EN21 uncharacterized protein LOC1114358601.5e-17278.91Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN  S LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+M  RR +ESTEK QSVYKYLSELG S AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IE+TLKPKIEFFQNLGLVGSDLG+FISKHS+LLTVSLEK+L+PSVEILKSVFPK+ECN  LLQVM+RC D+L+ SPYTRLLVNI Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRDIVSMVVE GF+TNTKMFVHGLH+ISSVSN +FKKKVELICSFGITEKECMRMFT AP LIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRR
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++KSKRLL+K+ +L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D LQPKEL TRR 
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRR

Query:  SQN
         QN
Subjt:  SQN

A0A6J1END2 uncharacterized protein LOC1114360672.9e-17179.35Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN  S LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+M  RR +ESTEK QSVYKYLSELG S AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQN GLVGSDLG+FISKHS+LLTVSLEKKL+PSVEILKSVFPK+ECN  LLQ M+RCCD L+ SPYTRLLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFGKRES+VR+IVSMVVE GF+TNTKMFVHGLHAISSV+N +FKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++KSK LLKK+ +L+D LGMSDEDFLDKFV+RFPD+V  LL   RGQ +D LQPK+L T
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT

A0A6J1J5Y4 uncharacterized protein LOC111482884 isoform X47.2e-17079.39Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN PS LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+MS RR + STEK QSVYKYLSELG S+AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG FISKHS+LLT+SLEK+L+PSVEILKSVFPK+ECN   LQVMRRC D+L  SPYTRLLVNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRDIVSMVVE GF+TNTKMFVHGLHAISSVSN +FKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPK
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++ SKRL KK+++L+D LGMSDE+FLDKFV+RFPD V  LL   RGQ +D LQPK
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPK

A0A6J1JB80 uncharacterized protein LOC111482884 isoform X12.2e-17178.59Show/hide
Query:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH
        MYALRSSFS+SSRHN PS LRIDPLLY FFSSSS +Q      ASAS G+V QYL+DTFELSPARAVS+MS RR + STEK QSVYKYLSELG S+AHI 
Subjt:  MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIH

Query:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF
        S IRL PQIAFS+IEKTLKPKIEFFQNLGLVGSDLG+FISKHS+LLT+SLEK+L+PSVEILKSVFPK+ECN  LLQV+RRC D+L+ SPYTRL VNI+Y 
Subjt:  SAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYF

Query:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG
        +SCGI  SQLSMLLKRQPALFG RES+VRD+VSMVVE GF+TNT+MFVHGLHAISSVSN +FKKKVELICSFGITEKECMRMFT APVLIRTS+GKLK G
Subjt:  RSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLG

Query:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT
        LEFF+NEA+ SRSDIV +PTCLMH MQGRVLPRYRVLQ++ SKRL KK+++L+D LGMSDE+FLDKFV+RFPD+V  LL   RGQ +D LQPKEL T
Subjt:  LEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELAT

SwissProt top hitse value%identityAlignment
F4IHL3 Transcription termination factor MTERF2, chloroplastic1.2e-0727.31Show/hide
Query:  LMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKN
        LM C  S+E  E+ + + KY   LGI    +   + + P +    +EKT+ PK+ F Q +G+    +G  + K  SLLT SL KK+ P V  L +     
Subjt:  LMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKN

Query:  ECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFG------------KRESRVRDIVSMV-VEVGFATNTKMFVHGLHAIS
        + +IG +  M     +L  S  T+L  N+ Y+ S GI   QL  ++   P L               R + +R +  ++     F+ + +  +   H I 
Subjt:  ECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFG------------KRESRVRDIVSMV-VEVGFATNTKMFVHGLHAIS

Query:  SVSNVSFKKKVELICS
          + V+FK +  L C+
Subjt:  SVSNVSFKKKVELICS

F4JVI3 Transcription termination factor MTERF5, chloroplastic6.6e-1131.68Show/hide
Query:  KIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRR
        KI+ V ++L +LGI  + I + +   PQI   S+   LKP + F + LG+  +   K IS+  ++LT S  +KL  +VE L       E  IG  +++ R
Subjt:  KIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRR

Query:  CCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFG-KRESRVRDIVSMVVEVGF
        C +++  S   +L   ++YFRS  +    +++LL R P  FG   ES ++ +    +E GF
Subjt:  CCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFG-KRESRVRDIVSMVVEVGF

Q9FM80 Transcription termination factor MTERF9, chloroplastic1.4e-0520.28Show/hide
Query:  YLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLL-----QVMRRCC
        YL  +G+    I   +   PQI   ++E  LK  I F   LG+  S +G+ ++   SL + S+E  L P++  L       E ++G +     Q++ +  
Subjt:  YLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLL-----QVMRRCC

Query:  DVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRM
        D+   + Y  L       +  G     +  ++K+ P L                              LH   S+ +  F  ++  + S G+   + +++
Subjt:  DVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRM

Query:  FTRAPVLIRTSI-GKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKF
         T    ++  S+   LK    + +NE       +   P  L   +  R+ PR+R L  +K    ++K    + +L  +DE F  ++
Subjt:  FTRAPVLIRTSI-GKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKF

Q9SZL6 Transcription termination factor MTERF6, chloroplastic/mitochondrial3.0e-0827.04Show/hide
Query:  EKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGL-VGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVM
        EK+  +  +   LG+ +  +   I   P++   SI+  L   + F  +LGL     +GK + K+  L+  S++K+L P+ E LKS    +E   G+  V+
Subjt:  EKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGL-VGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVM

Query:  RRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKR-----ESRVRDIVSMVVE--VGFATNTKMFVHGLHAISSVSNVSFKKKVE
             +L       L  N DY + CG  DSQ++ ++   P +  K      + R+R +V ++       A+  + F HGL           KKKVE
Subjt:  RRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKR-----ESRVRDIVSMVVE--VGFATNTKMFVHGLHAISSVSNVSFKKKVE

Arabidopsis top hitse value%identityAlignment
AT1G21150.1 Mitochondrial transcription termination factor family protein7.9e-3629.66Show/hide
Query:  YLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKK
        YLVD+  LS   A S     + V S++K  SV     + G ++  I S I+  P++   S E  + PK+ FF ++G   SD  K IS    +L+ SL K+
Subjt:  YLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKK

Query:  LIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHA
        LIP  + LKS+  + E  +  L+   RC  + +T   +   + +   R  G+ D  +  L++  P  F  RE R  ++++ V   GF      FVH + A
Subjt:  LIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHA

Query:  ISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKK-NMKL
            S  + ++K +L   FG ++++ +    R P  +  S  K+   LE+ +N       DIV+RP  L   M+ R+ PR +V+ ++ SK L+KK ++  
Subjt:  ISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKK-NMKL

Query:  VDTLGMSDEDFLDKFVYRFPDNVKHLL
           L +   +F+DKFV ++ D +  L+
Subjt:  VDTLGMSDEDFLDKFVYRFPDNVKHLL

AT1G61990.1 Mitochondrial transcription termination factor family protein5.9e-1524.12Show/hide
Query:  FSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLG
        FSS+     S        N  V+ YLVD+  LS   A S+ S + S E      SV       G +D+ I + I   P +  +  +K L  K++  Q+ G
Subjt:  FSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLG

Query:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLL--KRQPALFGKRESR
           S++ + +S              +P +   KS+    +    ++         L        + N+   R  G+    L  LL  K QP + GK    
Subjt:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLL--KRQPALFGKRESR

Query:  VRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGK--------LKLGL------------------
            +  VVE+GF   T  FV  L  +  +S  + ++KV +  S G T  +   +F + P +++ S  K        L LG                   
Subjt:  VRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGK--------LKLGL------------------

Query:  ---------EFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKL---VDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSV
                 EF + + K  R+ +V  P    + M+ R++PR  +L+ + SK LL+K  +L      L  +DE FLD++V +  + V  L+ +    SV
Subjt:  ---------EFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKL---VDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSV

AT1G62010.1 Mitochondrial transcription termination factor family protein5.9e-1523.98Show/hide
Query:  LMSCRRSVESTEK--IQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFP
        L S R + + T+K    SV   L   G +D+ I S IR   ++   +   +L  K++F Q+ G   S+L + +S    +L     K L    + +K +  
Subjt:  LMSCRRSVESTEK--IQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFP

Query:  KNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLL--KRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKK
         ++ +        +   +  +      + NI   R  G+   +L +LL  K QP + GK +      +  VVE+GF   T  FVH LH +  +S+ + ++
Subjt:  KNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLL--KRQPALFGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKK

Query:  KVELICSFGI------------------TEKEC-----------------MRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGR
        K+ +  S G                   +EK+                  M MF R P  I  S   +K   EF + E       + S P  L + ++ R
Subjt:  KVELICSFGI------------------TEKEC-----------------MRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGR

Query:  VLPRYRVLQIIKSKRLLKKNM-KLVDTLGMSDEDFLDKFVYRFPDN--VKHLLLVLRGQSVDSLQPK
         +PR  V++++ SK LL+  +  +   L  + E FL+ +V +  D   V  L+ +  G  V     K
Subjt:  VLPRYRVLQIIKSKRLLKKNM-KLVDTLGMSDEDFLDKFVYRFPDN--VKHLLLVLRGQSVDSLQPK

AT1G62120.1 Mitochondrial transcription termination factor family protein4.4e-1823.57Show/hide
Query:  FSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLG
        FSSS+      S      +     YLVD+  L+   A S+ S + S ++     SV   L   G +D+ I + IR  P++     EK+L PK++F Q++G
Subjt:  FSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLG

Query:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQL-SMLLKRQPALFGKRESRV
           S+L + +S    +L     K L    + +K +   ++ +    ++ + C  +   S     + N+   R  G+    L S+L+     + GK   + 
Subjt:  LVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQL-SMLLKRQPALFGKRESRV

Query:  RDIVSMVVEVGFATNTKMFVHGLHAISSVSN---------------------VSFKK--------------KVELICSFGITEKECMRMFTRAPVLIRTS
        ++ +   VE+GF   T  FV  L+ +  +S+                       FKK               VE     G +  E + M  R P  I  S
Subjt:  RDIVSMVVEVGFATNTKMFVHGLHAISSVSN---------------------VSFKK--------------KVELICSFGITEKECMRMFTRAPVLIRTS

Query:  IGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDT-LGMSDEDFLDKFVYRFPDN--VKHLLLVLRGQSVDSL
           +K   EF + E       + S P  L + ++ R +PR  V++++ SK LL+  +  + + L  + E FL  +V +  D   V  L+ +  G  V   
Subjt:  IGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYRVLQIIKSKRLLKKNMKLVDT-LGMSDEDFLDKFVYRFPDN--VKHLLLVLRGQSVDSL

Query:  QPK
          K
Subjt:  QPK

AT5G64950.1 Mitochondrial transcription termination factor family protein2.0e-6340.86Show/hide
Query:  ASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGK
        +S++T AS SN    ++L D     P  A+++     +++S E+ +SV + L     SD  I  +IR+ P++ F ++EK L+PK+ FF+++G  GS LGK
Subjt:  ASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIAFSSIEKTLKPKIEFFQNLGLVGSDLGK

Query:  FISKHSSLLTVSLEKKLIPSVEILKS-VFPKNECNIGLLQVMRRCCDVLLT-SPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMV
        F+S++SS++ VSL KKLIP+VEILKS V PK+E    L  ++ RC  +LL+  P   LL NI Y  +CGI  SQL+ LL+RQP +F   E ++R  VS  
Subjt:  FISKHSSLLTVSLEKKLIPSVEILKS-VFPKNECNIGLLQVMRRCCDVLLT-SPYTRLLVNIDYFRSCGIADSQLSMLLKRQPALFGKRESRVRDIVSMV

Query:  VEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYR
        +++GF  N++M VH + ++SS+S  +F +KV+L  + G +E E   +  R+P LIR S  KL LG EF+L      R  +  RP  L + ++ RV+PR +
Subjt:  VEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRVLPRYR

Query:  VLQIIKSKRLL----KKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLV
        VLQI++ K LL    KK   +V  + M++E FL+K+V RF D +   LLV
Subjt:  VLQIIKSKRLL----KKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGCTCTCAGATCATCATTCTCTCTTTCCTCTCGCCATAACCGTCCTTCCTTCCTCCGAATTGATCCACTCCTGTATTATTTCTTCTCTTCTTCTTCCCGCACCCA
AGCGTCTGATTCTACTTCTGCTTCTGCTTCAAATGGAGTCGTCGCCCAATACCTCGTCGACACCTTCGAACTGTCCCCCGCCAGAGCGGTGTCGTTAATGAGCTGCCGCA
GAAGCGTTGAATCAACGGAAAAGATTCAATCCGTCTACAAATATCTTTCAGAGCTCGGAATCTCCGACGCCCACATTCACTCCGCGATTCGCCTCACGCCGCAGATCGCC
TTTTCCAGCATCGAAAAGACTCTGAAACCGAAGATCGAGTTCTTCCAGAATCTGGGCTTGGTCGGCTCCGATTTGGGTAAGTTCATTTCCAAGCATTCTTCGCTTTTGAC
TGTTAGTTTGGAGAAGAAATTGATCCCCAGTGTCGAGATTCTTAAGAGTGTTTTCCCCAAGAATGAGTGTAATATCGGTCTCCTGCAAGTTATGCGTCGATGTTGTGATG
TGCTTTTGACATCCCCGTATACAAGGTTGCTGGTAAATATTGATTACTTTCGAAGCTGTGGGATTGCTGATTCTCAACTCTCTATGTTGCTGAAGAGGCAACCTGCACTT
TTTGGTAAGCGTGAATCTCGAGTTAGAGATATTGTTTCCATGGTTGTAGAGGTTGGTTTTGCTACTAATACTAAAATGTTTGTTCATGGACTTCATGCTATCAGTAGTGT
AAGTAATGTGTCCTTTAAGAAAAAAGTGGAGCTGATTTGCAGCTTTGGAATAACTGAGAAAGAATGTATGAGAATGTTTACTCGTGCTCCTGTTTTGATAAGGACCTCCA
TTGGTAAGCTTAAGCTTGGTCTAGAATTCTTCTTGAATGAGGCAAAAGCCAGCAGATCAGATATTGTCAGCAGACCTACTTGTTTGATGCACGGCATGCAGGGGAGGGTG
CTCCCTCGGTATAGAGTACTACAGATCATTAAATCGAAGAGGCTATTGAAGAAGAACATGAAATTGGTCGACACATTGGGGATGTCTGATGAGGATTTCTTGGATAAATT
CGTGTATAGGTTTCCTGATAATGTGAAACATTTGTTGCTGGTCTTAAGAGGTCAATCTGTAGATTCATTGCAGCCTAAGGAGTTAGCAACGCGAAGACGCAGCCAAAACC
CGAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGCTCTCAGATCATCATTCTCTCTTTCCTCTCGCCATAACCGTCCTTCCTTCCTCCGAATTGATCCACTCCTGTATTATTTCTTCTCTTCTTCTTCCCGCACCCA
AGCGTCTGATTCTACTTCTGCTTCTGCTTCAAATGGAGTCGTCGCCCAATACCTCGTCGACACCTTCGAACTGTCCCCCGCCAGAGCGGTGTCGTTAATGAGCTGCCGCA
GAAGCGTTGAATCAACGGAAAAGATTCAATCCGTCTACAAATATCTTTCAGAGCTCGGAATCTCCGACGCCCACATTCACTCCGCGATTCGCCTCACGCCGCAGATCGCC
TTTTCCAGCATCGAAAAGACTCTGAAACCGAAGATCGAGTTCTTCCAGAATCTGGGCTTGGTCGGCTCCGATTTGGGTAAGTTCATTTCCAAGCATTCTTCGCTTTTGAC
TGTTAGTTTGGAGAAGAAATTGATCCCCAGTGTCGAGATTCTTAAGAGTGTTTTCCCCAAGAATGAGTGTAATATCGGTCTCCTGCAAGTTATGCGTCGATGTTGTGATG
TGCTTTTGACATCCCCGTATACAAGGTTGCTGGTAAATATTGATTACTTTCGAAGCTGTGGGATTGCTGATTCTCAACTCTCTATGTTGCTGAAGAGGCAACCTGCACTT
TTTGGTAAGCGTGAATCTCGAGTTAGAGATATTGTTTCCATGGTTGTAGAGGTTGGTTTTGCTACTAATACTAAAATGTTTGTTCATGGACTTCATGCTATCAGTAGTGT
AAGTAATGTGTCCTTTAAGAAAAAAGTGGAGCTGATTTGCAGCTTTGGAATAACTGAGAAAGAATGTATGAGAATGTTTACTCGTGCTCCTGTTTTGATAAGGACCTCCA
TTGGTAAGCTTAAGCTTGGTCTAGAATTCTTCTTGAATGAGGCAAAAGCCAGCAGATCAGATATTGTCAGCAGACCTACTTGTTTGATGCACGGCATGCAGGGGAGGGTG
CTCCCTCGGTATAGAGTACTACAGATCATTAAATCGAAGAGGCTATTGAAGAAGAACATGAAATTGGTCGACACATTGGGGATGTCTGATGAGGATTTCTTGGATAAATT
CGTGTATAGGTTTCCTGATAATGTGAAACATTTGTTGCTGGTCTTAAGAGGTCAATCTGTAGATTCATTGCAGCCTAAGGAGTTAGCAACGCGAAGACGCAGCCAAAACC
CGAGCTAA
Protein sequenceShow/hide protein sequence
MYALRSSFSLSSRHNRPSFLRIDPLLYYFFSSSSRTQASDSTSASASNGVVAQYLVDTFELSPARAVSLMSCRRSVESTEKIQSVYKYLSELGISDAHIHSAIRLTPQIA
FSSIEKTLKPKIEFFQNLGLVGSDLGKFISKHSSLLTVSLEKKLIPSVEILKSVFPKNECNIGLLQVMRRCCDVLLTSPYTRLLVNIDYFRSCGIADSQLSMLLKRQPAL
FGKRESRVRDIVSMVVEVGFATNTKMFVHGLHAISSVSNVSFKKKVELICSFGITEKECMRMFTRAPVLIRTSIGKLKLGLEFFLNEAKASRSDIVSRPTCLMHGMQGRV
LPRYRVLQIIKSKRLLKKNMKLVDTLGMSDEDFLDKFVYRFPDNVKHLLLVLRGQSVDSLQPKELATRRRSQNPS