; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020597 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020597
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTranscription termination factor like
Genome locationtig00153552:237156..238361
RNA-Seq ExpressionSgr020597
SyntenySgr020597
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015351.1 Transcription termination factor MTERF8, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]6.0e-16376.73Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH   S LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++M  R+G++STEKPQSVYKYLSELGFS AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IE+TLKPKIEFFQNLG VGSDLG+F+SKHS+LLTVSL+ KLMPSVEILK+VFPKDE ++DLLQVM+RCSD LMR P +RL +NINY +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG+RES VRD+VSM VETGFSTNT+MFVHGLHA+SSV+N TF+KKVELICSFG TEKECM+MFTSAPVLIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET
        EA+VSRS IV +PTCLMH+MQGRVLPR RVLQVVK K L +K P+L+D LG+S+EDFLDKFV+RFPD+V +LL A+RGQ +D LQ KELET
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET

XP_022136933.1 uncharacterized protein LOC111008506 isoform X1 [Momordica charantia]8.9e-18385.32Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSS-TEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRL
        MYA  S+FSLSARH  P    IDP L YFFSSSSS  EASA+NAIVVQYLVD F LSTARALA+MSCRKGVESTEKP+SV KYLSELGFSDAHIQSAIR+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSS-TEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRL

Query:  SPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGI
        SPQIAFSS+EKTLKPKIEFFQNLG VGSDLGKF+S HSSLLTVSLKNKL PSVEILKNVFPKDE +++LLQVMRRCSD+LMRCP+SRL +NINYFRSCGI
Subjt:  SPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGI

Query:  VDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFM
        VDSQLSMLLKRQPVLFGRRES VRDLVSMAVETGFSTNT+MFVHGLHA+SSVSN TFKKKVELICSFGFTEKECMKMFTSAPVLIRTS+ KLKTG+EFFM
Subjt:  VDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFM

Query:  NEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGRIRA
        NEAKVSRS+IV RPT LMHSMQGRVLPR RVLQVVK KRL+RK+P+LVDTLGISEEDF DKFVYRFPDNV+DLLVAY GQ VDAL+ KEL  +RR RIR 
Subjt:  NEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGRIRA

Query:  LI
        LI
Subjt:  LI

XP_022929189.1 uncharacterized protein LOC111435860 [Cucurbita moschata]9.9e-16677.02Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH   S LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++M  R+G+ESTEKPQSVYKYLSELGFS AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IE+TLKPKIEFFQNLG VGSDLG+F+SKHS+LLTVSL+ +LMPSVEILK+VFPKDE ++DLLQVM+RCSDMLMR P +RL +NI+Y +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG RES VRD+VSM VETGFSTNT+MFVHGLH++SSVSN TFKKKVELICSFG TEKECM+MFTSAP LIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGR
        EA+VSRS IVC+PTCLMH+MQGRVLPR RVLQVVK KRL  K P+L+D LG+S+EDFLDKFV+RFPD+V +LL A+RGQ +D LQ KELETRR G+
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGR

XP_022929526.1 uncharacterized protein LOC111436067 [Cucurbita moschata]2.5e-16176.21Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH   S LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++M  R+G+ESTEKPQSVYKYLSELGFS AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IEKTLKPKIEFFQN G VGSDLG+F+SKHS+LLTVSL+ KLMPSVEILK+VFPKDE ++DLLQ M+RC D LMR P +RL +NINY +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG+RES VR++VSM VETGFSTNT+MFVHGLHA+SSV+N TFKKKVELICSFG TEKECM+MFTSAPVLIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET
        EA+VSRS IV +PTCLMH+MQGRVLPR RVLQVVK K L +K P+L+D LG+S+EDFLDKFV+RFPD+V +LL A+RGQ +D LQ K+LET
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET

XP_022984673.1 uncharacterized protein LOC111482884 isoform X1 [Cucurbita maxima]9.9e-16678.01Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH  PS LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++MS R+G+ STEKPQSVYKYLSELGFS+AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IEKTLKPKIEFFQNLG VGSDLG+F+SKHS+LLT+SL+ +LMPSVEILK+VFPKDE ++DLLQV+RRCSDMLMR P +RL +NINY +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG RES VRDLVSM VETGFSTNTRMFVHGLHA+SSVSN TFKKKVELICSFG TEKECM+MFTSAPVLIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET
        EA+VSRS IV +PTCLMH+MQGRVLPR RVLQVV  KRLS+K  +L+D LG+S+E+FLDKFV+RFPD+V +LL A+RGQ +D LQ KELET
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET

TrEMBL top hitse value%identityAlignment
A0A6J1C8W1 uncharacterized protein LOC111008506 isoform X14.3e-18385.32Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSS-TEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRL
        MYA  S+FSLSARH  P    IDP L YFFSSSSS  EASA+NAIVVQYLVD F LSTARALA+MSCRKGVESTEKP+SV KYLSELGFSDAHIQSAIR+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSS-TEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRL

Query:  SPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGI
        SPQIAFSS+EKTLKPKIEFFQNLG VGSDLGKF+S HSSLLTVSLKNKL PSVEILKNVFPKDE +++LLQVMRRCSD+LMRCP+SRL +NINYFRSCGI
Subjt:  SPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGI

Query:  VDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFM
        VDSQLSMLLKRQPVLFGRRES VRDLVSMAVETGFSTNT+MFVHGLHA+SSVSN TFKKKVELICSFGFTEKECMKMFTSAPVLIRTS+ KLKTG+EFFM
Subjt:  VDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFM

Query:  NEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGRIRA
        NEAKVSRS+IV RPT LMHSMQGRVLPR RVLQVVK KRL+RK+P+LVDTLGISEEDF DKFVYRFPDNV+DLLVAY GQ VDAL+ KEL  +RR RIR 
Subjt:  NEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGRIRA

Query:  LI
        LI
Subjt:  LI

A0A6J1EN21 uncharacterized protein LOC1114358604.8e-16677.02Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH   S LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++M  R+G+ESTEKPQSVYKYLSELGFS AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IE+TLKPKIEFFQNLG VGSDLG+F+SKHS+LLTVSL+ +LMPSVEILK+VFPKDE ++DLLQVM+RCSDMLMR P +RL +NI+Y +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG RES VRD+VSM VETGFSTNT+MFVHGLH++SSVSN TFKKKVELICSFG TEKECM+MFTSAP LIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGR
        EA+VSRS IVC+PTCLMH+MQGRVLPR RVLQVVK KRL  K P+L+D LG+S+EDFLDKFV+RFPD+V +LL A+RGQ +D LQ KELETRR G+
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGR

A0A6J1END2 uncharacterized protein LOC1114360671.2e-16176.21Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH   S LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++M  R+G+ESTEKPQSVYKYLSELGFS AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IEKTLKPKIEFFQN G VGSDLG+F+SKHS+LLTVSL+ KLMPSVEILK+VFPKDE ++DLLQ M+RC D LMR P +RL +NINY +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG+RES VR++VSM VETGFSTNT+MFVHGLHA+SSV+N TFKKKVELICSFG TEKECM+MFTSAPVLIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET
        EA+VSRS IV +PTCLMH+MQGRVLPR RVLQVVK K L +K P+L+D LG+S+EDFLDKFV+RFPD+V +LL A+RGQ +D LQ K+LET
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET

A0A6J1J5Y4 uncharacterized protein LOC111482884 isoform X41.2e-16177Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH  PS LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++MS R+G+ STEKPQSVYKYLSELGFS+AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IEKTLKPKIEFFQNLG VGSDLG F+SKHS+LLT+SL+ +LMPSVEILK+VFPKDE ++D LQVMRRCSDML R P +RL +NINY +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG RES VRD+VSM VETGFSTNT+MFVHGLHA+SSVSN TFKKKVELICSFG TEKECM+MFTSAPVLIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSK
        EA+VSRS IV +PTCLMH+MQGRVLPR RVLQVV  KRLS+K  +L+D LG+S+E+FLDKFV+RFPD V +LL A+RGQ +D LQ K
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSK

A0A6J1JB80 uncharacterized protein LOC111482884 isoform X14.8e-16678.01Show/hide
Query:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS
        MYA  S FS+S+RH  PS LRIDP LY FFSSSS ++ASAS  IVVQYL+DTF+LS ARA+++MS R+G+ STEKPQSVYKYLSELGFS+AHIQS IRL+
Subjt:  MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLS

Query:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV
        PQIAFS+IEKTLKPKIEFFQNLG VGSDLG+F+SKHS+LLT+SL+ +LMPSVEILK+VFPKDE ++DLLQV+RRCSDMLMR P +RL +NINY +SCGIV
Subjt:  PQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIV

Query:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN
         SQLSMLLKRQP LFG RES VRDLVSM VETGFSTNTRMFVHGLHA+SSVSN TFKKKVELICSFG TEKECM+MFTSAPVLIRTSVGKLK GLEFFMN
Subjt:  DSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMN

Query:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET
        EA+VSRS IV +PTCLMH+MQGRVLPR RVLQVV  KRLS+K  +L+D LG+S+E+FLDKFV+RFPD+V +LL A+RGQ +D LQ KELET
Subjt:  EAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELET

SwissProt top hitse value%identityAlignment
F4IHL3 Transcription termination factor MTERF2, chloroplastic2.0e-0725.63Show/hide
Query:  VQYLVDTFKLST---ARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTV
        + YL + F LST    R LA      G    E+ + + KY   LG     ++  + + P +    +EKT+ PK+ F Q +G     +G  + K  SLLT 
Subjt:  VQYLVDTFKLST---ARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTV

Query:  SLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFG------------RRESLVR---DLVS
        SL  K+ P V  L  +     +  D+ +V+     +L     ++L+ N+ Y+ S GI   QL  ++   P+L               R +++R   DL+ 
Subjt:  SLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFG------------RRESLVR---DLVS

Query:  MAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICS
              +S   R+     H +   + V FK +  L C+
Subjt:  MAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICS

F4JVI3 Transcription termination factor MTERF5, chloroplastic2.3e-0827.95Show/hide
Query:  KPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRR
        K + V ++L +LG   + I + +   PQI   S+   LKP + F + LG   +   K +S+  ++LT S + KL  +VE L      +E    + +++ R
Subjt:  KPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRR

Query:  CSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRR-ESLVRDLVSMAVETGF
        C +++    + +L+  + YFRS  +    +++LL R P  FG   ES ++ +    +E GF
Subjt:  CSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRR-ESLVRDLVSMAVETGF

Q84X53 Transcription termination factor MTEF1, chloroplastic5.9e-0425.27Show/hide
Query:  LRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQS----VYKYLS-ELGFSDAHIQSAIRLSPQIAFSSIEKTLKP
        LR++P L           A  S+ + V+ L+ +  LS      ++     +  T  P+S    V ++LS E+  S+  I  +I   P++  SS++  L+P
Subjt:  LRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQS----VYKYLS-ELGFSDAHIQSAIRLSPQIAFSSIEKTLKP

Query:  KIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKN--VFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYF
         + F + LGFVG D     S+++ LL  +++  L+P +E L+    F ++E    + +++ R   +L    D+ L   + +F
Subjt:  KIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKN--VFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYF

Q9FM80 Transcription termination factor MTERF9, chloroplastic3.3e-0722.11Show/hide
Query:  YLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVE-ILKNVFPKDESSNDLLQVMRRCSDMLM
        YL  +G     I+  +   PQI   ++E  LK  I F   LG   S +G+ ++   SL + S++N L P++  +++ V  K+    D+ +V++    +L+
Subjt:  YLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVE-ILKNVFPKDESSNDLLQVMRRCSDMLM

Query:  RCPDSRLQLNINYF---RSCGIVDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMF
        +  D  +  N  Y    +  G     +  ++K+ P            L+  +++ G                      F  ++  + S G    + +K+ 
Subjt:  RCPDSRLQLNINYF---RSCGIVDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMF

Query:  TSAPVLIRTSV-GKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKF
        TS   ++  S+   LK    + +NE      ++   P  L  S+  R+ PR+R L  V+LK++ RK P  + +L  ++E F  ++
Subjt:  TSAPVLIRTSV-GKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKF

Arabidopsis top hitse value%identityAlignment
AT1G21150.1 Mitochondrial transcription termination factor family protein9.6e-3428.83Show/hide
Query:  AIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTV
        +  V YLVD+  LS   A +     K V S++KP SV     + GF++  I S I+  P++   S E  + PK+ FF ++GF  SD  K +S    +L+ 
Subjt:  AIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTV

Query:  SLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDM-LMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMF
        SL  +L+P  + LK++  ++ES    L+   RC  + +  C    + L ++  R  G+ D  +  L++  P  F  RE    ++++     GF      F
Subjt:  SLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDM-LMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMF

Query:  VHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSR
        VH + A    S    ++K +L   FG+++++ +      P  +  S  K+   LE+ +N   +    IV RP  L  SM+ R+ PRN+V+ ++  K L +
Subjt:  VHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSR

Query:  K-DPKLVDTLGISEEDFLDKFVYRFPDNVKDLL
        K D      L +   +F+DKFV ++ D +  L+
Subjt:  K-DPKLVDTLGISEEDFLDKFVYRFPDNVKDLL

AT1G62010.1 Mitochondrial transcription termination factor family protein4.3e-1826.49Show/hide
Query:  SARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEK--PQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSI
        +AR +   +LR+   L    S SS + +SAS A  V    +TFK S+     + S R   + T+K    SV   L   GF+D+ I S IR   ++   + 
Subjt:  SARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEK--PQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSI

Query:  EKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLL
          +L  K++F Q+ G   S+L + +S    +L       L    + +K +   D+SS        + S  L +    R   NI   R  G+   +L +LL
Subjt:  EKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLL

Query:  --KRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGF------------------TEKEC---------------
          K QPV    +E     L    VE GF   T  FVH LH +  +S+ T ++K+ +  S GF                  +EK+                
Subjt:  --KRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGF------------------TEKEC---------------

Query:  --MKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKD-PKLVDTLGISEEDFLDKFVYRFPDN--
          M MF   P  I  S   +K   EF + E       +   P  L +S++ R +PR  V++V+  K L   + P +   L  + E FL+ +V +  D   
Subjt:  --MKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKD-PKLVDTLGISEEDFLDKFVYRFPDN--

Query:  VKDLLVAYRGQSVDALQSK
        V +L+  + G  V     K
Subjt:  VKDLLVAYRGQSVDALQSK

AT1G62120.1 Mitochondrial transcription termination factor family protein4.6e-2024.81Show/hide
Query:  FSSS------SSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLG
        FSSS      SS +    +   V YLVD+  L+T  A ++ S +   ++   P SV   L   GF+D+ I + IR  P++     EK+L PK++F Q++G
Subjt:  FSSS------SSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQNLG

Query:  FVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQL-SMLLKRQPVLFGRRESLV
           S+L + +S    +L       L    + +K +   D+SS    ++ + C  +           N+   R  G+    L S+L+     + G+ +   
Subjt:  FVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQL-SMLLKRQPVLFGRRESLV

Query:  RDLVSMAVETGFSTNTRMFVHGLHAVSSVSN---------------------VTFKK--------------KVELICSFGFTEKECMKMFTSAPVLIRTS
        ++ +  AVE GF   T  FV  L+ +  +S+                       FKK               VE     GF+  E + M    P  I  S
Subjt:  RDLVSMAVETGFSTNTRMFVHGLHAVSSVSN---------------------VTFKK--------------KVELICSFGFTEKECMKMFTSAPVLIRTS

Query:  VGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKD-PKLVDTLGISEEDFLDKFVYRFPDN--VKDLLVAYRGQSVDAL
           +KT  EF + E       +   P  L +S++ R +PR  V++V+  K L   + P +   L  + E FL  +V +  D   V +L+  + G  V   
Subjt:  VGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKD-PKLVDTLGISEEDFLDKFVYRFPDN--VKDLLVAYRGQSVDAL

Query:  QSK
          K
Subjt:  QSK

AT5G07900.1 Mitochondrial transcription termination factor family protein1.6e-3328.19Show/hide
Query:  ISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIA
        I VFSL  +  SP              +    E     +  + YL+D+  LS   A  V S +  ++S E+P +V   L + GF+ A I S ++  P + 
Subjt:  ISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIA

Query:  FSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQL
         ++ E  L PK+ FF ++G   S L + ++   ++LT SL N+L+PS   LK+V   DE    ++  +RR + + +      L  NINY    G+ +  +
Subjt:  FSSIEKTLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQL

Query:  SMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVT-FKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAK
         +LL   P    ++    + +   A E GF+     FV  +HA+S   N + + K  E+   +G++E + M  F   P  +  S  K+   +E+F+NE  
Subjt:  SMLLKRQPVLFGRRESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVT-FKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAK

Query:  VS-RSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRG
        ++ RS+  C P  L  S++ R++PR  V +V+    L ++D  L   L   E+ FL+K V ++ + + +L+  Y G
Subjt:  VS-RSVIVCRPTCLMHSMQGRVLPRNRVLQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRG

AT5G64950.1 Mitochondrial transcription termination factor family protein9.5e-6641.48Show/hide
Query:  RIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQ
        R D  L     SS++T AS SN   V++L D        A+A+      ++S E+P+SV + L    FSD  IQ +IR+ P++ F ++EK L+PK+ FF+
Subjt:  RIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEKTLKPKIEFFQ

Query:  NLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKN-VFPKDESSNDLLQVMRRCSDMLM-RCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRR
        ++GF GS LGKF+S++SS++ VSL  KL+P+VEILK+ V PK E   DL  ++ RC  +L+ R P+  L  NI+Y  +CGIV SQL+ LL+RQP +F   
Subjt:  NLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKN-VFPKDESSNDLLQVMRRCSDMLM-RCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRR

Query:  ESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMH
        E  +R  VS A++ GF+ N+RM VH + ++SS+S  TF +KV+L  + GF+E E   +   +P LIR S  KL  G EF++    + R  +  RP  L +
Subjt:  ESLVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMH

Query:  SMQGRVLPRNRVLQVVKLKRLSRKDPK----LVDTLGISEEDFLDKFVYRFPDNV-KDLLVAYR
        +++ RV+PR +VLQ+++ K L  K+ K    +V  + ++EE FL+K+V RF D + ++LLVAY+
Subjt:  SMQGRVLPRNRVLQVVKLKRLSRKDPK----LVDTLGISEEDFLDKFVYRFPDNV-KDLLVAYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACGCTCAGATATCAGTATTCTCTCTTTCTGCTCGCCATTACAGTCCTTCCTACCTTCGAATTGATCCGCGCTTGTATTATTTCTTCTCCTCTTCTTCCTCCACCGA
AGCCTCTGCTTCAAATGCCATCGTCGTCCAATACCTCGTCGATACCTTCAAATTGTCCACCGCCAGAGCCCTGGCGGTTATGAGCTGCCGGAAAGGCGTTGAATCAACGG
AAAAGCCTCAGTCTGTTTACAAATATCTTTCAGAGCTCGGATTCTCCGACGCCCACATTCAATCTGCGATTCGCCTCTCGCCGCAAATCGCGTTTTCCAGCATCGAGAAG
ACTCTTAAGCCGAAGATCGAGTTCTTCCAGAATCTTGGTTTCGTCGGCTCCGATTTGGGTAAGTTCATGTCCAAGCATTCTTCTCTTTTGACTGTTAGTTTGAAGAATAA
ATTGATGCCCAGCGTCGAGATTCTCAAAAATGTTTTTCCCAAGGATGAAAGTAGTAACGATCTCCTTCAAGTTATGCGGCGATGCTCGGATATGCTTATGAGATGCCCGG
ATTCAAGGTTGCAGCTAAACATTAATTACTTTCGAAGTTGTGGGATTGTTGATTCTCAACTCTCTATGTTACTGAAGAGGCAACCTGTACTTTTTGGTAGGCGAGAATCT
CTAGTTAGGGATCTTGTTTCCATGGCTGTAGAGACTGGTTTTTCTACAAATACTAGAATGTTTGTTCATGGACTTCATGCTGTCAGTAGTGTAAGTAATGTGACCTTTAA
GAAAAAAGTGGAGCTGATTTGCAGCTTTGGATTTACTGAGAAAGAATGTATGAAAATGTTTACGTCTGCTCCTGTTTTGATTAGGACCTCCGTTGGTAAACTAAAGACTG
GTCTAGAATTCTTCATGAATGAGGCAAAAGTCAGCAGATCAGTCATTGTTTGTAGACCTACTTGTTTGATGCACAGCATGCAGGGGAGGGTGCTCCCTCGGAATAGAGTT
CTACAGGTCGTGAAGTTGAAGCGGCTATCGAGGAAGGACCCGAAATTGGTCGATACATTGGGGATATCTGAAGAGGATTTCTTAGATAAATTTGTGTATAGGTTTCCAGA
TAATGTGAAAGATCTGTTGGTGGCCTATAGAGGTCAATCTGTGGATGCATTGCAATCTAAAGAGTTAGAAACACGCAGACGGGGCAGAATCAGAGCTCTAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTACGCTCAGATATCAGTATTCTCTCTTTCTGCTCGCCATTACAGTCCTTCCTACCTTCGAATTGATCCGCGCTTGTATTATTTCTTCTCCTCTTCTTCCTCCACCGA
AGCCTCTGCTTCAAATGCCATCGTCGTCCAATACCTCGTCGATACCTTCAAATTGTCCACCGCCAGAGCCCTGGCGGTTATGAGCTGCCGGAAAGGCGTTGAATCAACGG
AAAAGCCTCAGTCTGTTTACAAATATCTTTCAGAGCTCGGATTCTCCGACGCCCACATTCAATCTGCGATTCGCCTCTCGCCGCAAATCGCGTTTTCCAGCATCGAGAAG
ACTCTTAAGCCGAAGATCGAGTTCTTCCAGAATCTTGGTTTCGTCGGCTCCGATTTGGGTAAGTTCATGTCCAAGCATTCTTCTCTTTTGACTGTTAGTTTGAAGAATAA
ATTGATGCCCAGCGTCGAGATTCTCAAAAATGTTTTTCCCAAGGATGAAAGTAGTAACGATCTCCTTCAAGTTATGCGGCGATGCTCGGATATGCTTATGAGATGCCCGG
ATTCAAGGTTGCAGCTAAACATTAATTACTTTCGAAGTTGTGGGATTGTTGATTCTCAACTCTCTATGTTACTGAAGAGGCAACCTGTACTTTTTGGTAGGCGAGAATCT
CTAGTTAGGGATCTTGTTTCCATGGCTGTAGAGACTGGTTTTTCTACAAATACTAGAATGTTTGTTCATGGACTTCATGCTGTCAGTAGTGTAAGTAATGTGACCTTTAA
GAAAAAAGTGGAGCTGATTTGCAGCTTTGGATTTACTGAGAAAGAATGTATGAAAATGTTTACGTCTGCTCCTGTTTTGATTAGGACCTCCGTTGGTAAACTAAAGACTG
GTCTAGAATTCTTCATGAATGAGGCAAAAGTCAGCAGATCAGTCATTGTTTGTAGACCTACTTGTTTGATGCACAGCATGCAGGGGAGGGTGCTCCCTCGGAATAGAGTT
CTACAGGTCGTGAAGTTGAAGCGGCTATCGAGGAAGGACCCGAAATTGGTCGATACATTGGGGATATCTGAAGAGGATTTCTTAGATAAATTTGTGTATAGGTTTCCAGA
TAATGTGAAAGATCTGTTGGTGGCCTATAGAGGTCAATCTGTGGATGCATTGCAATCTAAAGAGTTAGAAACACGCAGACGGGGCAGAATCAGAGCTCTAATTTGA
Protein sequenceShow/hide protein sequence
MYAQISVFSLSARHYSPSYLRIDPRLYYFFSSSSSTEASASNAIVVQYLVDTFKLSTARALAVMSCRKGVESTEKPQSVYKYLSELGFSDAHIQSAIRLSPQIAFSSIEK
TLKPKIEFFQNLGFVGSDLGKFMSKHSSLLTVSLKNKLMPSVEILKNVFPKDESSNDLLQVMRRCSDMLMRCPDSRLQLNINYFRSCGIVDSQLSMLLKRQPVLFGRRES
LVRDLVSMAVETGFSTNTRMFVHGLHAVSSVSNVTFKKKVELICSFGFTEKECMKMFTSAPVLIRTSVGKLKTGLEFFMNEAKVSRSVIVCRPTCLMHSMQGRVLPRNRV
LQVVKLKRLSRKDPKLVDTLGISEEDFLDKFVYRFPDNVKDLLVAYRGQSVDALQSKELETRRRGRIRALI