; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G015220 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G015220
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationchr05:23026975..23028221
RNA-Seq ExpressionLsi05G015220
SyntenyLsi05G015220
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022933338.1 uncharacterized protein LOC111440613 isoform X1 [Cucurbita moschata]3.9e-16275Show/hide
Query:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL
        R SL+ + L +SL V+         SS +  AG     L      +  L RR+ +VR PSSVFAHGFSESPLKSLRYLSTSSEI SSPKSAS  SN V+ 
Subjt:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL

Query:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF
        +N+E+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQSKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQTEEKTIASIKRF
Subjt:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF

Query:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL
        +GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFKEIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  
Subjt:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL

Query:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL
        AF RHPWCMMASEDKINGVMDFFVNKIGCE S+IA+RPAL+SLSLKKRI+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+KE IPGLLKL
Subjt:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL

Query:  YTEKLKDS
        Y EKL+DS
Subjt:  YTEKLKDS

XP_022933339.1 uncharacterized protein LOC111440613 isoform X2 [Cucurbita moschata]3.9e-16275Show/hide
Query:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL
        R SL+ + L +SL V+         SS +  AG     L      +  L RR+ +VR PSSVFAHGFSESPLKSLRYLSTSSEI SSPKSAS  SN V+ 
Subjt:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL

Query:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF
        +N+E+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQSKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQTEEKTIASIKRF
Subjt:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF

Query:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL
        +GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFKEIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  
Subjt:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL

Query:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL
        AF RHPWCMMASEDKINGVMDFFVNKIGCE S+IA+RPAL+SLSLKKRI+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+KE IPGLLKL
Subjt:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL

Query:  YTEKLKDS
        Y EKL+DS
Subjt:  YTEKLKDS

XP_022956116.1 uncharacterized protein LOC111457902 isoform X2 [Cucurbita moschata]3.7e-16080.56Show/hide
Query:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ
        L RR+++V  PSSVF HGFSESPLKSLRY STSSEI SSPKSAS  SN V+ +N+E+PVIV FENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQ
Subjt:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ

Query:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK
        SKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQTEEKTIASIKRF+GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFK
Subjt:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK

Query:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR
        EIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  AF RHPWCMMASEDKINGVMDFFVNKIGCE S+IA+RPAL+SLSLKKR
Subjt:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR

Query:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS
        I+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+KE IPGLLKLY EKL+DS
Subjt:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS

XP_022979833.1 uncharacterized protein LOC111479414 [Cucurbita maxima]2.4e-15980.28Show/hide
Query:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ
        L RR+++VR PSSVFAHGFSESP KSLRY STSS I SS KSAS  SN V+ +NNE+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQ
Subjt:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ

Query:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK
        SKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQALLQTEEKTIASIKRF+GILTWDL+I A PN+E L QIGVPDS I+TILQYQPRVFL +++RFK
Subjt:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK

Query:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR
        EIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  AF RHPWCMMASEDKINGVMDFFVNK+GCE S+IA+RPAL+SLSLKKR
Subjt:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR

Query:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS
        I+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+K+ IPGLLKLY +KL D+
Subjt:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS

XP_023527658.1 transcription termination factor MTERF4, chloroplastic-like [Cucurbita pepo subsp. pepo]1.7e-15779.44Show/hide
Query:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ
        L RR+ +VR PSSVFAH FSESPLKSLRY STSSEI SSPKSAS  SN V+ +N+E+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQ
Subjt:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ

Query:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK
        SKGLSSPE+AKL+ +FP++L RSL+R+IIPAFDYIQALLQTEEKTIASIKRF+GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFK
Subjt:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK

Query:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR
        EIVEEVKEMGFNP RL FV+AV A RAMSK+TW KKVEVY+KWGWSEEEI  AF RHPWCMM SEDKINGVMDFFVNKIG + S+IA+RPAL+SLSLKKR
Subjt:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR

Query:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS
        I+PRGSVYQVLLS+GLIKKD NL   FES ENRFL+KFI+P+K+ IPGLLKLY EKL+D+
Subjt:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS

TrEMBL top hitse value%identityAlignment
A0A6J1EK58 uncharacterized protein LOC1114340721.1e-15778.61Show/hide
Query:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ
        L RR+++VR PSSVFAHGFSESPLKSLRY STSSEI SSPKSAS  SN V+ +N+E+PVIV FENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQ
Subjt:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ

Query:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK
        SKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQT+EK IASIKR +GILTWDL+ +  PN+E L QIGVPDSNI+TILQYQPRVFL +++RFK
Subjt:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK

Query:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR
        EIVE+VKEMGFNP RLKFVLAVFA RAMSK TW KKVEVY+KWGW EEEI +AF RHPWCMMASEDKING+MDFFVNKIGCE+S++A+RPA+++LSLKKR
Subjt:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR

Query:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS
        I+PRG VYQVLLS+GLIKKD NL + FESTE+RFLDKFI+P+KE IPGLLKLY EKL+D+
Subjt:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS

A0A6J1EZH0 uncharacterized protein LOC111440613 isoform X21.9e-16275Show/hide
Query:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL
        R SL+ + L +SL V+         SS +  AG     L      +  L RR+ +VR PSSVFAHGFSESPLKSLRYLSTSSEI SSPKSAS  SN V+ 
Subjt:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL

Query:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF
        +N+E+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQSKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQTEEKTIASIKRF
Subjt:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF

Query:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL
        +GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFKEIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  
Subjt:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL

Query:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL
        AF RHPWCMMASEDKINGVMDFFVNKIGCE S+IA+RPAL+SLSLKKRI+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+KE IPGLLKL
Subjt:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL

Query:  YTEKLKDS
        Y EKL+DS
Subjt:  YTEKLKDS

A0A6J1F4G8 uncharacterized protein LOC111440613 isoform X11.9e-16275Show/hide
Query:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL
        R SL+ + L +SL V+         SS +  AG     L      +  L RR+ +VR PSSVFAHGFSESPLKSLRYLSTSSEI SSPKSAS  SN V+ 
Subjt:  RHSLAAIFLLWSLIVS---------SSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQL

Query:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF
        +N+E+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQSKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQTEEKTIASIKRF
Subjt:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRF

Query:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL
        +GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFKEIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  
Subjt:  MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRL

Query:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL
        AF RHPWCMMASEDKINGVMDFFVNKIGCE S+IA+RPAL+SLSLKKRI+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+KE IPGLLKL
Subjt:  AFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKL

Query:  YTEKLKDS
        Y EKL+DS
Subjt:  YTEKLKDS

A0A6J1GVP2 uncharacterized protein LOC111457902 isoform X21.8e-16080.56Show/hide
Query:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ
        L RR+++V  PSSVF HGFSESPLKSLRY STSSEI SSPKSAS  SN V+ +N+E+PVIV FENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQ
Subjt:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ

Query:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK
        SKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQ LLQTEEKTIASIKRF+GILTWDL+I A PN+E L QIGVPDS I+TIL+YQPRVFL +++RFK
Subjt:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK

Query:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR
        EIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  AF RHPWCMMASEDKINGVMDFFVNKIGCE S+IA+RPAL+SLSLKKR
Subjt:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR

Query:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS
        I+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+KE IPGLLKLY EKL+DS
Subjt:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS

A0A6J1IXF7 uncharacterized protein LOC1114794141.2e-15980.28Show/hide
Query:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ
        L RR+++VR PSSVFAHGFSESP KSLRY STSS I SS KSAS  SN V+ +NNE+PVIVFFENHGFSK+QISELVKKFPQVLSAN EKTLLPKLLFFQ
Subjt:  LFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQ

Query:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK
        SKGLSSPE+AKL+  FP++L RSL+R+IIPAFDYIQALLQTEEKTIASIKRF+GILTWDL+I A PN+E L QIGVPDS I+TILQYQPRVFL +++RFK
Subjt:  SKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFK

Query:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR
        EIVEEVKEMGFNP RL FVLAVFA RAMSK+TW KKVEVY+KWGW EEEI  AF RHPWCMMASEDKINGVMDFFVNK+GCE S+IA+RPAL+SLSLKKR
Subjt:  EIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKR

Query:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS
        I+PRGSVYQVLLS+GLIKKD NL   FES ENRFLDKFI+P+K+ IPGLLKLY +KL D+
Subjt:  ILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS

SwissProt top hitse value%identityAlignment
B6TGN4 Transcription termination factor MTERF4, chloroplastic4.1e-0521.62Show/hide
Query:  PVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILT
        PV+ + +      + +  +++++P++L    E T+   + +    G+   +V  +I+ FP VL   + + I P  ++++ +          I++   +L 
Subjt:  PVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILT

Query:  WDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEE
        + L+    PNIE L  IGV    +++I+   P V L   +R K + ++
Subjt:  WDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEE

F4JVI3 Transcription termination factor MTERF5, chloroplastic1.4e-0823.79Show/hide
Query:  KSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQ
        K A+ P  +  L    +PV+ F  + G  KS I  ++ K PQ+   +    L P + F ++ G+   + AK+IS FP +L  S  +++    +++     
Subjt:  KSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQ

Query:  TEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVF-LTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEV
        TEE+    + R   I+++ +     P +E  + + V   +++ +L   P+ F L+     K + E   E GF    +  +++    R  +  T+  K  V
Subjt:  TEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVF-LTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEV

Query:  YRKWGW
          KW +
Subjt:  YRKWGW

Q6AUK6 Transcription termination factor MTERF4, chloroplastic2.9e-0621.97Show/hide
Query:  LKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKR
        ++ N  PV+ +    G  +  + +L++++PQVL A+    L P + + Q   +   +V +++  +P +L   L   +  +  Y+  +     +  + I R
Subjt:  LKNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKR

Query:  FMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVF-LTSSIRFKEIVEEVKEMGFNPQRLKFVLAVF
        F  +L   +     P +E L+ IG+    I+ I++ +P V       + K  +E + E G   + L F++A +
Subjt:  FMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVF-LTSSIRFKEIVEEVKEMGFNPQRLKFVLAVF

Q9SZL6 Transcription termination factor MTERF6, chloroplastic/mitochondrial5.7e-0718.52Show/hide
Query:  NEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMG
        N   ++ FF + GF    I ++++K  Q+  A ++             G+   ++  ++S  P +L   L+ ++IP  + + +L +   +  ++I +F  
Subjt:  NEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMG

Query:  ILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTS-SIRFKEIVEEVKEMGFNPQRL--------KFVLAVFARRAMSKTTWDKKVEVYRKWGW
        IL+  +     P +   + +GVP++ +  ++ + PR+   S   +   IV  +  +G +   +         F++     + +  TT   K  V    G 
Subjt:  ILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTS-SIRFKEIVEEVKEMGFNPQRL--------KFVLAVFARRAMSKTTWDKKVEVYRKWGW

Query:  SEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARR----PALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKF
        SE+ I+      P  +    +KI      ++ + G   S IA      P ++  S+K  + PR      ++  G+ +  +    F    + +   +F
Subjt:  SEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARR----PALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKF

Q9ZT96 Transcription termination factor MTERF4, chloroplastic2.0e-0718.18Show/hide
Query:  PVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILT
        PV+ + +      S +  +++++P+VL    E T+   + +    G++  E+  +++ +P +L   + R I P  +Y++ L          I++   IL 
Subjt:  PVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILT

Query:  WDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEM-----GFNPQRLKFVLAVFAR-RAMSKTTWDKKVEVYRKWGWSEEEIR
        ++L  +  PN++IL+   V ++++ +I+   P +     I  K  ++  +++       NP+ L  ++    +  ++S++   K ++   K G+S ++ R
Subjt:  WDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEM-----GFNPQRLKFVLAVFAR-RAMSKTTWDKKVEVYRKWGWSEEEIR

Query:  LAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPR
              P  +  +   +    ++F  ++      +   PA  +  L+  + PR
Subjt:  LAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPR

Arabidopsis top hitse value%identityAlignment
AT1G21150.1 Mitochondrial transcription termination factor family protein9.6e-5836.14Show/hide
Query:  SPKSASLPSNFVQLKNNEEP--VIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQ
        S +SA   S FV+L ++++P  V+  F++HGF+  QI+ ++K FP+VLS + E  + PKL+FF S G S+ + AK+IS+ P +L  SL++++IP +D ++
Subjt:  SPKSASLPSNFVQLKNNEEP--VIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQ

Query:  ALLQTEEKTIASIKRFMGILTWDLRIS--AGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWD
        ++L  EE  +  +KR  GI  + L+I+      + I +++GVPD +I  ++Q  P  F +   RF E++  V   GF+P++  FV A+ A    S++  +
Subjt:  ALLQTEEKTIASIKRFMGILTWDLRIS--AGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWD

Query:  KKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKK-DANLSLFFESTENR
        +K ++++ +GWS+E+   A  R P C+  S++KI   +++ VN IG ++  I  RP ++SLS++KRI PR  V  +LLS+GL+KK D N     +   + 
Subjt:  KKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKK-DANLSLFFESTENR

Query:  FLDKFINPYKERIPGLLKLYT
        F+DKF+  Y++ +P L++ +T
Subjt:  FLDKFINPYKERIPGLLKLYT

AT1G61980.1 Mitochondrial transcription termination factor family protein2.8e-3329.31Show/hide
Query:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYI-QALLQTEEKTIASIKR
        K+N + V+    +HGF+ SQIS +V  +PQ+L A+ EK+L PKL F QS+G SS E+ +++S  P +L +  ++ I   +D+I + LL    K+  S + 
Subjt:  KNNEEPVIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYI-QALLQTEEKTIASIKR

Query:  F-MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEI
        F  G L   +R     N+ +L+++G+P   +  +L     V +    +F+E +++V EMGF+P   KFV A+   + +S    + KV  Y++ G+  E +
Subjt:  F-MGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEI

Query:  RLAFGRHPWCMMASEDKINGVM-----------------------------------DFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEG
           F R P  +  SE KI   +                                   +F V K+      +   PA++  SL+KR +PRG+V Q L+S+G
Subjt:  RLAFGRHPWCMMASEDKINGVM-----------------------------------DFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEG

Query:  LIKKD-ANLSLFFESTENRFLDKFINPYKER
        LI  +  ++S  F  T+  FL++++  ++++
Subjt:  LIKKD-ANLSLFFESTENRFLDKFINPYKER

AT5G07900.1 Mitochondrial transcription termination factor family protein2.3e-6741.09Show/hide
Query:  SLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEP--VIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRS
        +L YL  S  +  SP SA++ S  + L + E P  V+    +HGF+ +QIS LVKK P +L AN E  LLPKL FF S G+S   +A+ +++ PT+L RS
Subjt:  SLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEP--VIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRS

Query:  LNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVF
        L  Q+IP++++++++L ++EK +A+++R   +   D   +  PNI  + + GVP+  I  +L + P   +  +  F+ I ++ +EMGFNPQ+  FVLA+ 
Subjt:  LNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVF

Query:  ARRAM-SKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDAN
        A     +K+ WDK  EVY++WGWSE++I  AF +HP CMM SE KIN  M++FVN++      IA+ P ++  SL+KRI+PR SV +VL+S GL+K+D +
Subjt:  ARRAM-SKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDAN

Query:  LSLFFESTENRFLDKFINPYKERIPGLLKLY
        L+      E  FL+K +  Y+E +P L+ LY
Subjt:  LSLFFESTENRFLDKFINPYKERIPGLLKLY

AT5G23930.1 Mitochondrial transcription termination factor family protein3.7e-3324.94Show/hide
Query:  HLFRRMVLVRPPSSVFAHGFSE--SPLKSL-------RYLSTSSEIVSS-------PKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQV
        H +R + L+   SS F   FS   +  K L       R   T S ++ S        +S S+ +NF + K N + V+    ++GF  SQIS ++  +P+ 
Subjt:  HLFRRMVLVRPPSSVFAHGFSE--SPLKSL-------RYLSTSSEIVSS-------PKSASLPSNFVQLKNNEEPVIVFFENHGFSKSQISELVKKFPQV

Query:  LSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNIST
        L  N EKTL  KL F +  G SS E+ +++S  P +L +   + I   +DY++ +LQ ++ + +S KR         + +   N+ +L+++GVP   +  
Subjt:  LSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIGVPDSNIST

Query:  ILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVE--------------VYRKW--------------------------
        +L  + +  +    RF+E V+++ EMGF+P+  KFV A++    +S  T ++KV               V++KW                          
Subjt:  ILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVE--------------VYRKW--------------------------

Query:  ------------------------------GWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLS
                                      G++++E+ +   RHP C+  + D +    +F V  +G     +A  P ++  SL+K +LPR +V + L+S
Subjt:  ------------------------------GWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLS

Query:  EGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYT
         GLI +   +S    S + +FL  F+  +++ +P L  ++T
Subjt:  EGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYT

AT5G64950.1 Mitochondrial transcription termination factor family protein1.2e-3929.84Show/hide
Query:  PKSASLPSNFVQLKNNEEP--VIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQA
        P + ++   +  LK+ E+P  VI   +++ FS +QI + ++  P+++  N EK L PKL FF+  G +   + K +S   +V+  SL +++IP  + +++
Subjt:  PKSASLPSNFVQLKNNEEP--VIVFFENHGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQA

Query:  LLQTEEKTIASIKRFMG--ILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDK
        ++  + + +  I    G  +L+ D  +   PNI  L+  G+  S ++++L+ QPR+F  S  + +  V    ++GF       V AV +  ++S+ T+D+
Subjt:  LLQTEEKTIASIKRFMG--ILTWDLRISAGPNIEILKQIGVPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDK

Query:  KVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLI----KKDANLSLFFESTE
        KV+++   G+SE+EI     R P  +  SEDK+    +F++ ++G E   +A+RP ++S +L+KR++PR  V Q+L  +GL+    KK  N+    E TE
Subjt:  KVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSFIARRPALMSLSLKKRILPRGSVYQVLLSEGLI----KKDANLSLFFESTE

Query:  NRFLDKFINPYKERI
          FL+K++  + + I
Subjt:  NRFLDKFINPYKERI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCCCTCTCTGCTGGTCGCCGGCACTCCTTGGCCGCCATTTTCCTTCTCTGGTCTCTCATAGTTTCATCCTCTTGTGAGGCATTCGCGGGTTCTCCCATTCTTCTTCTCAA
GAAGGAAGCAGCAGTCATGTATCATTTGTTTCGTAGAATGGTTCTAGTTAGACCTCCATCATCGGTCTTTGCTCACGGATTTTCTGAAAGTCCGTTGAAATCTCTCAGAT
ACTTATCGACCTCTTCTGAGATTGTATCGTCTCCGAAATCTGCTTCGTTGCCTTCTAATTTTGTTCAGCTTAAGAACAACGAGGAGCCTGTAATTGTGTTCTTTGAGAAT
CATGGATTCTCTAAATCACAGATCTCTGAGCTCGTCAAGAAGTTCCCTCAGGTTCTTTCAGCGAACACCGAGAAAACCCTATTGCCCAAATTGTTGTTCTTTCAATCTAA
AGGCCTTTCTTCCCCCGAGGTTGCCAAATTAATATCTGCTTTTCCTACTGTTTTAAAACGAAGTTTAAACAGACAAATTATTCCAGCCTTTGATTACATTCAAGCCCTGC
TTCAAACTGAGGAGAAGACTATCGCATCAATAAAACGCTTCATGGGTATTCTCACTTGGGATCTTCGAATTTCAGCTGGTCCCAATATTGAAATTCTGAAACAAATTGGA
GTGCCTGATTCCAACATTTCAACCATTCTTCAGTACCAGCCTAGAGTGTTCTTGACAAGTTCCATCCGATTCAAGGAGATAGTAGAGGAAGTTAAGGAAATGGGATTCAA
CCCTCAGCGATTGAAGTTTGTTCTTGCTGTTTTTGCTCGGCGAGCAATGAGCAAAACTACATGGGATAAGAAGGTTGAAGTTTATAGGAAATGGGGATGGTCTGAAGAAG
AGATCCGTTTGGCTTTTGGAAGGCATCCATGGTGTATGATGGCTTCTGAGGATAAGATTAATGGTGTAATGGATTTTTTTGTCAACAAGATTGGATGTGAATCTTCTTTC
ATTGCTAGGAGACCTGCTCTAATGTCACTTAGTTTAAAGAAGAGGATTTTGCCTAGAGGCTCTGTTTATCAAGTTTTGCTGTCAGAGGGTTTGATTAAGAAAGATGCGAA
TCTTTCTTTGTTCTTTGAGTCTACTGAAAACCGCTTCTTAGACAAATTCATCAACCCTTATAAGGAGCGGATTCCTGGATTGTTGAAGTTATATACAGAGAAATTAAAGG
ATTCT
mRNA sequenceShow/hide mRNA sequence
TCCCTCTCTGCTGGTCGCCGGCACTCCTTGGCCGCCATTTTCCTTCTCTGGTCTCTCATAGTTTCATCCTCTTGTGAGGCATTCGCGGGTTCTCCCATTCTTCTTCTCAA
GAAGGAAGCAGCAGTCATGTATCATTTGTTTCGTAGAATGGTTCTAGTTAGACCTCCATCATCGGTCTTTGCTCACGGATTTTCTGAAAGTCCGTTGAAATCTCTCAGAT
ACTTATCGACCTCTTCTGAGATTGTATCGTCTCCGAAATCTGCTTCGTTGCCTTCTAATTTTGTTCAGCTTAAGAACAACGAGGAGCCTGTAATTGTGTTCTTTGAGAAT
CATGGATTCTCTAAATCACAGATCTCTGAGCTCGTCAAGAAGTTCCCTCAGGTTCTTTCAGCGAACACCGAGAAAACCCTATTGCCCAAATTGTTGTTCTTTCAATCTAA
AGGCCTTTCTTCCCCCGAGGTTGCCAAATTAATATCTGCTTTTCCTACTGTTTTAAAACGAAGTTTAAACAGACAAATTATTCCAGCCTTTGATTACATTCAAGCCCTGC
TTCAAACTGAGGAGAAGACTATCGCATCAATAAAACGCTTCATGGGTATTCTCACTTGGGATCTTCGAATTTCAGCTGGTCCCAATATTGAAATTCTGAAACAAATTGGA
GTGCCTGATTCCAACATTTCAACCATTCTTCAGTACCAGCCTAGAGTGTTCTTGACAAGTTCCATCCGATTCAAGGAGATAGTAGAGGAAGTTAAGGAAATGGGATTCAA
CCCTCAGCGATTGAAGTTTGTTCTTGCTGTTTTTGCTCGGCGAGCAATGAGCAAAACTACATGGGATAAGAAGGTTGAAGTTTATAGGAAATGGGGATGGTCTGAAGAAG
AGATCCGTTTGGCTTTTGGAAGGCATCCATGGTGTATGATGGCTTCTGAGGATAAGATTAATGGTGTAATGGATTTTTTTGTCAACAAGATTGGATGTGAATCTTCTTTC
ATTGCTAGGAGACCTGCTCTAATGTCACTTAGTTTAAAGAAGAGGATTTTGCCTAGAGGCTCTGTTTATCAAGTTTTGCTGTCAGAGGGTTTGATTAAGAAAGATGCGAA
TCTTTCTTTGTTCTTTGAGTCTACTGAAAACCGCTTCTTAGACAAATTCATCAACCCTTATAAGGAGCGGATTCCTGGATTGTTGAAGTTATATACAGAGAAATTAAAGG
ATTCT
Protein sequenceShow/hide protein sequence
SLSAGRRHSLAAIFLLWSLIVSSSCEAFAGSPILLLKKEAAVMYHLFRRMVLVRPPSSVFAHGFSESPLKSLRYLSTSSEIVSSPKSASLPSNFVQLKNNEEPVIVFFEN
HGFSKSQISELVKKFPQVLSANTEKTLLPKLLFFQSKGLSSPEVAKLISAFPTVLKRSLNRQIIPAFDYIQALLQTEEKTIASIKRFMGILTWDLRISAGPNIEILKQIG
VPDSNISTILQYQPRVFLTSSIRFKEIVEEVKEMGFNPQRLKFVLAVFARRAMSKTTWDKKVEVYRKWGWSEEEIRLAFGRHPWCMMASEDKINGVMDFFVNKIGCESSF
IARRPALMSLSLKKRILPRGSVYQVLLSEGLIKKDANLSLFFESTENRFLDKFINPYKERIPGLLKLYTEKLKDS