; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007944 (gene) of Snake gourd v1 genome

Gene IDTan0007944
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationLG02:92016527..92020975
RNA-Seq ExpressionTan0007944
SyntenyTan0007944
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0032502 - developmental process (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132136.1 transcription termination factor MTERF8, chloroplastic-like [Momordica charantia]6.7e-18889.47Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        M+NFLFK PLRL A+DL+KFT+N   IGL S SSLSQISQSTNNRTVDYLV TLGLS+DSA+A AKRIHLK TANPDSV+ALF+AYGF P +TASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        PSLLLADP+  LKPK EFLSQNG+SG VLVDVISRDPSIL RSLDKQI+PCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAKMIWVRPRTLSRDAE F  IVEKTKE GFNPSSLMFIYGLCT SGMKKDKWLSKL +FKSFGWSEEQFQSLFLKQP FMNSSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDP LLEMYQKKMAIL
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

XP_022951071.1 uncharacterized protein LOC111454030 [Cucurbita moschata]7.4e-17985.26Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFK+PLRLFAQDLQKF  N T I  K +SSLS++S+STNNRTVDYLVHTLG S+DSAL AAKRIHLKPTANPDSV+ALFKAYGFT  DTASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+  LKPKFEFLS+NG +GHVLVDVISRDPSIL RSL KQIVPCID LRNFFGST+GIVSLFS RRGTWVL  FSESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAKMIWVRPRTL+RDAEEF  IVEKTKE GFNPSS MFI GLCTL GMKKDKWLSKLHIF SFGWSEEQFQSLF KQP  MNSSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEISKYP VL LSFEKRV+PRSSILQHLISKGFIKKTS G+AFMI EDKFL KFV+QYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

XP_022951072.1 uncharacterized protein LOC111454031 [Cucurbita moschata]8.7e-18084.74Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFKVPLRLFAQDLQKF  N T I  K +SSLS+ISQSTNNRTVDYLVHTLG S+DSALAAAKRIHLK TANPDSV+ALFKAYGFT  DTASIFCR+
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+ ILKPKFEFLS+NG +GHVLVDVISRDPSIL RSL KQIVPCIDFLRNFFGSTDG+VSLFSARRGTWVL KF+ESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAK+ W+RPRTL+R+AEEF  IVEKTKE GFNPSS MF YGLCT  GMKKDKWLSKLHIF SFGWSEEQFQSLFLKQP  MNSSEEQIK+ALDF MNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEISKYP VL+LSFEKRV+PRSSILQHL+SKGFIKKT+ G+AFM+ EDKFLVKFVMQYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

XP_022951074.1 uncharacterized protein LOC111454032 [Cucurbita moschata]7.9e-18185.79Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFKVPLRLFAQDLQKF  N T I  K +SSLS+ISQSTNNRTVDYLVHTLG S+DSALAAAKRIHLK TANPDSV+ALFKAYGFT  DTASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+ ILKPKFEFLS+NG +GHVLV+VISRDP IL RSL KQIVPCIDFLR FFGSTD IVSLFSARRGTWVL KFSESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
         IAK+ WVRPRTL+RDAEEF  IVEKTKE GFNPSS MFIYGLCT SGMKKDKWLSKLHIF SFGWS+EQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFM+ EDKFLVKFVMQYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

XP_023537739.1 uncharacterized protein LOC111798674 isoform X2 [Cucurbita pepo subsp. pepo]4.3e-17984.74Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFKVPLR+FAQDLQKF  N T I  K +SSLS+IS+STNNRT+DYLVHTLG S+DSALAAAKRIHLK TANPDSV+ALFKAYGFT  DTASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P LLLADP+  LKPKFEFLS+NG +GHVLVDVISRDP IL RSL KQIVPCIDFLRNFFGSTD +VSLFSARRGTWVL KFSESVAPNIE+LRA GVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
         IAK+ WVRP TL+RDAEEF  IVEKTKE GF+PSS MFIYGLCT SGMKKDKWLSKLHIF SFGWSEEQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFMI EDKFLVKFVMQYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

TrEMBL top hitse value%identityAlignment
A0A6J1BRE6 transcription termination factor MTERF8, chloroplastic-like3.2e-18889.47Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        M+NFLFK PLRL A+DL+KFT+N   IGL S SSLSQISQSTNNRTVDYLV TLGLS+DSA+A AKRIHLK TANPDSV+ALF+AYGF P +TASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        PSLLLADP+  LKPK EFLSQNG+SG VLVDVISRDPSIL RSLDKQI+PCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAKMIWVRPRTLSRDAE F  IVEKTKE GFNPSSLMFIYGLCT SGMKKDKWLSKL +FKSFGWSEEQFQSLFLKQP FMNSSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDP LLEMYQKKMAIL
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

A0A6J1GGI9 uncharacterized protein LOC1114540303.6e-17985.26Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFK+PLRLFAQDLQKF  N T I  K +SSLS++S+STNNRTVDYLVHTLG S+DSAL AAKRIHLKPTANPDSV+ALFKAYGFT  DTASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+  LKPKFEFLS+NG +GHVLVDVISRDPSIL RSL KQIVPCID LRNFFGST+GIVSLFS RRGTWVL  FSESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAKMIWVRPRTL+RDAEEF  IVEKTKE GFNPSS MFI GLCTL GMKKDKWLSKLHIF SFGWSEEQFQSLF KQP  MNSSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEISKYP VL LSFEKRV+PRSSILQHLISKGFIKKTS G+AFMI EDKFL KFV+QYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

A0A6J1GGM9 uncharacterized protein LOC1114540314.2e-18084.74Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFKVPLRLFAQDLQKF  N T I  K +SSLS+ISQSTNNRTVDYLVHTLG S+DSALAAAKRIHLK TANPDSV+ALFKAYGFT  DTASIFCR+
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+ ILKPKFEFLS+NG +GHVLVDVISRDPSIL RSL KQIVPCIDFLRNFFGSTDG+VSLFSARRGTWVL KF+ESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAK+ W+RPRTL+R+AEEF  IVEKTKE GFNPSS MF YGLCT  GMKKDKWLSKLHIF SFGWSEEQFQSLFLKQP  MNSSEEQIK+ALDF MNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEISKYP VL+LSFEKRV+PRSSILQHL+SKGFIKKT+ G+AFM+ EDKFLVKFVMQYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

A0A6J1GHP2 uncharacterized protein LOC1114540323.8e-18185.79Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFKVPLRLFAQDLQKF  N T I  K +SSLS+ISQSTNNRTVDYLVHTLG S+DSALAAAKRIHLK TANPDSV+ALFKAYGFT  DTASIFCRN
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+ ILKPKFEFLS+NG +GHVLV+VISRDP IL RSL KQIVPCIDFLR FFGSTD IVSLFSARRGTWVL KFSESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
         IAK+ WVRPRTL+RDAEEF  IVEKTKE GFNPSS MFIYGLCT SGMKKDKWLSKLHIF SFGWS+EQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFM+ EDKFLVKFVMQYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

A0A6J1KSF9 uncharacterized protein LOC1114960453.9e-17884.47Show/hide
Query:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN
        MTNFLFKVPLRLFAQ+LQKF  N   I  K +SSLS+IS+STNNRTVDYLVHTLG S+DSALAAAKRIHLK TANPDSV+ALFKAYGFT  DTASIFCR+
Subjt:  MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRN

Query:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS
        P+LLLADP+ ILKPK EFLS+NG +GHVLVDVISRDPSIL RSL KQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVL KFSESVAPNIE+LRANGVPDS
Subjt:  PSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDS

Query:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL
        NIAK+ WVRPRTL+RDAEEF  IVEKTKE GFNPSS MF YGLCT  GMKKDKWLSKLHIF SFGWSEEQFQSLFLKQP  MNSSEE+IKRAL+FFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL
        DWT EEISKYP +L+LS EKRV+PRSSILQHLISKGFIKKTS G+AF++ EDKFLVKFVMQYLS+DP LLEMYQKKMA+L
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL

SwissProt top hitse value%identityAlignment
F4IHL3 Transcription termination factor MTERF2, chloroplastic7.3e-0423.2Show/hide
Query:  KSISSLSQISQSTN--NRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLSQNGLSGH
        K I+ L +   ST    R + Y  H +G S +           KP      ++  F   G        I    P L   D  K + PK  FL + G+   
Subjt:  KSISSLSQISQSTN--NRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLSQNGLSGH

Query:  VLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEFR
         + +++ + PS+L  SL K+I P + FL    G T   +    A     +       + PN+    + G+    + +MI   P  L  + +  R
Subjt:  VLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEFR

Q9FK23 Transcription termination factor MTERF8, chloroplastic2.4e-0719.31Show/hide
Query:  SQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGF-TPFDTASIFCRNPSLLLADPNKILKPKFEFL-SQNGLSGHVLVDVI
        + I  S N +    L+H  G+ ++       +++L       SV  + +   F  PF    I  R P +L +D +  L P+ +F+ + +G        V+
Subjt:  SQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGF-TPFDTASIFCRNPSLLLADPNKILKPKFEFL-SQNGLSGHVLVDVI

Query:  SRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFN
         R P+IL  S++  +   ++FL++F G T   V          +       + P IE L+  G     + K +   P  L+       + +    + G+ 
Subjt:  SRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEFRYIVEKTKEKGFN

Query:  PSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLI
          +    + +  ++    D     + ++ S+G S E   ++  K P  +  +   ++  L++ +  +    EE+  +P  L    + R+  R    + L 
Subjt:  PSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLI

Query:  SKGFIKKTSVGKAFMIREDKF
        S+G  +  S+ K   +  ++F
Subjt:  SKGFIKKTSVGKAFMIREDKF

Q9SZL6 Transcription termination factor MTERF6, chloroplastic/mitochondrial2.5e-0424Show/hide
Query:  LFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKF
        L  + G +     S+    P LL  D NKILKP +++L + G     +  +++  P IL +S+   + P I FL    G     V+ +      +  H  
Subjt:  LFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKF

Query:  SESVAPNIEVLRANGVPDSNIAKMI
         + V    ++++ N + D ++ +M+
Subjt:  SESVAPNIEVLRANGVPDSNIAKMI

Arabidopsis top hitse value%identityAlignment
AT1G21150.1 Mitochondrial transcription termination factor family protein1.8e-4529.01Show/hide
Query:  LKSISSLSQISQSTNNR--------TVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLS
        ++++ S+SQ+      +        TV YLV + GLS +SA + ++ + L  +  PDSVLALFK +GFT     S+    P +L   P  ++ PK  F S
Subjt:  LKSISSLSQISQSTNNR--------TVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLS

Query:  QNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEF
          G S      +IS  P +L  SL K+++PC D L++     + +V         + L K +  V+  + + R  GVPD +I  ++   P T       F
Subjt:  QNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEF

Query:  RYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEK
          ++ +    GF+P    F++ +       +     K  +F+ FGWS+E F +  ++ P  +  S+E+I   L++ +N +     +I   P+VL LS EK
Subjt:  RYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEK

Query:  RVIPRSSILQHLISKGFIKKTSVG--KAFMIREDKFLVKFVMQYLSKDPQLLEMY
        R+ PR+ ++  L+SKG +KK  +       ++  +F+ KFV++Y  + PQL++ +
Subjt:  RVIPRSSILQHLISKGFIKKTSVG--KAFMIREDKFLVKFVMQYLSKDPQLLEMY

AT1G61970.1 Mitochondrial transcription termination factor family protein4.1e-3425.86Show/hide
Query:  FAQDLQKFTENVTRIGLKSISSLS-QISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKI
        F+  +QK +         S + LS ++ +  NN TV YLV +LGL+   A + ++++  +   NPDSVL L  ++GFT    ++I    P LL+AD  K 
Subjt:  FAQDLQKFTENVTRIGLKSISSLS-QISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKI

Query:  LKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFS----ESVAPNIEVLRANGVPDSNIAKMIW
        L PK +FL   G S   + +++S  P ILG+   K I    DF+++          L  + +   + H       E+   N+ VLR  G+P   +  ++ 
Subjt:  LKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFS----ESVAPNIEVLRANGVPDSNIAKMIW

Query:  VRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSE---------------------
           + +    E+F   ++K  E GF+P++  F+  L  +  M +     K+H++KS G+      S F K P+ +  SE                     
Subjt:  VRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSE---------------------

Query:  --------------EQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKK--TSVGKAFMIREDKFLVKFVMQYLSKD--PQ
                      E +K+  +F + K++W  + +   P V   S EKR++PR ++++ L+SKG ++    S+    M  +  FL ++V  ++ K    +
Subjt:  --------------EQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKK--TSVGKAFMIREDKFLVKFVMQYLSKD--PQ

Query:  LLEMYQ
        L+ +Y+
Subjt:  LLEMYQ

AT1G61980.1 Mitochondrial transcription termination factor family protein1.8e-3426.76Show/hide
Query:  LRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPN
        LR+  Q     + + +      +S   ++ +   + TV YLV +LGL +  A + ++++  +   NPDSVL L +++GFT    ++I    P LL+AD  
Subjt:  LRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPN

Query:  KILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFF--GSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIW
        K L PK +FL   G S   L +++S  P ILG+   K I    DF++      S+    S     +G        E+   N+ VLR  G+P     K+++
Subjt:  KILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFF--GSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIW

Query:  VRPRTLSRDA-----EEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSE----------------
          P  +S D      E+F   ++K  E GF+PS+  F+  LC +  +       K++ +K  G+  E   ++F + P F+  SE                
Subjt:  VRPRTLSRDA-----EEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSE----------------

Query:  -------------------EQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFI--KKTSVGKAFMIREDKFLVKFVMQYLSK
                           E +K+  +F + K++W  + +   P VL  S EKR +PR +++Q LISKG I  +  S+ + F+  +  FL ++V ++  K
Subjt:  -------------------EQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFI--KKTSVGKAFMIREDKFLVKFVMQYLSK

Query:  --DPQLLEMYQ
          + +L+ +Y+
Subjt:  --DPQLLEMYQ

AT5G07900.1 Mitochondrial transcription termination factor family protein7.1e-5533.7Show/hide
Query:  PLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADP
        P+ +F+   Q F+  VT +  K+     Q  Q   + T++YL+ + GLS DSA  A++++ L     P++VL L + +GFT    +S+  + P LLLA+ 
Subjt:  PLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADP

Query:  NKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWV-LHKFSESVAPNIEVLRANGVPDSNIAKMIW
          +L PK  F    G+S  +L   ++ DP+IL RSL  Q++P  +FL++   S + IV+  + RR TWV L   ++++ PNI  +   GVP+  I  ++ 
Subjt:  NKILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWV-LHKFSESVAPNIEVLRANGVPDSNIAKMIW

Query:  VRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGM-KKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEE
          P  + +   EF+ I ++ +E GFNP    F+  +  LSG   K  W     +++ +GWSE+     F K P  M  SE +I R +++F+N+++  P  
Subjt:  VRPRTLSRDAEEFRYIVEKTKEKGFNPSSLMFIYGLCTLSGM-KKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEE

Query:  ISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKT-SVGKAFMIREDKFLVKFVMQYLSKDPQLLEMY
        I++ P+VLF S EKR+IPR S+ + L+S G +K+  S+    +  E  FL K V++Y  + P+L+ +Y
Subjt:  ISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKT-SVGKAFMIREDKFLVKFVMQYLSKDPQLLEMY

AT5G64950.1 Mitochondrial transcription termination factor family protein3.1e-3427.03Show/hide
Query:  IGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRI-HLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLSQNGLS
        + L   SS +  +  +N   V++L    G  +  A+A A R  +LK    P SV+ + K+Y F+          +P ++  +  KIL+PK  F    G +
Subjt:  IGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRI-HLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNKILKPKFEFLSQNGLS

Query:  GHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSES--VAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEFRYI
        G  L   +S++ S++G SL K+++P ++ L++        + +  +R G W+L     +  + PNI  L   G+  S +A ++  +PR  +   E+ R  
Subjt:  GHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSES--VAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEFRYI

Query:  VEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVI
        V +  + GF  +S M ++ + +LS + +  +  K+ +F + G+SE++   +  + P  +  SE+++    +F++ ++    E ++K P VL  + EKRVI
Subjt:  VEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVI

Query:  PRSSILQHLISKGFIKKTSVGKAFMI-----REDKFLVKFVMQY
        PR  +LQ L  KG + K    K  M+      E+ FL K+V+++
Subjt:  PRSSILQHLISKGFIKKTSVGKAFMI-----REDKFLVKFVMQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAATTTTCTGTTCAAAGTTCCTCTGCGTCTCTTTGCTCAAGATCTCCAGAAGTTTACAGAGAACGTCACCAGAATTGGCCTCAAATCCATCTCTTCTCTGTCACA
AATTTCTCAATCAACCAACAATCGGACAGTCGACTACCTCGTCCACACCCTCGGCCTCTCCCAGGACTCAGCTCTCGCCGCTGCCAAGCGAATCCATCTCAAACCCACCG
CAAATCCCGACTCTGTTCTTGCTCTCTTCAAGGCATATGGGTTCACACCGTTCGACACTGCTAGCATCTTTTGCAGAAATCCCAGTCTCCTCCTAGCGGATCCGAACAAA
ATACTCAAACCCAAGTTCGAGTTTCTCTCTCAAAACGGCTTGTCCGGTCATGTTCTCGTCGACGTGATCTCCAGGGATCCGTCAATTCTCGGAAGGAGTTTGGATAAGCA
GATTGTTCCCTGTATTGATTTTCTCAGAAATTTCTTTGGTTCTACTGATGGTATTGTATCACTCTTTTCTGCTAGACGTGGGACTTGGGTTTTGCACAAGTTTTCCGAAT
CTGTGGCTCCCAATATCGAAGTATTGAGAGCTAATGGCGTGCCCGACTCAAACATCGCGAAGATGATTTGGGTGCGTCCGAGGACACTCTCAAGGGACGCGGAAGAGTTC
AGATACATCGTCGAGAAGACTAAGGAGAAGGGTTTTAATCCTTCGAGCTTGATGTTTATTTATGGGCTGTGTACACTTTCAGGGATGAAAAAGGACAAGTGGTTGTCGAA
ACTGCATATTTTTAAAAGTTTTGGGTGGTCAGAGGAGCAGTTTCAATCTCTATTTCTCAAGCAACCCATGTTTATGAATTCATCCGAGGAGCAAATAAAGAGGGCCTTGG
ATTTTTTTATGAACAAATTAGACTGGACGCCCGAAGAAATTTCCAAGTACCCAATTGTGCTATTTCTTAGTTTTGAAAAGAGGGTGATACCGAGGTCGTCTATTCTTCAG
CACCTGATATCAAAAGGTTTTATCAAGAAGACGAGCGTTGGCAAGGCATTTATGATTAGAGAGGACAAGTTTTTGGTCAAGTTTGTGATGCAGTATCTTTCTAAGGATCC
ACAGCTACTAGAGATGTACCAGAAGAAGATGGCAATTTTATGA
mRNA sequenceShow/hide mRNA sequence
CTAAAATATAGTCTGCTGCACTACATCTCAAACACAATTCTCTTATTCTACTTCAAAACATTGCTGATTCTGAACCTCGGCCACAAACACCCCCCTAAAGTTTTATTTTC
AACCGAAAATTCAAAATTCTCGATACTAGTGGGCCGTCCATACGCAAACCTCGATTAAATCCCCCATTCGCAAACCTCCTCGTTCACTTTTCAATGACCAATTTTCTGTT
CAAAGTTCCTCTGCGTCTCTTTGCTCAAGATCTCCAGAAGTTTACAGAGAACGTCACCAGAATTGGCCTCAAATCCATCTCTTCTCTGTCACAAATTTCTCAATCAACCA
ACAATCGGACAGTCGACTACCTCGTCCACACCCTCGGCCTCTCCCAGGACTCAGCTCTCGCCGCTGCCAAGCGAATCCATCTCAAACCCACCGCAAATCCCGACTCTGTT
CTTGCTCTCTTCAAGGCATATGGGTTCACACCGTTCGACACTGCTAGCATCTTTTGCAGAAATCCCAGTCTCCTCCTAGCGGATCCGAACAAAATACTCAAACCCAAGTT
CGAGTTTCTCTCTCAAAACGGCTTGTCCGGTCATGTTCTCGTCGACGTGATCTCCAGGGATCCGTCAATTCTCGGAAGGAGTTTGGATAAGCAGATTGTTCCCTGTATTG
ATTTTCTCAGAAATTTCTTTGGTTCTACTGATGGTATTGTATCACTCTTTTCTGCTAGACGTGGGACTTGGGTTTTGCACAAGTTTTCCGAATCTGTGGCTCCCAATATC
GAAGTATTGAGAGCTAATGGCGTGCCCGACTCAAACATCGCGAAGATGATTTGGGTGCGTCCGAGGACACTCTCAAGGGACGCGGAAGAGTTCAGATACATCGTCGAGAA
GACTAAGGAGAAGGGTTTTAATCCTTCGAGCTTGATGTTTATTTATGGGCTGTGTACACTTTCAGGGATGAAAAAGGACAAGTGGTTGTCGAAACTGCATATTTTTAAAA
GTTTTGGGTGGTCAGAGGAGCAGTTTCAATCTCTATTTCTCAAGCAACCCATGTTTATGAATTCATCCGAGGAGCAAATAAAGAGGGCCTTGGATTTTTTTATGAACAAA
TTAGACTGGACGCCCGAAGAAATTTCCAAGTACCCAATTGTGCTATTTCTTAGTTTTGAAAAGAGGGTGATACCGAGGTCGTCTATTCTTCAGCACCTGATATCAAAAGG
TTTTATCAAGAAGACGAGCGTTGGCAAGGCATTTATGATTAGAGAGGACAAGTTTTTGGTCAAGTTTGTGATGCAGTATCTTTCTAAGGATCCACAGCTACTAGAGATGT
ACCAGAAGAAGATGGCAATTTTATGAACGCTAGGAATGCAATTCAGGACGAAAATATTCAAAGTTGTGGATTTTGAAAAGTCCAGGTTTTATGGACATTCAGTTTCAGGC
AGGGTGGTCTTTGAAAATAGATTGAGATTCTGACCCGGTTTTGTTGGTGATTTTACCAAATTTTGGGATAATAGAGATTAATATCTTGTAAATTAATGGACTTACTAATG
GTTGGTTAGCTCAAATTATTTATTATAATTTCTCAATTCTTTGAAATCTTAGTATTTTTCATCAATTTTTTTTGTCATCTCATTTTTTTTCTGGTAACAATAGGTGGAGC
TGGAGATTTGAAACTTTGACGTTGTATTGCGACAAGTTGTATCAATTATAAATTCGGTTGTCTAGGACCGACCTTTCAATTGGAGGCTTTTTTTATGTGTGTGTATATAG
TTTTGTAATCAATCAAATAATTTATTTACATTTGAAGATGAATGCATGGTGATTTGATGGACGC
Protein sequenceShow/hide protein sequence
MTNFLFKVPLRLFAQDLQKFTENVTRIGLKSISSLSQISQSTNNRTVDYLVHTLGLSQDSALAAAKRIHLKPTANPDSVLALFKAYGFTPFDTASIFCRNPSLLLADPNK
ILKPKFEFLSQNGLSGHVLVDVISRDPSILGRSLDKQIVPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIEVLRANGVPDSNIAKMIWVRPRTLSRDAEEF
RYIVEKTKEKGFNPSSLMFIYGLCTLSGMKKDKWLSKLHIFKSFGWSEEQFQSLFLKQPMFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQ
HLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPQLLEMYQKKMAIL