; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001051 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001051
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionMitochondrial transcription termination factor family protein
Genome locationscaffold36:1082150..1083286
RNA-Seq ExpressionMS001051
SyntenyMS001051
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009658 - chloroplast organization (biological process)
GO:0032502 - developmental process (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022132136.1 transcription termination factor MTERF8, chloroplastic-like [Momordica charantia]3.1e-20999.47Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVA AKRIHLKRTANPDSVIALFEAYGFA SNTASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
        NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

XP_022951074.1 uncharacterized protein LOC111454032 [Cucurbita moschata]6.4e-17583.11Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLRL A+DL+KF  N  +I     SSLS+ISQSTNNRTVDYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P+LLLADPDT LKPK EFLS+NG +G VLV+VISRDP ILRRSL KQI+PCIDFLR FFGSTD IVSLFSARRGTWVL KFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
         IAK+ WVRPRTL+RDAE F DIVEKTKEAGFNPSS MFIYGLCTFSGMKKDKWLSKL +F SFGWS+EQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFM+ EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

XP_023537738.1 uncharacterized protein LOC111798674 isoform X1 [Cucurbita pepo subsp. pepo]3.8e-17582.59Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLR+ A+DL+KF  N  +I     SSLS+IS+STNNRT+DYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P+LLLADPDTTLKPK EFLS+NG +G VLVDVIS DP ILRRSL KQI+PCIDFLRNFFGSTD +VSLFSARRGTWVL KFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
         IAK+ WVRP TL+RDAE F DIVEKTKEAGF+PSS MFIYGLCTFSGMKKDKWLSKL +F SFGWSEEQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFMI EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

XP_023537739.1 uncharacterized protein LOC111798674 isoform X2 [Cucurbita pepo subsp. pepo]1.3e-17582.85Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLR+ A+DL+KF  N  +I     SSLS+IS+STNNRT+DYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P LLLADPDTTLKPK EFLS+NG +G VLVDVISRDP ILRRSL KQI+PCIDFLRNFFGSTD +VSLFSARRGTWVL KFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
         IAK+ WVRP TL+RDAE F DIVEKTKEAGF+PSS MFIYGLCTFSGMKKDKWLSKL +F SFGWSEEQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFMI EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

XP_023537741.1 uncharacterized protein LOC111798675 isoform X2 [Cucurbita pepo subsp. pepo]6.4e-17582.85Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLRL A+DL+KF  N  +I     SSLS+IS+STNNRTVDYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P+LLLADPDTTLKPK EFLS+NG +G VLVDVISRDPSILRRSL KQI+PCIDFLRNFFGSTDG++SLFSARRGTWVL  FSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
        NIAK+ WVRPRTL+RDAE F DIVEKTKEAGFNPSS MF YGLCTF GMKKDKWLSKLQ+F SFGWSEEQFQSLFLKQP  MNSSEEQIK+ALDF MNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEI KYP VL+LSFEKRV+PRSSILQHLISKGFIKKT+ G+AFMI EDKFLVKFVMQYLS+ PHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

TrEMBL top hitse value%identityAlignment
A0A6J1BRE6 transcription termination factor MTERF8, chloroplastic-like1.5e-20999.47Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVA AKRIHLKRTANPDSVIALFEAYGFA SNTASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
        NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

A0A6J1GGM9 uncharacterized protein LOC1114540313.4e-17482.06Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLRL A+DL+KF  N  +I     SSLS+ISQSTNNRTVDYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCR+
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P+LLLADPDT LKPK EFLS+NG +G VLVDVISRDPSILRRSL KQI+PCIDFLRNFFGSTDG+VSLFSARRGTWVL KF+ESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
        NIAK+ W+RPRTL+R+AE F DIVEKTKEAGFNPSS MF YGLCTF GMKKDKWLSKL +F SFGWSEEQFQSLFLKQP  MNSSEEQIK+ALDF MNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEISKYP VL+LSFEKRV+PRSSILQHL+SKGFIKKT+ G+AFM+ EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

A0A6J1GHP2 uncharacterized protein LOC1114540323.1e-17583.11Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLRL A+DL+KF  N  +I     SSLS+ISQSTNNRTVDYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCRN
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P+LLLADPDT LKPK EFLS+NG +G VLV+VISRDP ILRRSL KQI+PCIDFLR FFGSTD IVSLFSARRGTWVL KFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
         IAK+ WVRPRTL+RDAE F DIVEKTKEAGFNPSS MFIYGLCTFSGMKKDKWLSKL +F SFGWS+EQFQSLFLKQP FMNSSEE+IKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEIS+YPIVL+LSFEKRV+PRSSILQHL+SKGFIKKTS G+AFM+ EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

A0A6J1KMV5 uncharacterized protein LOC1114960445.9e-17483.11Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLR  A++L+KF  N  +I  N  SSLS+IS+STNNRTVDYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCR+
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P LLLADPDT LKPK+EFLS+NG +G VLVDVISRDPSILRRSL   I+PCIDFLRNFFGSTDGIVSLFSARRGTWVLH FSESVAPNIE LRA GV DS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
        NIAKMIW RPRTL+RDAE F DIVEKTKEAGFNPSS MF YGLCTFS MKKDKWLSKL +F SFGWSEEQFQSLFLKQP FM SSEEQIKRALDFFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEISKYP VL+LSFEKRV+PRSSILQHLISKGFIKKTS G+AF+I EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

A0A6J1KSF9 uncharacterized protein LOC1114960452.2e-17382.06Show/hide
Query:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN
        M+NFLFK PLRL A++L+KF  N  +I     SSLS+IS+STNNRTVDYLV TLG SKDSA+AAAKRIHLK TANPDSVIALF+AYGF  S+TASIFCR+
Subjt:  MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRN

Query:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS
        P+LLLADPDT LKPK+EFLS+NG +G VLVDVISRDPSILRRSL KQI+PCIDFLRNFFGSTDGIVSLFSARRGTWVL KFSESVAPNIE LRA GVPDS
Subjt:  PSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDS

Query:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL
        NIAK+ WVRPRTL+RDAE F DIVEKTKEAGFNPSS MF YGLCT+ GMKKDKWLSKL +F SFGWSEEQFQSLFLKQP  MNSSEE+IKRAL+FFMNKL
Subjt:  NIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKL

Query:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI
        DWT EEISKYP +L+LS EKRV+PRSSILQHLISKGFIKKTS G+AF++ EDKFLVKFVMQYLS+DPHLLEMYQKKMA+
Subjt:  DWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI

SwissProt top hitse value%identityAlignment
F4IHL3 Transcription termination factor MTERF2, chloroplastic3.3e-0423.44Show/hide
Query:  VIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVL
        ++  F   G        I    P L   D + T+ PK+ FL + G+  + + +++ + PS+L  SL K+I P + FL    G T   +    A     + 
Subjt:  VIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVL

Query:  HKFSESVAPNIERLRAYGVPDSNIAKMI
              + PN+    + G+    + +MI
Subjt:  HKFSESVAPNIERLRAYGVPDSNIAKMI

Q84X53 Transcription termination factor MTEF1, chloroplastic2.0e-0630.71Show/hide
Query:  SVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQN-GVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNF-FGSTDGIVSLFSARRGT
        SV  L  + G +      I    P LL +DP++ + P L FLS    +S Q +   ISR P +L  S+D Q+ P + FL+   F   D I S    R   
Subjt:  SVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQN-GVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNF-FGSTDGIVSLFSARRGT

Query:  WVLHKFSESVAPNIERL-RAYGVPDSNIAKMIWVRPRTLS
         ++     ++ P IE L    G     +AKM+   P  L+
Subjt:  WVLHKFSESVAPNIERL-RAYGVPDSNIAKMIWVRPRTLS

Q9FK23 Transcription termination factor MTERF8, chloroplastic9.2e-0719.31Show/hide
Query:  KRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFL-SQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDG
        K ++ K   + + +I+  E +G        I  R P +L +D D+ L P+++F+ + +G        V+ R P+IL  S++  +   ++FL++F G T  
Subjt:  KRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFL-SQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDG

Query:  IVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSF
         V          +       + P IE L+  G     + K +   P  L+         +    + G+   +    + +   +    D     + ++ S+
Subjt:  IVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSF

Query:  GWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKF
        G S E   ++  K P+ +  +   ++  L++ +  +    EE+  +P  L    + R+  R    + L S+G  +  S+ K   +  ++F
Subjt:  GWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAFMIREDKF

Q9M219 Transcription termination factor MTEF18, mitochondrial5.9e-0619.75Show/hide
Query:  IALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLH
        I +F   G        + CRN SL L   +  L  K  +  + GVS +    +I R+P+I+   L+K +I     L++F    D + ++  A++  +V  
Subjt:  IALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLH

Query:  KFSESVAPNIERL-----RAYGVPDSN----IAKMIWVRP-RTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWL-----------SKL
        +      P + R      R + +  +     +A    + P   L R+ +  ++ ++ ++    N   L F++ +         K L            + 
Subjt:  KFSESVAPNIERL-----RAYGVPDSN----IAKMIWVRP-RTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWL-----------SKL

Query:  QVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKT-SVGKAFMIREDKFLVK
        Q+  + G    +   L    PK +N     I+  L F   ++  + + +  +P  L    E R+ PR    + L+ KGF +K+ S+       E  F+ +
Subjt:  QVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKT-SVGKAFMIREDKFLVK

Query:  FVMQYLSKDPHLLEMYQKK
            + +   H  E +  +
Subjt:  FVMQYLSKDPHLLEMYQKK

Q9SZL6 Transcription termination factor MTERF6, chloroplastic/mitochondrial1.8e-0722.26Show/hide
Query:  NPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQ-NGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARR
        N  S++  F   GF   +   +  +   L  A  D       ++LS   G+  + L  ++SR P IL   LD+++IP ++ L +  G     V+    + 
Subjt:  NPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQ-NGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARR

Query:  GTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFID-IVEKTKEAGFNPSSLM--FIYGLCTFSGMKKDKWLSKLQVF--KSFGWSE
           + H   E + P +   +A GVP++ + KMI   PR +S   +  +  IV      G +   ++   +       G   DK L     F   S G SE
Subjt:  GTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFID-IVEKTKEAGFNPSSLM--FIYGLCTFSGMKKDKWLSKLQVF--KSFGWSE

Query:  EQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISK----YPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAF
        +  +S+ +  P+ +     +I +    ++ +  +   +I+     YP +L  S +  + PR   L  ++ +G  +  S  + F
Subjt:  EQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISK----YPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVGKAF

Arabidopsis top hitse value%identityAlignment
AT1G21150.1 Mitochondrial transcription termination factor family protein1.9e-4430Show/hide
Query:  TVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLD
        TV YLV + GLS +SA + ++ + L  +  PDSV+ALF+ +GF      S+    P +L   P+  + PKL F S  G S      +IS  P +L  SL 
Subjt:  TVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLD

Query:  KQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCT
        K++IPC D L++     + +V         + L K +  V+  +   R  GVPD +I  ++   P T       F +++ +    GF+P    F++ +  
Subjt:  KQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLCT

Query:  FSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVG-
        F    +     K ++F+ FGWS+E F +  ++ P  +  S+E+I   L++ +N +     +I   P+VL LS EKR+ PR+ ++  L+SKG +KK  +  
Subjt:  FSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTSVG-

Query:  -KAFMIREDKFLVKFVMQYLSKDPHLLEMY
             ++  +F+ KFV++Y  + P L++ +
Subjt:  -KAFMIREDKFLVKFVMQYLSKDPHLLEMY

AT1G61980.1 Mitochondrial transcription termination factor family protein9.0e-3427.96Show/hide
Query:  NSFSSLS------QISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNG
        NSFSS S      ++ +   + TV YLV +LGL K  A + ++++  +   NPDSV+ L  ++GF  S  ++I    P LL+AD + +L PKL+FL   G
Subjt:  NSFSSLS------QISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNG

Query:  VSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFF--GSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDA----
         S   L +++S  P IL +   K I    DF++      S+    S     +G        E+   N+  LR  G+P     K+++  P  +S D     
Subjt:  VSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFF--GSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDA----

Query:  -EGFIDIVEKTKEAGFNPSSLMFIYGLC-----------------------------------TFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKF
         E F + ++K  E GF+PS+  F+  LC                                    F    + K L+ ++ F   G+S ++F  L  + P+ 
Subjt:  -EGFIDIVEKTKEAGFNPSSLMFIYGLC-----------------------------------TFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKF

Query:  MNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFI--KKTSVGKAFMIREDKFLVKFVMQYLSK--DPHLLEMYQ
        +  S E +K+  +F + K++W  + +   P VL  S EKR +PR +++Q LISKG I  +  S+ + F+  +  FL ++V ++  K  +  L+ +Y+
Subjt:  MNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFI--KKTSVGKAFMIREDKFLVKFVMQYLSK--DPHLLEMYQ

AT1G61990.1 Mitochondrial transcription termination factor family protein2.5e-3127.3Show/hide
Query:  NSFSSLSQISQST------NNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNG
        NSFSS      S        N TV YLV +LGLSK  A + ++++  +   NPDSV++LF +YGF  S  ++I    P LL+AD    L  KL+ L   G
Subjt:  NSFSSLSQISQST------NNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNG

Query:  VSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFF-GSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFID
         S   + +++S  P IL +   K I    D +++     T     L    +G  +          N+  LR  G+P   +  ++  + + +    E F  
Subjt:  VSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFF-GSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFID

Query:  IVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQP--------KFMNSSE---------------------------E
         ++K  E GF+P++  F+  L     M +     K+ VF+S G++ +    +F K P        K + S+E                           E
Subjt:  IVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQP--------KFMNSSE---------------------------E

Query:  QIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTS----VGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQK
         +K+  +F + K+ W    +  +P V   S EKR+IPR +IL+ L+SKG ++K S    V       ++ FL ++VM++    P L+ ++ K
Subjt:  QIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKTS----VGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQK

AT5G07900.1 Mitochondrial transcription termination factor family protein1.8e-5335.05Show/hide
Query:  TVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLD
        T++YL+ + GLS DSA  A++++ L     P++V+ L   +GF T+  +S+  + P LLLA+ ++ L PKL F    GVS  +L   ++ DP+IL RSL 
Subjt:  TVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLD

Query:  KQIIPCIDFLRNFFGSTDGIVSLFSARRGTWV-LHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLC
         Q+IP  +FL++   S + IV+  + RR TWV L   ++++ PNI  +   GVP+  I  ++   P  + +    F  I ++ +E GFNP    F+  + 
Subjt:  KQIIPCIDFLRNFFGSTDGIVSLFSARRGTWV-LHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFIDIVEKTKEAGFNPSSLMFIYGLC

Query:  TFSGM-KKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKT-S
          SG   K  W    +V++ +GWSE+     F K P  M  SE +I R +++F+N+++  P  I++ P+VLF S EKR+IPR S+ + L+S G +K+  S
Subjt:  TFSGM-KKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQHLISKGFIKKT-S

Query:  VGKAFMIREDKFLVKFVMQYLSKDPHLLEMY
        +    +  E  FL K V++Y  + P L+ +Y
Subjt:  VGKAFMIREDKFLVKFVMQYLSKDPHLLEMY

AT5G64950.1 Mitochondrial transcription termination factor family protein2.6e-3326.96Show/hide
Query:  SIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRI-HLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGV
        ++ L+ FSS +  +  +N   V++L R  G  K  A+A A R  +LK    P SVI + ++Y F+ +        +P ++  + +  L+PKL F    G 
Subjt:  SIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRI-HLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDTTLKPKLEFLSQNGV

Query:  SGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSES--VAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFID
        +G  L   +S++ S++  SL K++IP ++ L++        + +  +R G W+L     +  + PNI  L   G+  S +A ++  +PR  +   E    
Subjt:  SGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSES--VAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGFID

Query:  IVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRV
         V +  + GF  +S M ++ + + S + +  +  K+++F + G+SE++   +  + P  +  SE+++    +F++ ++    E ++K P VL  + EKRV
Subjt:  IVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRV

Query:  IPRSSILQHLISKGFIKKTSVGKAFMI-----REDKFLVKFVMQY
        IPR  +LQ L  KG + K    K  M+      E+ FL K+V+++
Subjt:  IPRSSILQHLISKGFIKKTSVGKAFMI-----REDKFLVKFVMQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAATTTTCTGTTCAAAGCTCCGCTGCGTCTCCTTGCTCGCGATCTCCGGAAGTTTACAGATAACAGCATCTCAATTGGCCTCAATTCCTTCTCCTCTCTGTCTCA
AATCTCTCAATCAACCAACAATCGGACAGTAGACTACCTCGTTCGCACCCTCGGGCTCTCCAAGGATTCGGCTGTCGCAGCTGCCAAGCGCATCCATCTCAAACGAACCG
CCAATCCAGACTCCGTTATCGCTCTCTTCGAGGCCTATGGATTCGCTACGTCCAACACTGCCAGCATCTTCTGTAGGAATCCCAGTCTCCTTCTTGCCGATCCGGACACT
ACACTCAAGCCCAAACTCGAGTTTCTCTCTCAAAACGGCGTATCGGGTCAAGTTCTGGTCGATGTGATATCCAGGGACCCGTCCATCCTCAGAAGGAGTTTGGACAAGCA
GATTATTCCTTGTATTGATTTCCTTAGGAATTTCTTTGGCTCTACCGATGGTATTGTCTCGCTCTTTTCTGCTAGACGTGGGACTTGGGTTTTGCACAAGTTTTCGGAAT
CTGTGGCTCCCAATATCGAACGGCTGAGAGCTTACGGTGTTCCTGATTCAAACATCGCCAAAATGATTTGGGTGCGCCCGAGGACTCTCTCCAGGGACGCGGAAGGGTTC
ATTGACATTGTGGAGAAGACAAAGGAGGCGGGTTTTAATCCTTCAAGCTTGATGTTTATTTATGGGCTGTGTACATTTTCAGGGATGAAAAAGGACAAATGGTTGTCAAA
GCTGCAGGTTTTTAAAAGCTTTGGGTGGTCAGAGGAGCAGTTTCAATCGTTATTTCTTAAGCAGCCCAAGTTTATGAATTCGTCCGAGGAGCAAATAAAGAGGGCCTTGG
ATTTCTTTATGAACAAATTAGACTGGACGCCCGAAGAAATTTCCAAGTACCCAATTGTGCTCTTTCTGAGTTTTGAAAAGAGGGTGATACCGAGGTCGTCTATCCTCCAG
CACTTGATATCAAAAGGTTTTATCAAGAAGACGAGTGTTGGCAAGGCATTTATGATTCGCGAGGATAAGTTCTTGGTTAAGTTCGTGATGCAGTATCTTTCAAAGGACCC
ACATCTACTAGAGATGTACCAGAAGAAGATGGCGATT
mRNA sequenceShow/hide mRNA sequence
ATGTCCAATTTTCTGTTCAAAGCTCCGCTGCGTCTCCTTGCTCGCGATCTCCGGAAGTTTACAGATAACAGCATCTCAATTGGCCTCAATTCCTTCTCCTCTCTGTCTCA
AATCTCTCAATCAACCAACAATCGGACAGTAGACTACCTCGTTCGCACCCTCGGGCTCTCCAAGGATTCGGCTGTCGCAGCTGCCAAGCGCATCCATCTCAAACGAACCG
CCAATCCAGACTCCGTTATCGCTCTCTTCGAGGCCTATGGATTCGCTACGTCCAACACTGCCAGCATCTTCTGTAGGAATCCCAGTCTCCTTCTTGCCGATCCGGACACT
ACACTCAAGCCCAAACTCGAGTTTCTCTCTCAAAACGGCGTATCGGGTCAAGTTCTGGTCGATGTGATATCCAGGGACCCGTCCATCCTCAGAAGGAGTTTGGACAAGCA
GATTATTCCTTGTATTGATTTCCTTAGGAATTTCTTTGGCTCTACCGATGGTATTGTCTCGCTCTTTTCTGCTAGACGTGGGACTTGGGTTTTGCACAAGTTTTCGGAAT
CTGTGGCTCCCAATATCGAACGGCTGAGAGCTTACGGTGTTCCTGATTCAAACATCGCCAAAATGATTTGGGTGCGCCCGAGGACTCTCTCCAGGGACGCGGAAGGGTTC
ATTGACATTGTGGAGAAGACAAAGGAGGCGGGTTTTAATCCTTCAAGCTTGATGTTTATTTATGGGCTGTGTACATTTTCAGGGATGAAAAAGGACAAATGGTTGTCAAA
GCTGCAGGTTTTTAAAAGCTTTGGGTGGTCAGAGGAGCAGTTTCAATCGTTATTTCTTAAGCAGCCCAAGTTTATGAATTCGTCCGAGGAGCAAATAAAGAGGGCCTTGG
ATTTCTTTATGAACAAATTAGACTGGACGCCCGAAGAAATTTCCAAGTACCCAATTGTGCTCTTTCTGAGTTTTGAAAAGAGGGTGATACCGAGGTCGTCTATCCTCCAG
CACTTGATATCAAAAGGTTTTATCAAGAAGACGAGTGTTGGCAAGGCATTTATGATTCGCGAGGATAAGTTCTTGGTTAAGTTCGTGATGCAGTATCTTTCAAAGGACCC
ACATCTACTAGAGATGTACCAGAAGAAGATGGCGATT
Protein sequenceShow/hide protein sequence
MSNFLFKAPLRLLARDLRKFTDNSISIGLNSFSSLSQISQSTNNRTVDYLVRTLGLSKDSAVAAAKRIHLKRTANPDSVIALFEAYGFATSNTASIFCRNPSLLLADPDT
TLKPKLEFLSQNGVSGQVLVDVISRDPSILRRSLDKQIIPCIDFLRNFFGSTDGIVSLFSARRGTWVLHKFSESVAPNIERLRAYGVPDSNIAKMIWVRPRTLSRDAEGF
IDIVEKTKEAGFNPSSLMFIYGLCTFSGMKKDKWLSKLQVFKSFGWSEEQFQSLFLKQPKFMNSSEEQIKRALDFFMNKLDWTPEEISKYPIVLFLSFEKRVIPRSSILQ
HLISKGFIKKTSVGKAFMIREDKFLVKFVMQYLSKDPHLLEMYQKKMAI