; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g39650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g39650
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMitochondrial transcription termination factor family protein
Genome locationchr4:29416704..29419173
RNA-Seq ExpressionMoc04g39650
SyntenyMoc04g39650
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsIPR003690 - Transcription termination factor, mitochondrial/chloroplastic
IPR038538 - MTERF superfamily, mitochondrial/chloroplastic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578327.1 hypothetical protein SDJN03_22775, partial [Cucurbita argyrosperma subsp. sororia]6.1e-10384.86Show/hide
Query:  SWMLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAE
        S MLGK+LVSPISTVDS TR CFSTP   TAK +  CSNVSFS +P K +  D RR SP PRKWEIHSTAQVESL+LSDEDKKTWEACRQALSMFSFS E
Subjt:  SWMLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAE

Query:  EQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYN
        EQDKMLGKAFGHIHSPYWGEDRKKEVP I+ VN+ILEYLRTLGLS+DDL KLLKKFPEVLGCNLEQELKTN+QLL+KEWGIQGKSLR+LLLRNPKVLGYN
Subjt:  EQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYN

Query:  VDCKGDCMAKCTRCWVRF
        VDCKGDC+AKCTRCWVRF
Subjt:  VDCKGDCMAKCTRCWVRF

XP_022152521.1 uncharacterized protein LOC111020227 isoform X1 [Momordica charantia]2.2e-12195.15Show/hide
Query:  MLGKALVSPISTVDSATRFCFST-----------PGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQA
        MLGKALVSPISTVDSATRFCFST           PGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQA
Subjt:  MLGKALVSPISTVDSATRFCFST-----------PGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQA

Query:  LSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLL
        LSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLL
Subjt:  LSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLL

Query:  RNPKVLGYNVDCKGDCMAKCTRCWVRF
        RNPKVLGYNVDCKGDCMAKCTRCWVRF
Subjt:  RNPKVLGYNVDCKGDCMAKCTRCWVRF

XP_022152523.1 uncharacterized protein LOC111020227 isoform X2 [Momordica charantia]6.3e-124100Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDCMAKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_022938853.1 uncharacterized protein LOC111444935 [Cucurbita moschata]8.0e-10385.19Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGK+LVSPISTVDS TR CFSTP   TAK +  CSNVSFS +P K +  D RR SP PRKWEIHSTAQVESL+LSDEDKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+ VN+ILEYLRTLGLS+DDL KLLKKFPEVLGCNLEQELKTN+QLL+KEWGIQGKSLR+LLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

XP_023549996.1 uncharacterized protein LOC111808316 [Cucurbita pepo subsp. pepo]1.8e-10285.19Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGK+LVSPISTVDS TR CFSTP   TAK +  CSNVSFS +P K +  D RR SP PRKWEIHSTAQVESL+LSDEDKKTWEACRQALS FSFS EEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+ VN+ILEYLRTLGLS+DDL KLLKKFPEVLGCNLEQELKTN+QLL+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

TrEMBL top hitse value%identityAlignment
A0A1S3B2X0 uncharacterized protein LOC1034854181.2e-9983.33Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGK+L SPIST+DSATRFC ST    TA  D  CSNVSFS  P K+  K  RR +P PRKWE+ S+ QVESLILSDEDKKTWEACRQALS+FSFS EEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKK+VP IEIVNDILEYLRTLGLS+DDL KLLKKFPEVLGCNLEQELKTN+QLLDKEWGIQGKSLRNLLLRNPKVLGY VD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1DF32 uncharacterized protein LOC111020227 isoform X23.0e-124100Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDCMAKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1DI00 uncharacterized protein LOC111020227 isoform X11.1e-12195.15Show/hide
Query:  MLGKALVSPISTVDSATRFCFST-----------PGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQA
        MLGKALVSPISTVDSATRFCFST           PGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQA
Subjt:  MLGKALVSPISTVDSATRFCFST-----------PGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQA

Query:  LSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLL
        LSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLL
Subjt:  LSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLL

Query:  RNPKVLGYNVDCKGDCMAKCTRCWVRF
        RNPKVLGYNVDCKGDCMAKCTRCWVRF
Subjt:  RNPKVLGYNVDCKGDCMAKCTRCWVRF

A0A6J1FK31 uncharacterized protein LOC1114449353.9e-10385.19Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGK+LVSPISTVDS TR CFSTP   TAK +  CSNVSFS +P K +  D RR SP PRKWEIHSTAQVESL+LSDEDKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+ VN+ILEYLRTLGLS+DDL KLLKKFPEVLGCNLEQELKTN+QLL+KEWGIQGKSLR+LLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

A0A6J1JTP6 uncharacterized protein LOC1114896501.1e-10284.72Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ
        MLGK+LVSPISTVDS TR CFSTP   TAK    CSNVSFS +P K +  D RR SP PRKWEIHST+QVESL+LSD+DKKTWEACRQALSMFSFS EEQ
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDARRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQ

Query:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD
        DKMLGKAFGHIHSPYWGEDRKKEVP I+ VN+ILEYLRTLGLS+DDL KLLKKFPEVLGCNLEQELKTN+QLL+KEWGIQGKSLRNLLLRNPKVLGYNVD
Subjt:  DKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVD

Query:  CKGDCMAKCTRCWVRF
        CKGDC+AKCTRCWVRF
Subjt:  CKGDCMAKCTRCWVRF

SwissProt top hitse value%identityAlignment
Q9ZT96 Transcription termination factor MTERF4, chloroplastic2.7e-0528.75Show/hide
Query:  IVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK
        I+  ++EYL  LG+      +L++K P +LG  L+  +K N+Q+L +++ ++  SL +++ + P+++G ++  K D   K
Subjt:  IVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK

Arabidopsis top hitse value%identityAlignment
AT4G02990.1 Mitochondrial transcription termination factor family protein1.9e-0628.75Show/hide
Query:  IVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK
        I+  ++EYL  LG+      +L++K P +LG  L+  +K N+Q+L +++ ++  SL +++ + P+++G ++  K D   K
Subjt:  IVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAK

AT4G09620.1 Mitochondrial transcription termination factor family protein1.8e-6557.99Show/hide
Query:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNV--SFSNRPWKNVTKDARRHSPCPRKWEIH-STAQVESLILSDEDKKTWEACRQALSMFSFSA
        M+G +L SP++T+ SA  F  S    VT       +NV     N  +  V    R  S    KW +  ST QVE+   S+ED   WE C++ALS F FS 
Subjt:  MLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNV--SFSNRPWKNVTKDARRHSPCPRKWEIH-STAQVESLILSDEDKKTWEACRQALSMFSFSA

Query:  EEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGY
        EE+DK+LGKAFGHIHSPYW E+R KE PK+E +N ILE+LR+LGLSD+DL K++KKFPEVLGC+LE+E+K NI +L+ +WGI GK LRNLLLRNPKVLGY
Subjt:  EEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEVLGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGY

Query:  NVDCKGDCMAKCTRCWVRF
        NVDCKGDC+A+CTRCWVRF
Subjt:  NVDCKGDCMAKCTRCWVRF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGCGAACCGCGAACATGGCTATTGGCTGCAAGCAAAGGACAAGAGGAGAATCCGTGCAATTAGTCTTCAAATTTCAGCTACTTTTGATTGGATTCAAGCA
GTTTACTGTTTCTTTTCCGCGTTCACTAAGCTCCTCATGGCTTCTGCTTCTTGGATGCTAGGAAAAGCATTGGTGTCTCCCATATCAACAGTTGATTCTGCAACT
CGCTTCTGCTTTTCTACTCCTGGCTTTGTGACAGCCAAACCAGATGGTGCATGTTCAAACGTGAGCTTCTCCAATCGTCCTTGGAAGAACGTGACAAAGGATGCG
AGGAGACATAGTCCCTGTCCCAGGAAGTGGGAAATCCATTCAACTGCTCAAGTTGAAAGCTTGATATTAAGTGATGAGGATAAGAAGACATGGGAAGCTTGCCGA
CAAGCTCTGTCCATGTTCAGCTTCAGTGCTGAGGAGCAGGATAAGATGCTCGGAAAGGCGTTCGGCCACATTCATTCGCCCTACTGGGGCGAAGACAGAAAGAAG
GAAGTCCCAAAGATTGAAATTGTAAATGACATACTGGAATATCTGAGGACACTCGGCCTTTCTGATGATGATCTCTGTAAGCTTCTAAAAAAATTCCCAGAAGTT
CTTGGATGCAATCTTGAGCAGGAGCTGAAAACCAATATACAATTGTTGGACAAGGAGTGGGGAATTCAAGGCAAATCACTCAGGAACCTTCTTCTGCGTAATCCC
AAGGTATTGGGCTACAATGTTGATTGTAAAGGAGACTGCATGGCGAAATGCACCAGATGCTGGGTTCGATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGCGAACCGCGAACATGGCTATTGGCTGCAAGCAAAGGACAAGAGGAGAATCCGTGCAATTAGTCTTCAAATTTCAGCTACTTTTGATTGGATTCAAGCA
GTTTACTGTTTCTTTTCCGCGTTCACTAAGCTCCTCATGGCTTCTGCTTCTTGGATGCTAGGAAAAGCATTGGTGTCTCCCATATCAACAGTTGATTCTGCAACT
CGCTTCTGCTTTTCTACTCCTGGCTTTGTGACAGCCAAACCAGATGGTGCATGTTCAAACGTGAGCTTCTCCAATCGTCCTTGGAAGAACGTGACAAAGGATGCG
AGGAGACATAGTCCCTGTCCCAGGAAGTGGGAAATCCATTCAACTGCTCAAGTTGAAAGCTTGATATTAAGTGATGAGGATAAGAAGACATGGGAAGCTTGCCGA
CAAGCTCTGTCCATGTTCAGCTTCAGTGCTGAGGAGCAGGATAAGATGCTCGGAAAGGCGTTCGGCCACATTCATTCGCCCTACTGGGGCGAAGACAGAAAGAAG
GAAGTCCCAAAGATTGAAATTGTAAATGACATACTGGAATATCTGAGGACACTCGGCCTTTCTGATGATGATCTCTGTAAGCTTCTAAAAAAATTCCCAGAAGTT
CTTGGATGCAATCTTGAGCAGGAGCTGAAAACCAATATACAATTGTTGGACAAGGAGTGGGGAATTCAAGGCAAATCACTCAGGAACCTTCTTCTGCGTAATCCC
AAGGTATTGGGCTACAATGTTGATTGTAAAGGAGACTGCATGGCGAAATGCACCAGATGCTGGGTTCGATTCTAG
Protein sequenceShow/hide protein sequence
MGANREHGYWLQAKDKRRIRAISLQISATFDWIQAVYCFFSAFTKLLMASASWMLGKALVSPISTVDSATRFCFSTPGFVTAKPDGACSNVSFSNRPWKNVTKDA
RRHSPCPRKWEIHSTAQVESLILSDEDKKTWEACRQALSMFSFSAEEQDKMLGKAFGHIHSPYWGEDRKKEVPKIEIVNDILEYLRTLGLSDDDLCKLLKKFPEV
LGCNLEQELKTNIQLLDKEWGIQGKSLRNLLLRNPKVLGYNVDCKGDCMAKCTRCWVRF