; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009978 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009978
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGenomic DNA, chromosome 3, P1 clone: MJL12
Genome locationscaffold223:45553..46329
RNA-Seq ExpressionMS009978
SyntenyMS009978
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593013.1 hypothetical protein SDJN03_12489, partial [Cucurbita argyrosperma subsp. sororia]2.2e-3348.13Show/hide
Query:  EKNKMTEESCLS-FI--FFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEE
        EK +  E S LS F+     FFSL+ SHPLYF Y LFFSPYLLRLL F+SPLL TT   FLLI                +FPLP+    H  W N VF  
Subjt:  EKNKMTEESCLS-FI--FFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEE

Query:  PVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKK
        P  ++ T    + E   NK+E + +      SR+  I    E  ++S  +  K FED        +D MDLLWE YE K+     L            KK
Subjt:  PVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKK

Query:  DLRSLVN-EKEAEEGEEAEEE---EIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL
        DLRSLVN +KE EE EE EEE   E GKICCLQALK STGKMR G+GK+SGL KISKAFKGLK LH L
Subjt:  DLRSLVN-EKEAEEGEEAEEE---EIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL

KAG7025422.1 hypothetical protein SDJN02_11917, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-3347.41Show/hide
Query:  EKNKMTEESCLS-FI--FFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEE
        EK +  E S LS F+     FFSL+ SHPLYF Y LFFSPYLLRLL F+SPLL TT   FLLI                +FPLP+    H  W N VF  
Subjt:  EKNKMTEESCLS-FI--FFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEE

Query:  PVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKK
        P  ++ T    + E   NK+E + +      SR+  I    E  ++S  +  K FED        +D MDLLWE YE K+     L            KK
Subjt:  PVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKK

Query:  DLRSLVN------EKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL
        DLRSLVN      E+E EE EE EE E GKICCLQALK STGKMR G+GK+SGL KISKAFKGLK LH L
Subjt:  DLRSLVN------EKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL

XP_022157214.1 uncharacterized protein LOC111023976 [Momordica charantia]1.2e-12497.22Show/hide
Query:  MTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKEED
        MTEE CLSFIFFNFFSLIFSHPLYFFYLLFFSPYLL+LLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE D
Subjt:  MTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKEED

Query:  RSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEI
        RS SSSSCSSRKKQIKQRDEWRRRSAQLSFKFFED Y DNNNKDDEMDLLWEMYEAKES MGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEI
Subjt:  RSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEI

Query:  GKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS
        GKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS
Subjt:  GKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS

XP_022960018.1 uncharacterized protein LOC111460896 isoform X1 [Cucurbita moschata]8.9e-3550Show/hide
Query:  FFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE
        FFSL+ SHPLYF Y LFFSPYLLRLL F+SPLL TT   FLLI                +FPLP+    H  W N VF  P  ++ T    + E   NK+
Subjt:  FFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE

Query:  EDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVN-EKEAEEGEEAEE
        E + +      SR+  I    E  ++S  +  K FED        +D MDLLWE YE K+     L            KKDLRSLVN +KE EE EE EE
Subjt:  EDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVN-EKEAEEGEEAEE

Query:  E-EIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL
        E E GKICCLQALK STGKMR G+GK+SGL KISKAFKGLK LHHL
Subjt:  E-EIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL

XP_023513792.1 uncharacterized protein LOC111778295 [Cucurbita pepo subsp. pepo]5.8e-3446.72Show/hide
Query:  KKMEKNKMTEESCLS----FI--FFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWL
        KKM K +  EE   S    F+     FFSL+ SHPLYF Y LFFSPYL RLL F+SPLL TT   FLLI                +FPLP+    H  W 
Subjt:  KKMEKNKMTEESCLS----FI--FFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWL

Query:  NLVFEEPVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGD
        N VF  P  ++ T    + E   NK+E + +      SR+  I    E  ++S  +  K FED        +D+MDLLWE YE K+     L        
Subjt:  NLVFEEPVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGD

Query:  DNSKKKDLRSLVN----EKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL
            KKDLRSLVN     +E EE EE EEEE GKICCLQALK STGKMR G+GK+SGL KISKAFKGLK LH L
Subjt:  DNSKKKDLRSLVN----EKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL

TrEMBL top hitse value%identityAlignment
A0A0A0K4L5 Genomic DNA, chromosome 3, P1 clone: MJL121.1e-3043.21Show/hide
Query:  EKNKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFL---LIPSFPLPRQDDLHHY------WLNLVF---EEPVREVE
        E+ K++  S        F SLI SHPLYF Y LFFSPY+L++L F SPLL+ T L  L   L   F    Q+  H        W N  F   + P+ E E
Subjt:  EKNKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFL---LIPSFPLPRQDDLHHY------WLNLVF---EEPVREVE

Query:  TKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRR---------SAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNS
         +  +I + I N+EE +      C   +  I  R+  +            + +  K FED        +++MDLLWE YE KE  +    + +K+    S
Subjt:  TKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRR---------SAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNS

Query:  KKKDLRSLVN---EKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS
        KKKDLRSLVN   E E  E +E EEEE GKICCLQALKFST KMR G+GK++GL KISKAFKGLKFLH L  +GK   HS
Subjt:  KKKDLRSLVN---EKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS

A0A1S4E1X2 uncharacterized protein LOC1079915923.0e-2844.12Show/hide
Query:  EKNKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFL---LIPSFPLPRQDDLHHY------WLNLVF---EEPVREVE
        E+ K++  S        F SLI SHPLYF Y LFFSPY+L++L FLSPL T T L  L   L   F    Q+  H        W N  F   + P+ E E
Subjt:  EKNKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFL---LIPSFPLPRQDDLHHY------WLNLVF---EEPVREVE

Query:  TKAKDITEVIPNKEE--------DRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSK
         +  +I + I N+EE        D  +   S  +  K+ K         + +  K FED        +++MDLLWE YE +E  +    + +K+    SK
Subjt:  TKAKDITEVIPNKEE--------DRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSK

Query:  KKDLRSLVN-EKEAEE-GEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHG
        KKDLRSLVN +KE EE  EE EEEE GKICCLQALKFS+ KMR G+GK++GL KISKAFKGLKFLH L  +G
Subjt:  KKDLRSLVN-EKEAEE-GEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHG

A0A6J1DVV3 uncharacterized protein LOC1110239765.9e-12597.22Show/hide
Query:  MTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKEED
        MTEE CLSFIFFNFFSLIFSHPLYFFYLLFFSPYLL+LLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE D
Subjt:  MTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKEED

Query:  RSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEI
        RS SSSSCSSRKKQIKQRDEWRRRSAQLSFKFFED Y DNNNKDDEMDLLWEMYEAKES MGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEI
Subjt:  RSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEI

Query:  GKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS
        GKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS
Subjt:  GKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS

A0A6J1H9Q5 uncharacterized protein LOC111460896 isoform X14.3e-3550Show/hide
Query:  FFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE
        FFSL+ SHPLYF Y LFFSPYLLRLL F+SPLL TT   FLLI                +FPLP+    H  W N VF  P  ++ T    + E   NK+
Subjt:  FFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE

Query:  EDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVN-EKEAEEGEEAEE
        E + +      SR+  I    E  ++S  +  K FED        +D MDLLWE YE K+     L            KKDLRSLVN +KE EE EE EE
Subjt:  EDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVN-EKEAEEGEEAEE

Query:  E-EIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL
        E E GKICCLQALK STGKMR G+GK+SGL KISKAFKGLK LHHL
Subjt:  E-EIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL

A0A6J1KYN7 uncharacterized protein LOC1114975701.2e-3245.9Show/hide
Query:  KKMEKNKMTEESCLSFIFFN----FFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNL
        KKM K +   E   S  F +     FSL+ SHPLYF Y LFFSPYLLRLL F+SPLL TT   FLLI                +FPLP+    H  W N 
Subjt:  KKMEKNKMTEESCLSFIFFN----FFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLI---------------PSFPLPRQDDLHHYWLNL

Query:  VFEEPVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDN
        VF  P   + T    + E   NK+E + +      +R+  I    E  ++S  +  K FED        +D+MDLLWE YE K+     L          
Subjt:  VFEEPVREVETKAKDITEVIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDN

Query:  SKKKDLRSLVNEKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL
          KKDLRSLVN ++  E EE EEEE GKICCLQALK STGKMR G+GK+SGL KISKAFKG K LH L
Subjt:  SKKKDLRSLVNEKEAEEGEEAEEEEIGKICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G25130.1 unknown protein8.4e-1539.75Show/hide
Query:  VIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEG
        V  N+EED     S  S RK++     EWRR    L+ K FE+R+N +  +   MD LWE YE +     +  ++ K+      KK  +S++  K  E+ 
Subjt:  VIPNKEEDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEG

Query:  EEAEEEEIG-----KICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKK
           EEE+       ++CCLQALKFSTGKM LGI  R  L K+SKAFKG+   ++ N H KK
Subjt:  EEAEEEEIG-----KICCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKK

AT3G25130.1 unknown protein1.0e-0451.67Show/hide
Query:  NKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPS
        NK+   S L   F  F S I +HP YF YLLFFSPY+ ++L FLSPL  TT+L  L + S
Subjt:  NKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAGAAAATGGAAAAGAATAAAATGACTGAAGAATCGTGTCTCTCATTCATATTCTTCAATTTCTTTTCCTTGATTTTTTCCCACCCTCTCTACTTCTTCTACTTG
CTCTTCTTCTCCCCTTACCTTCTAAGACTCCTCTTTTTCCTCTCTCCCCTTCTCACCACCACCTCCCTCAGCTTCCTCCTCATTCCCTCCTTCCCTCTTCCTCGC
CAAGATGATCTCCATCATTATTGGCTCAACCTCGTCTTTGAAGAACCCGTTCGAGAAGTCGAAACAAAAGCTAAAGATATTACAGAAGTGATTCCGAACAAGGAA
GAAGACAGATCAGACTCTTCTTCTTCTTGTTCTTCGCGAAAAAAGCAAATTAAGCAGAGAGATGAGTGGAGGAGGAGGAGCGCGCAACTCAGCTTCAAATTCTTT
GAAGACCGCTATAATGATAATAATAATAAAGATGACGAGATGGATTTGCTTTGGGAAATGTATGAGGCCAAGGAATCAACGATGGGAGAATTAGTCGACGATAGT
AAACGAGGCGACGACAACTCGAAGAAGAAGGATCTAAGAAGTCTCGTGAACGAGAAGGAAGCGGAGGAGGGTGAGGAAGCCGAAGAAGAAGAGATTGGGAAGATT
TGTTGCTTACAAGCTTTGAAATTCTCCACAGGGAAAATGAGGCTTGGGATTGGAAAGAGGAGCGGTTTGACAAAGATTTCTAAGGCATTCAAAGGCCTTAAATTC
TTGCATCACCTCAACAACCATGGTAAGAAGAACAGGCATTCT
mRNA sequenceShow/hide mRNA sequence
AAGAAAATGGAAAAGAATAAAATGACTGAAGAATCGTGTCTCTCATTCATATTCTTCAATTTCTTTTCCTTGATTTTTTCCCACCCTCTCTACTTCTTCTACTTG
CTCTTCTTCTCCCCTTACCTTCTAAGACTCCTCTTTTTCCTCTCTCCCCTTCTCACCACCACCTCCCTCAGCTTCCTCCTCATTCCCTCCTTCCCTCTTCCTCGC
CAAGATGATCTCCATCATTATTGGCTCAACCTCGTCTTTGAAGAACCCGTTCGAGAAGTCGAAACAAAAGCTAAAGATATTACAGAAGTGATTCCGAACAAGGAA
GAAGACAGATCAGACTCTTCTTCTTCTTGTTCTTCGCGAAAAAAGCAAATTAAGCAGAGAGATGAGTGGAGGAGGAGGAGCGCGCAACTCAGCTTCAAATTCTTT
GAAGACCGCTATAATGATAATAATAATAAAGATGACGAGATGGATTTGCTTTGGGAAATGTATGAGGCCAAGGAATCAACGATGGGAGAATTAGTCGACGATAGT
AAACGAGGCGACGACAACTCGAAGAAGAAGGATCTAAGAAGTCTCGTGAACGAGAAGGAAGCGGAGGAGGGTGAGGAAGCCGAAGAAGAAGAGATTGGGAAGATT
TGTTGCTTACAAGCTTTGAAATTCTCCACAGGGAAAATGAGGCTTGGGATTGGAAAGAGGAGCGGTTTGACAAAGATTTCTAAGGCATTCAAAGGCCTTAAATTC
TTGCATCACCTCAACAACCATGGTAAGAAGAACAGGCATTCT
Protein sequenceShow/hide protein sequence
KKMEKNKMTEESCLSFIFFNFFSLIFSHPLYFFYLLFFSPYLLRLLFFLSPLLTTTSLSFLLIPSFPLPRQDDLHHYWLNLVFEEPVREVETKAKDITEVIPNKE
EDRSDSSSSCSSRKKQIKQRDEWRRRSAQLSFKFFEDRYNDNNNKDDEMDLLWEMYEAKESTMGELVDDSKRGDDNSKKKDLRSLVNEKEAEEGEEAEEEEIGKI
CCLQALKFSTGKMRLGIGKRSGLTKISKAFKGLKFLHHLNNHGKKNRHS