CuGenDBv2

Moc04g15380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g15380
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Girdin-like
Genome location: chr4:11605941..11609899
RNA-Seq Expression: Moc04g15380
Synteny: Moc04g15380
Gene Ontology terms: NA
InterPro domains: NA
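
The genome location above uses a `chromosome:start..end` notation. A minimal Python sketch of how such a string could be split into its parts follows. The function names are hypothetical (they are not part of CuGenDB), and the sketch assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Hypothetical helper, not part of CuGenDB: matches "chrom:start..end".
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (chromosome, start, end)."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(text):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(text)
    return end - start + 1


chrom, start, end = parse_location("chr4:11605941..11609899")
print(chrom, start, end, span_length("chr4:11605941..11609899"))
```

Under the inclusive-coordinate assumption, the span for this gene works out to 3959 bp.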


Homology
GenBank top hits (e value, % identity, alignment)
EOY18289.1 Uncharacterized protein TCM_042889 [Theobroma cacao] (e value: 3.8e-25; % identity: 27.45)
Query:  LLENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVS
        L +N    +  +WE  R   +  F  KYGHIA LLYV V+  +LRA+V+ WDP+Y+CF F+ +D+TPTIEEY +LL I   +  ++Y        +R ++
Subjt:  LLENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVS

Query:  LLIWV------------------RQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEV
         L+ +                   QF+P TH L   +F Y       +I+++V+ WK   R+  G   D +  GY  WH  R K V+  P +  K  +  
Subjt:  LLIWV------------------RQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEV

Query:  PTRRILSEMSPNQSTQRKV--------RRIEYEQREGNPTSAKERLQLGFNQNLPRNIELELQLARREALSSSLDKTQNAADGLMHDYAHIKEQYNQVEY
          + +L E   N+ T++++        RR E E  E    +A++ L +   +      + E       +L + + + Q+A + L H+     +   +++ 
Subjt:  PTRRILSEMSPNQSTQRKV--------RRIEYEQREGNPTSAKERLQLGFNQNLPRNIELELQLARREALSSSLDKTQNAADGLMHDYAHIKEQYNQVEY

Query:  ELDHVR
        + D ++
Subjt:  ELDHVR

KAA0060423.1 uncharacterized protein E6C27_scaffold22G002420 [Cucumis melo var. makuwa] (e value: 3.7e-20; % identity: 43.22)
Query:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
        +N L  LK +WE L P R+  F KKYGHI +L+Y+ VN+  LRA++  WDP Y CFTF S D+ PTIEEY  +L +P +E+  VY ++   T KR +S  
Subjt:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL

Query:  IWVRQFIPATHDLRNSKF
            +F+   H     K+
Subjt:  IWVRQFIPATHDLRNSKF

XP_022147190.1 uncharacterized protein LOC111016201 [Momordica charantia] (e value: 2.4e-80; % identity: 46.21)
Query:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
        ENQLD LKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALV+HWDP Y+CFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
Subjt:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------IWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEM
                   IWVRQFIPATHDLRNS+FAYD+ FCKNKIQ+VVKAWKTIVRIQSGNYHDNIFEGYE+WHSSRGKTVVLLPTDKGKGKLEVPTRRILSEM
Subjt:  -----------IWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEM

Query:  SPNQSTQRK
        SPNQSTQRK
Subjt:  SPNQSTQRK

XP_022150759.1 uncharacterized protein LOC111018820 [Momordica charantia] (e value: 8.7e-46; % identity: 65.22)
Query:  FSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLIWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKT
        +S+  A +R  D  ++    S+  +     ++H+L   PL       SY     L++     IWVRQFIPATHDLRNS+FAYDVRFCKNKIQ+VVKAWK 
Subjt:  FSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLIWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKT

Query:  IVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEMSPNQSTQRKV
        IVRIQSGNYHDNIFEGYEQWHSSRGKTVVLL TDKGKGKLEVPT  ILSEMSPNQSTQRKV
Subjt:  IVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEMSPNQSTQRKV

XP_022150759.1 uncharacterized protein LOC111018820 [Momordica charantia] (e value: 1.8e-11; % identity: 92.68)
Query:  MDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLI
        M+ITPTI+EYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL+
Subjt:  MDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLI

XP_022150759.1 uncharacterized protein LOC111018820 [Momordica charantia] (e value: 9.1e-27; % identity: 26.21)
Query:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
        +N L  LK +WE L P R+  F KKYGHIA+L+Y+ VN+  LRA++  WDP Y CFTF S D+ PTIEEY  +L +P +E+  VY ++   T KR +S  
Subjt:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL

Query:  -----------------------------------------------------------------------------------------------IWVRQ
                                                                                                       +W++Q
Subjt:  -----------------------------------------------------------------------------------------------IWVRQ

Query:  FIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLP---TDKG-KGKLEVPTRRI-----LSEMSPNQSTQ
        FIP TH+L+   F+YD   C+ K ++ V AWK+I +I+   +++ +  GYE W ++R K ++ +     ++G K   E P + I     L E +     +
Subjt:  FIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLP---TDKG-KGKLEVPTRRI-----LSEMSPNQSTQ

Query:  RKVRRIEYEQREGNPTSAKERLQ-----LGFNQNLPRNIE-LELQLARREALSSSLDKTQNAADGLMHDY-AHIKEQYNQVEYELDHVRCDNT
         +  R E  Q   + T  +  L+     L     L +++E L+ ++ R    + SL   +      M     +IK+  N  EY L+ V   NT
Subjt:  RKVRRIEYEQREGNPTSAKERLQ-----LGFNQNLPRNIE-LELQLARREALSSSLDKTQNAADGLMHDY-AHIKEQYNQVEYELDHVRCDNT

TrEMBL top hits (e value, % identity, alignment)
A0A061FU58 G-patch domain-containing protein (e value: 1.8e-25; % identity: 27.45)
Query:  LLENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVS
        L +N    +  +WE  R   +  F  KYGHIA LLYV V+  +LRA+V+ WDP+Y+CF F+ +D+TPTIEEY +LL I   +  ++Y        +R ++
Subjt:  LLENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVS

Query:  LLIWV------------------RQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEV
         L+ +                   QF+P TH L   +F Y       +I+++V+ WK   R+  G   D +  GY  WH  R K V+  P +  K  +  
Subjt:  LLIWV------------------RQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEV

Query:  PTRRILSEMSPNQSTQRKV--------RRIEYEQREGNPTSAKERLQLGFNQNLPRNIELELQLARREALSSSLDKTQNAADGLMHDYAHIKEQYNQVEY
          + +L E   N+ T++++        RR E E  E    +A++ L +   +      + E       +L + + + Q+A + L H+     +   +++ 
Subjt:  PTRRILSEMSPNQSTQRKV--------RRIEYEQREGNPTSAKERLQLGFNQNLPRNIELELQLARREALSSSLDKTQNAADGLMHDYAHIKEQYNQVEY

Query:  ELDHVR
        + D ++
Subjt:  ELDHVR

A0A5A7UWQ6 Uncharacterized protein (e value: 1.8e-20; % identity: 43.22)
Query:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
        +N L  LK +WE L P R+  F KKYGHI +L+Y+ VN+  LRA++  WDP Y CFTF S D+ PTIEEY  +L +P +E+  VY ++   T KR +S  
Subjt:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL

Query:  IWVRQFIPATHDLRNSKF
            +F+   H     K+
Subjt:  IWVRQFIPATHDLRNSKF

A0A6J1CZG4 uncharacterized protein LOC111016201 (e value: 1.2e-80; % identity: 46.21)
Query:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
        ENQLD LKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALV+HWDP Y+CFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
Subjt:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------IWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEM
                   IWVRQFIPATHDLRNS+FAYD+ FCKNKIQ+VVKAWKTIVRIQSGNYHDNIFEGYE+WHSSRGKTVVLLPTDKGKGKLEVPTRRILSEM
Subjt:  -----------IWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEM

Query:  SPNQSTQRK
        SPNQSTQRK
Subjt:  SPNQSTQRK

A0A6J1DB13 uncharacterized protein LOC111018820 (e value: 4.2e-46; % identity: 65.22)
Query:  FSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLIWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKT
        +S+  A +R  D  ++    S+  +     ++H+L   PL       SY     L++     IWVRQFIPATHDLRNS+FAYDVRFCKNKIQ+VVKAWK 
Subjt:  FSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLIWVRQFIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKT

Query:  IVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEMSPNQSTQRKV
        IVRIQSGNYHDNIFEGYEQWHSSRGKTVVLL TDKGKGKLEVPT  ILSEMSPNQSTQRKV
Subjt:  IVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEMSPNQSTQRKV

A0A6J1DB13 uncharacterized protein LOC111018820 (e value: 8.9e-12; % identity: 92.68)
Query:  MDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLI
        M+ITPTI+EYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL+
Subjt:  MDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLI

A0A6J1DB13 uncharacterized protein LOC111018820 (e value: 4.4e-27; % identity: 26.21)
Query:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL
        +N L  LK +WE L P R+  F KKYGHIA+L+Y+ VN+  LRA++  WDP Y CFTF S D+ PTIEEY  +L +P +E+  VY ++   T KR +S  
Subjt:  ENQLDILKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLL

Query:  -----------------------------------------------------------------------------------------------IWVRQ
                                                                                                       +W++Q
Subjt:  -----------------------------------------------------------------------------------------------IWVRQ

Query:  FIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLP---TDKG-KGKLEVPTRRI-----LSEMSPNQSTQ
        FIP TH+L+   F+YD   C+ K ++ V AWK+I +I+   +++ +  GYE W ++R K ++ +     ++G K   E P + I     L E +     +
Subjt:  FIPATHDLRNSKFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLP---TDKG-KGKLEVPTRRI-----LSEMSPNQSTQ

Query:  RKVRRIEYEQREGNPTSAKERLQ-----LGFNQNLPRNIE-LELQLARREALSSSLDKTQNAADGLMHDY-AHIKEQYNQVEYELDHVRCDNT
         +  R E  Q   + T  +  L+     L     L +++E L+ ++ R    + SL   +      M     +IK+  N  EY L+ V   NT
Subjt:  RKVRRIEYEQREGNPTSAKERLQ-----LGFNQNLPRNIE-LELQLARREALSSSLDKTQNAADGLMHDY-AHIKEQYNQVEYELDHVRCDNT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGTTGCCGGAGGAATCTCCGATCACGTCGCCAGAGGTCGCCCGAGGAACCGTCATTAACGTCGTTGGAGGTCGTCGGTGTCTGCTGTAGACACCGTGTATTTAAGGA
GAGCTGCAGTGGGGAGGAGGAGGCCGACACAACACTAAGAAAAATTCACTTCCTTCTTTCTCTCTACCATCACCTTCTCTTCTTCTTCACCCAGGCTGGCGACCCCGACG
ATACACAGACTGCACACGTCCCCTGGCGCTTGAAAACTCAAGTTGACCATACCGCGCAAGCGTATGGAAGATTTGGGGGCCGCTTCCTTCTAGAAAATCAGTTGGATATC
TTGAAACGTTTATGGGAAGGCTTAAGACCAGATAGAAAGACCCAGTTCATAAAAAAATATGGCCACATAGCACAGCTTCTGTACGTGCGAGTAAACTTCTCAGTGTTGAG
AGCATTAGTTCGACATTGGGATCCGACCTATAAATGCTTTACTTTTAGCTCAATGGATATAACCCCAACTATCGAGGAATACCACACCCTCTTACAAATACCACTGCAAG
AGAAAATTGAGGTTTATTCCTATGATGGTGGGTTTACGTTGAAAAGGGCAGTATCGCTGCTTATATGGGTACGTCAGTTTATCCCAGCCACACATGACTTAAGAAACTCT
AAATTTGCCTATGATGTTAGATTTTGCAAAAACAAAATTCAAAAGGTCGTAAAGGCGTGGAAGACGATTGTCAGAATCCAAAGTGGCAATTACCACGATAATATATTTGA
AGGATACGAACAGTGGCATTCGAGTAGAGGAAAAACTGTGGTTCTCTTACCAACCGACAAGGGCAAAGGGAAGCTAGAGGTTCCCACTAGGAGAATTCTTTCTGAAATGA
GCCCAAATCAATCCACTCAGCGGAAGGTTCGCAGGATAGAATACGAGCAGCGAGAAGGGAACCCCACATCAGCTAAAGAAAGACTACAACTAGGGTTTAATCAAAATCTA
CCTCGAAACATTGAGCTGGAATTACAGCTGGCAAGAAGGGAGGCGCTTTCAAGCAGTCTCGACAAAACCCAAAATGCTGCTGATGGGTTAATGCATGATTATGCCCATAT
CAAGGAACAATACAACCAAGTGGAGTATGAATTAGACCATGTGAGGTGCGATAACACATTGTTGTGTCATAATTCAGAACATGTATTCACCCAAGTCAGACAGGCAGCCT
GTATGGCAGATGGTTTAGCTGAAGAGGCACGAGCTCTCACGTCTGCCATTGCCCCTACACAGCCGAATGGCAAAAGCACACTCAAGTTCTTGGGCAAACTTCGACGAGAT
CTAGAGCATTGGGGACAGTTTTATTGA
mRNA sequence
ATGGGTTGCCGGAGGAATCTCCGATCACGTCGCCAGAGGTCGCCCGAGGAACCGTCATTAACGTCGTTGGAGGTCGTCGGTGTCTGCTGTAGACACCGTGTATTTAAGGA
GAGCTGCAGTGGGGAGGAGGAGGCCGACACAACACTAAGAAAAATTCACTTCCTTCTTTCTCTCTACCATCACCTTCTCTTCTTCTTCACCCAGGCTGGCGACCCCGACG
ATACACAGACTGCACACGTCCCCTGGCGCTTGAAAACTCAAGTTGACCATACCGCGCAAGCGTATGGAAGATTTGGGGGCCGCTTCCTTCTAGAAAATCAGTTGGATATC
TTGAAACGTTTATGGGAAGGCTTAAGACCAGATAGAAAGACCCAGTTCATAAAAAAATATGGCCACATAGCACAGCTTCTGTACGTGCGAGTAAACTTCTCAGTGTTGAG
AGCATTAGTTCGACATTGGGATCCGACCTATAAATGCTTTACTTTTAGCTCAATGGATATAACCCCAACTATCGAGGAATACCACACCCTCTTACAAATACCACTGCAAG
AGAAAATTGAGGTTTATTCCTATGATGGTGGGTTTACGTTGAAAAGGGCAGTATCGCTGCTTATATGGGTACGTCAGTTTATCCCAGCCACACATGACTTAAGAAACTCT
AAATTTGCCTATGATGTTAGATTTTGCAAAAACAAAATTCAAAAGGTCGTAAAGGCGTGGAAGACGATTGTCAGAATCCAAAGTGGCAATTACCACGATAATATATTTGA
AGGATACGAACAGTGGCATTCGAGTAGAGGAAAAACTGTGGTTCTCTTACCAACCGACAAGGGCAAAGGGAAGCTAGAGGTTCCCACTAGGAGAATTCTTTCTGAAATGA
GCCCAAATCAATCCACTCAGCGGAAGGTTCGCAGGATAGAATACGAGCAGCGAGAAGGGAACCCCACATCAGCTAAAGAAAGACTACAACTAGGGTTTAATCAAAATCTA
CCTCGAAACATTGAGCTGGAATTACAGCTGGCAAGAAGGGAGGCGCTTTCAAGCAGTCTCGACAAAACCCAAAATGCTGCTGATGGGTTAATGCATGATTATGCCCATAT
CAAGGAACAATACAACCAAGTGGAGTATGAATTAGACCATGTGAGGTGCGATAACACATTGTTGTGTCATAATTCAGAACATGTATTCACCCAAGTCAGACAGGCAGCCT
GTATGGCAGATGGTTTAGCTGAAGAGGCACGAGCTCTCACGTCTGCCATTGCCCCTACACAGCCGAATGGCAAAAGCACACTCAAGTTCTTGGGCAAACTTCGACGAGAT
CTAGAGCATTGGGGACAGTTTTATTGA
Protein sequence
MGCRRNLRSRRQRSPEEPSLTSLEVVGVCCRHRVFKESCSGEEEADTTLRKIHFLLSLYHHLLFFFTQAGDPDDTQTAHVPWRLKTQVDHTAQAYGRFGGRFLLENQLDI
LKRLWEGLRPDRKTQFIKKYGHIAQLLYVRVNFSVLRALVRHWDPTYKCFTFSSMDITPTIEEYHTLLQIPLQEKIEVYSYDGGFTLKRAVSLLIWVRQFIPATHDLRNS
KFAYDVRFCKNKIQKVVKAWKTIVRIQSGNYHDNIFEGYEQWHSSRGKTVVLLPTDKGKGKLEVPTRRILSEMSPNQSTQRKVRRIEYEQREGNPTSAKERLQLGFNQNL
PRNIELELQLARREALSSSLDKTQNAADGLMHDYAHIKEQYNQVEYELDHVRCDNTLLCHNSEHVFTQVRQAACMADGLAEEARALTSAIAPTQPNGKSTLKFLGKLRRD
LEHWGQFY
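
The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal Python sketch for checking that relationship follows. It assumes the standard nuclear genetic code, since the page does not say which translation table was used. `translate` is a hypothetical helper, and only the first 21 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, assumed here (the page does not name a table).
# Codons are enumerated in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the CDS sequence listed above.
print(translate("ATGGGTTGCCGGAGGAATCTC"))  # MGCRRNL
```

The printed residues match the start of the protein sequence listed above (MGCRRNL...). Passing the full CDS in place of the short prefix would allow the whole protein to be compared the same way.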