; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g17830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g17830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:12706966..12713054
RNA-Seq ExpressionMoc07g17830
SyntenyMoc07g17830
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]2.8e-22389.87Show/hide
Query:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK
        LG   FTSD LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAAS+AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKK
Subjt:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK

Query:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI
        TATHLATIRQKEGETLREYVTRFQ EQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKDI
Subjt:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI

Query:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK
        E  DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIS++ T IE +      K    +    G P R  +SKDKYCRFHREHGHNTSD WELK
Subjt:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK

Query:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN
        RQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCIIREQRPTC ITFD ADLEEVHLPHN
Subjt:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN

Query:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS
        DALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS

XP_022149377.1 uncharacterized protein LOC111017807 [Momordica charantia]8.0e-18679.45Show/hide
Query:  KDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKV
        +DPKDYVEVFEGLMDFQAA++AIKCRAFQIALTG ARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KTATHLATIRQKE ETLREYVTRFQ EQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKGSFSS-GRAEYRRAENGPTR
         HCSDDSAMCYFLTGLADE LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  KTD KS+DKGS SS  RAE+RR E+GP+R
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKGSFSS-GRAEYRRAENGPTR

Query:  SRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEE
        SRPYER+TPTTI IS++ T IE +      K+   +    G+P  E +SKDK CRFHR+H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEE
Subjt:  SRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEE

Query:  RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILS
        RKRSRTPPRR DRPAVINTIFGGPSGGQSG+KRKELAR ARREVCIIREQ+PTCSITFDDADLE VHLPHNDALVIAPLIDHV+V  +LVDGGASANILS
Subjt:  RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILS

Query:  LPTYLALGWTRSQLKKSPTPLVGFSENQSSQRVASTCR
        LPTYLALGWTR QLKKSPT  +   ENQS Q+ A TCR
Subjt:  LPTYLALGWTRSQLKKSPTPLVGFSENQSSQRVASTCR

XP_022150613.1 uncharacterized protein LOC111018708, partial [Momordica charantia]1.5e-20087.8Show/hide
Query:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK
        LG  +FTSD LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AAS+AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KK
Subjt:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK

Query:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI
        T THLATIRQKEG TLREYVTRFQ EQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKIGR RSGKD+
Subjt:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI

Query:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK
        E+ DPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPIS++ T IE +      K    +    G P R  +SKDKYCRFHREHGHNTSDCWELK
Subjt:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK

Query:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN
        RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTC ITFD AD EEVHLPHN
Subjt:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN

Query:  DALVIAPLIDHVVVRRVL
        DA VIAPLIDHVVVRRVL
Subjt:  DALVIAPLIDHVVVRRVL

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.2e-19565.76Show/hide
Query:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK
        LG   FTSD LE        APTVK YDGSKDPKDYVEVFEGLMDFQAAS+AIKCRAFQIALTGSARLW                               
Subjt:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK

Query:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI
                              FQ +QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I R RSGKD 
Subjt:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI

Query:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK
        EK D KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPIS++ T IE +      K    +    G P R  ++KDKYCRFHREH HNTSD WELK
Subjt:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK

Query:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN
        RQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTC ITFD ADLEEVHLPHN
Subjt:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN

Query:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSE----------------------NQSSQRVASTCRSRW---AGPNS
        DALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS                        Q ++ V    RS +    G   
Subjt:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSE----------------------NQSSQRVASTCRSRW---AGPNS

Query:  GH-----PNGRVRVLKYSTPNGVGTVRGEQTASRECYASTLKGTSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVGPVEILDNPSISEPDLMEIG
         H     P+   +VLKYSTPNGVG VRGEQ ASRECYAS LKG+SVCALETL SRDGTLEF+A+LPRREFAAPTEELELV  +    N +I     ++  
Subjt:  GH-----PNGRVRVLKYSTPNGVGTVRGEQTASRECYASTLKGTSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVGPVEILDNPSISEPDLMEIG

Query:  APESSWMDPIADFIRGNSPQDP
          E S ++ I D I      +P
Subjt:  APESSWMDPIADFIRGNSPQDP

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]9.1e-21461.57Show/hide
Query:  MVQPANSTNTADQRTLAASDAHQREVGAAVVEGQGHGGPATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPAKVTRVDIREQRGSHL
        MVQPANSTNTAD+R LAA+  HQREVGA VVEGQGH    TEPL RSARIT PVLPPAHP+ SKA         + +  P   P  +TR +         
Subjt:  MVQPANSTNTADQRTLAASDAHQREVGAAVVEGQGHGGPATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPAKVTRVDIREQRGSHL

Query:  GPAEEEHPEDNESEGHTRQRGDLREHLNRKRDSSLRKGQSPSCSHRSSNQQAESSQRRSTERWQLGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDY
                                + L  K D+ + +     C  + S          S +   LG ++F+SD LEA IPPKFK PT+KPYDGSKDPKDY
Subjt:  GPAEEEHPEDNESEGHTRQRGDLREHLNRKRDSSLRKGQSPSCSHRSSNQQAESSQRRSTERWQLGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDY

Query:  VEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDD
        VEVFE LMDFQAA++AIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF  EQLKVAHCSDD
Subjt:  VEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDD

Query:  SAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYER
        SAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I + R+GKD  K D KS+DKG S SS R +YRR+ +   +SRPYE 
Subjt:  SAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYER

Query:  FTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT
        +TPTTIPI ++ T IE T      K    +    G+P  E ++ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RT
Subjt:  FTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT

Query:  PPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRR DRPAVIN             K+KELAR ARREVCIIREQRPT SI F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFS----------------------ENQSSQRVASTCRSRW-------------AGPNSGHPNGRVRVLKYSTPNGVGTVRGEQ
        LGWTRSQLKKSPTPLVGFS                        Q ++ V    RS +             A P++ H     +VLKYST NGVGTVRGE 
Subjt:  LGWTRSQLKKSPTPLVGFS----------------------ENQSSQRVASTCRSRW-------------AGPNSGHPNGRVRVLKYSTPNGVGTVRGEQ

Query:  TASRECYASTLKGTSVCALETLTSRD
          SRECYAS  K +SVCALE  T RD
Subjt:  TASRECYASTLKGTSVCALETLTSRD

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088131.4e-22389.87Show/hide
Query:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK
        LG   FTSD LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAAS+AIKCRAF+IALTGSARLWYRRLPA SISTYSQLRREFLA FSSRHYDKK
Subjt:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK

Query:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI
        TATHLATIRQKEGETLREYVTRFQ EQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGR RSGKDI
Subjt:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI

Query:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK
        E  DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIS++ T IE +      K    +    G P R  +SKDKYCRFHREHGHNTSD WELK
Subjt:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK

Query:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN
        RQIE+LIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCIIREQRPTC ITFD ADLEEVHLPHN
Subjt:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN

Query:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS
        DALVIAPLIDHVVV RVLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFS
Subjt:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFS

A0A6J1D7S8 uncharacterized protein LOC1110178073.9e-18679.45Show/hide
Query:  KDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKV
        +DPKDYVEVFEGLMDFQAA++AIKCRAFQIALTG ARLWYRRLPARSISTYSQLR+EF++QF SRHYD+KTATHLATIRQKE ETLREYVTRFQ EQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKGSFSS-GRAEYRRAENGPTR
         HCSDDSAMCYFLTGLADE LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++  KTD KS+DKGS SS  RAE+RR E+GP+R
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKGSFSS-GRAEYRRAENGPTR

Query:  SRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEE
        SRPYER+TPTTI IS++ T IE +      K+   +    G+P  E +SKDK CRFHR+H HNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEE
Subjt:  SRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEE

Query:  RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILS
        RKRSRTPPRR DRPAVINTIFGGPSGGQSG+KRKELAR ARREVCIIREQ+PTCSITFDDADLE VHLPHNDALVIAPLIDHV+V  +LVDGGASANILS
Subjt:  RKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILS

Query:  LPTYLALGWTRSQLKKSPTPLVGFSENQSSQRVASTCR
        LPTYLALGWTR QLKKSPT  +   ENQS Q+ A TCR
Subjt:  LPTYLALGWTRSQLKKSPTPLVGFSENQSSQRVASTCR

A0A6J1D9E1 uncharacterized protein LOC1110188233.5e-19565.76Show/hide
Query:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK
        LG   FTSD LE        APTVK YDGSKDPKDYVEVFEGLMDFQAAS+AIKCRAFQIALTGSARLW                               
Subjt:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK

Query:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI
                              FQ +QLKVA  SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPER I R RSGKD 
Subjt:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI

Query:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK
        EK D KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPIS++ T IE +      K    +    G P R  ++KDKYCRFHREH HNTSD WELK
Subjt:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK

Query:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN
        RQIEDLIQD YFKKFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTC ITFD ADLEEVHLPHN
Subjt:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN

Query:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSE----------------------NQSSQRVASTCRSRW---AGPNS
        DALVIAPLIDHVVVRRVLVD G SANI+SL TYLALGWTRSQLKKS TPLVGFS                        Q ++ V    RS +    G   
Subjt:  DALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSE----------------------NQSSQRVASTCRSRW---AGPNS

Query:  GH-----PNGRVRVLKYSTPNGVGTVRGEQTASRECYASTLKGTSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVGPVEILDNPSISEPDLMEIG
         H     P+   +VLKYSTPNGVG VRGEQ ASRECYAS LKG+SVCALETL SRDGTLEF+A+LPRREFAAPTEELELV  +    N +I     ++  
Subjt:  GH-----PNGRVRVLKYSTPNGVGTVRGEQTASRECYASTLKGTSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVGPVEILDNPSISEPDLMEIG

Query:  APESSWMDPIADFIRGNSPQDP
          E S ++ I D I      +P
Subjt:  APESSWMDPIADFIRGNSPQDP

A0A6J1D9W7 uncharacterized protein LOC1110187087.3e-20187.8Show/hide
Query:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK
        LG  +FTSD LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF AAS+AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KK
Subjt:  LGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKK

Query:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI
        T THLATIRQKEG TLREYVTRFQ EQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAKKVIDGQELLRTKTGRP+RKIGR RSGKD+
Subjt:  TATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDI

Query:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK
        E+ DPKSKDKGSFSSGRAEYRRAE+GPT+SRPYERFTPTTIPIS++ T IE +      K    +    G P R  +SKDKYCRFHREHGHNTSDCWELK
Subjt:  EKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELK

Query:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN
        RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQ PTC ITFD AD EEVHLPHN
Subjt:  RQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHN

Query:  DALVIAPLIDHVVVRRVL
        DA VIAPLIDHVVVRRVL
Subjt:  DALVIAPLIDHVVVRRVL

A0A6J1DHB3 uncharacterized protein LOC1110204794.4e-21461.57Show/hide
Query:  MVQPANSTNTADQRTLAASDAHQREVGAAVVEGQGHGGPATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPAKVTRVDIREQRGSHL
        MVQPANSTNTAD+R LAA+  HQREVGA VVEGQGH    TEPL RSARIT PVLPPAHP+ SKA         + +  P   P  +TR +         
Subjt:  MVQPANSTNTADQRTLAASDAHQREVGAAVVEGQGHGGPATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPAKVTRVDIREQRGSHL

Query:  GPAEEEHPEDNESEGHTRQRGDLREHLNRKRDSSLRKGQSPSCSHRSSNQQAESSQRRSTERWQLGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDY
                                + L  K D+ + +     C  + S          S +   LG ++F+SD LEA IPPKFK PT+KPYDGSKDPKDY
Subjt:  GPAEEEHPEDNESEGHTRQRGDLREHLNRKRDSSLRKGQSPSCSHRSSNQQAESSQRRSTERWQLGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDY

Query:  VEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDD
        VEVFE LMDFQAA++AIKC AFQIALTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF  EQLKVAHCSDD
Subjt:  VEVFEGLMDFQAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDD

Query:  SAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYER
        SAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I + R+GKD  K D KS+DKG S SS R +YRR+ +   +SRPYE 
Subjt:  SAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKG-SFSSGRAEYRRAENGPTRSRPYER

Query:  FTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT
        +TPTTIPI ++ T IE T      K    +    G+P  E ++ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKR RT
Subjt:  FTPTTIPISDLTTEIERTSRSLEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRT

Query:  PPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA
        PPRR DRPAVIN             K+KELAR ARREVCIIREQRPT SI F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASANILSL TYLA
Subjt:  PPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLA

Query:  LGWTRSQLKKSPTPLVGFS----------------------ENQSSQRVASTCRSRW-------------AGPNSGHPNGRVRVLKYSTPNGVGTVRGEQ
        LGWTRSQLKKSPTPLVGFS                        Q ++ V    RS +             A P++ H     +VLKYST NGVGTVRGE 
Subjt:  LGWTRSQLKKSPTPLVGFS----------------------ENQSSQRVASTCRSRW-------------AGPNSGHPNGRVRVLKYSTPNGVGTVRGEQ

Query:  TASRECYASTLKGTSVCALETLTSRD
          SRECYAS  K +SVCALE  T RD
Subjt:  TASRECYASTLKGTSVCALETLTSRD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCGGATCAGAGGACTCTAGCCGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCGGCGGTGGTAGAGGGGCAAGGC
CACGGCGGCCCGGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCTCCTGTTCTACCGCCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAGCGAAAGTGACGCGCGTTGACATACGGGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAG
GAACATCCCGAGGACAACGAGAGCGAGGGGCACACTCGCCAGAGAGGAGACCTTCGTGAGCACCTCAACAGAAAGAGAGACTCATCCCTCCGGAAAGGACAGTCA
CCATCCTGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCAAAGAAGGTCCACTGAACGATGGCAACTTGGGAGAATCGCCTTCACCTCGGACGCTTTG
GAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGACTTC
CAAGCGGCATCAGAAGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTAC
TCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTG
CGAGAATATGTCACTAGATTCCAGGTGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTC
ACGGTAAAACTCGGAGAGGAGGCCCCGGCAACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGGCCGA
CCAGAACGAAAAATCGGTCGAGACAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGGGCTGAGTATCGG
AGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCAGATCTAACGACCGAGATCGAACGAACATCGAGGAGT
CTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGAGACGCAGAGCAAGGACAAATATTGCCGCTTCCATCGGGAGCACGGCCAT
AACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAGGATGGCTACTTCAAGAAATTTGTGGGAAAACCCAGGACCAGCTCGGCAGAAAAA
AAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAA
AGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGATGCAGACTTGGAGGAGGTTCAC
CTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTGAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCG
ACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGC
CGGTCACGCTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACC
GCTTCAAGGGAGTGCTATGCCTCCACACTCAAAGGTACATCGGTCTGCGCCCTTGAAACTCTCACCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCG
AGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTCGGTCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCT
CCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTG
GTCCGAGGTGGAGTGTTGTACCGACGCGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGAGGCGCGAGGTGACATTGGCTTCAAC
TTGAAGAAAAAGCAAAGAAGGAAGAAGGCAAACAAAGTAAAAACAAATGAGAAGCTCTTATTGAATGAAAAGCAGAGCCAAGGCTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCGGATCAGAGGACTCTAGCCGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCGGCGGTGGTAGAGGGGCAAGGC
CACGGCGGCCCGGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCTCCTGTTCTACCGCCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGCCGAGGT
GGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAGCGAAAGTGACGCGCGTTGACATACGGGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAG
GAACATCCCGAGGACAACGAGAGCGAGGGGCACACTCGCCAGAGAGGAGACCTTCGTGAGCACCTCAACAGAAAGAGAGACTCATCCCTCCGGAAAGGACAGTCA
CCATCCTGCTCACACCGGAGCTCCAACCAGCAGGCTGAATCCTCTCAAAGAAGGTCCACTGAACGATGGCAACTTGGGAGAATCGCCTTCACCTCGGACGCTTTG
GAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGACTTC
CAAGCGGCATCAGAAGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTAC
TCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTG
CGAGAATATGTCACTAGATTCCAGGTGGAGCAGTTGAAGGTCGCACACTGCTCCGACGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTC
ACGGTAAAACTCGGAGAGGAGGCCCCGGCAACCTTCGCCGAGGTGCTTCAGAAAGCGAAGAAAGTCATCGATGGACAGGAGCTCCTTCGAACCAAAACCGGCCGA
CCAGAACGAAAAATCGGTCGAGACAGAAGTGGAAAAGATATAGAAAAGACAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGGGCTGAGTATCGG
AGGGCGGAGAACGGACCTACCAGGAGCCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCAGATCTAACGACCGAGATCGAACGAACATCGAGGAGT
CTGGAATGGAAAAACTACTCAAACGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGAGACGCAGAGCAAGGACAAATATTGCCGCTTCCATCGGGAGCACGGCCAT
AACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAAGATCTAATTCAGGATGGCTACTTCAAGAAATTTGTGGGAAAACCCAGGACCAGCTCGGCAGAAAAA
AAGGAAGAGCGAAAGCGTTCGAGGACGCCGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAA
AGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAGGTGTGTATCATCAGGGAGCAGAGGCCGACCTGCTCAATCACCTTCGACGATGCAGACTTGGAGGAGGTTCAC
CTGCCCCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTGAGGAGGGTGCTGGTAGACGGAGGCGCATCTGCTAACATCCTGTCCTTACCG
ACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGACACCGCTGGTTGGGTTCTCGGAGAATCAGTCGTCCCAGAGGGTTGCATCGACTTGC
CGGTCACGCTGGGCAGGACCAAACTCGGGTCACCCAAATGGCCGAGTTCGTGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCCGAGGAGAACAGACC
GCTTCAAGGGAGTGCTATGCCTCCACACTCAAAGGTACATCGGTCTGCGCCCTTGAAACTCTCACCAGTAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCG
AGGAGGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTCGGTCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCT
CCAGAGTCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTG
GTCCGAGGTGGAGTGTTGTACCGACGCGGCTTTTCCCTGCCTTTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGAGGCGCGAGGTGACATTGGCTTCAAC
TTGAAGAAAAAGCAAAGAAGGAAGAAGGCAAACAAAGTAAAAACAAATGAGAAGCTCTTATTGAATGAAAAGCAGAGCCAAGGCTTATAG
Protein sequenceShow/hide protein sequence
MVQPANSTNTADQRTLAASDAHQREVGAAVVEGQGHGGPATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPAKVTRVDIREQRGSHLGPAEE
EHPEDNESEGHTRQRGDLREHLNRKRDSSLRKGQSPSCSHRSSNQQAESSQRRSTERWQLGRIAFTSDALEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDF
QAASEAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQVEQLKVAHCSDDSAMCYFLTGLADEAL
TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRDRSGKDIEKTDPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISDLTTEIERTSRS
LEWKNYSNVLRSFGEPRRETQSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHK
RKELARAARREVCIIREQRPTCSITFDDADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSENQSSQRVASTC
RSRWAGPNSGHPNGRVRVLKYSTPNGVGTVRGEQTASRECYASTLKGTSVCALETLTSRDGTLEFEADLPRREFAAPTEELELVGPVEILDNPSISEPDLMEIGA
PESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGGVLYRRGFSLPLLRCLTPEEGLYEARGDIGFNLKKKQRRKKANKVKTNEKLLLNEKQSQGL