CuGenDBv2

Moc11g18940 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g18940
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr11:14437095..14443529
RNA-Seq Expression: Moc11g18940
Synteny: Moc11g18940
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia] (e value: 2.8e-76; % identity: 38)
Query:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------
        R E D LR Q   ++E LK +C   + P    +  E PF+  +L+APIPP+FKAP +  YDG                                      
Subjt:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------

Query:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----
                     Y QLRR F+  FS+R   K   +HL T++Q + E+L EY+ RF +E +KV  C+DD AM YF T L D  LT++ G   PA+     
Subjt:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----

Query:  -----------LNHDTSGRPAVDF-RG----EVHHGDVLLLSRRA-DDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRL
                   L    +GRP     RG    ++ + D     + +    ++  RR E  P+  R   ++RFTP    I+EI    +++ ++ L   P++L
Subjt:  -----------LNHDTSGRPAVDF-RG----EVHHGDVLLLSRRA-DDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRL

Query:  RQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREK----
        R  P +R K  YCRFH++HGH+TS  + LK Q+E+LI+ GY KK+VG+   + AE   ++E+R +S  P R+ DRPA+INTI GGPSGGQSGR+ K    
Subjt:  RQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREK----

Query:  -----LWLGRQHKS-----------EGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLP
             + + R+ +            E VH+PH DALVIAPLID V V RV +DGG S NIL   TY  LGW R  LK SPTPLVGF+GESV  EG++ LP
Subjt:  -----LWLGRQHKS-----------EGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLP

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia] (e value: 2.4e-96; % identity: 69.12)
Query:  SYDGYQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNHDTS-GRPAVD-
        S D YQQLRRLFINQFSARQLLKLPPSHL TVKQ DNESLTEYIAR +DEHVKVVSCTDDIAMMYFTT LNDRNLTIEFGSR PASLN   +  R  +D 
Subjt:  SYDGYQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNHDTS-GRPAVD-

Query:  ------------FRG----------EVHHGDVLLLSRRADDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKR
                     RG          +  H D    SR+A D++SRG+ DE+  SDR G KFD+FTPLNAS+AEIYA  ++TD+KALF APK+L +P GKR
Subjt:  ------------FRG----------EVHHGDVLLLSRRADDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKR

Query:  DKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSG
        DKRLYCRFHKDHGH++SR FHLKEQV+DLIRRGYLKKYVG RE A+ EGS REEKR +S PP RKEDRPA+INTIHGGPSG +SG
Subjt:  DKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSG

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia] (e value: 9.8e-74; % identity: 32.19)
Query:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG------YQQLRRLFINQFSARQLLKLPPSHLR---TVK
        R E D LR +   ++E LK +C   + P    +  E PF+  +L+        AP + SYDG      Y ++    ++  +A   +K     +    + +
Subjt:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG------YQQLRRLFINQFSARQLLKLPPSHLR---TVK

Query:  QWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS----------------LNHDTSGRP--AVDFRGEVHHGDVLLLSRRA
         W           F ++ +KV   +DD AM YF T L D  LT++ G   PA+                L    +GRP   +D RG     +   L  + 
Subjt:  QWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS----------------LNHDTSGRP--AVDFRGEVHHGDVLLLSRRA

Query:  DDNKSRGRRD--EKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKK
          + S GR +         R   ++RFTP    I+EI    +++ ++ L   P++LR  P +R+K  YCRFH++H H+TS  + LK Q+EDLI+  Y KK
Subjt:  DDNKSRGRRD--EKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKK

Query:  YVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFS
        +VG+   + AE   ++E+R  S  P R+ DRPA+INTI GGPSGGQSG + K         E      ++  +I                          
Subjt:  YVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFS

Query:  TYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRR
                                                                                     +E RP    T             
Subjt:  TYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRR

Query:  EKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVA
                   E VH+PH DALVIAPLID V VRRV +D G S NI+   TY  LGW R  LK S TPLVGF+ ESV  EGC+ LPVT+     QVT++A
Subjt:  EKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVA

Query:  EFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKG-----IATYAAVMDAAKPCADEPDQSRGTPAEELELIP
        EFVVID   AYNAI GRP+IH  R + ST HQVLKY T  G+  VRGEQ  SRECYA+A+KG     + T  +     +  A+ P +    P EELEL+P
Subjt:  EFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKG-----IATYAAVMDAAKPCADEPDQSRGTPAEELELIP

Query:  LL
        LL
Subjt:  LL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia] (e value: 2.2e-86; % identity: 33.81)
Query:  RSEVDLLRDQFQREIEDLKRRC-RPVDPHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------
        R E D L+ +F  ++E LK RC +        +  E  FS  IL+A IPP+FK P M  YDG                                      
Subjt:  RSEVDLLRDQFQREIEDLKRRC-RPVDPHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------

Query:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----
                     Y QLR+ FI+QFS+R   +  P+HL T++Q + E+L EY+ RF +E +KV  C+DD AM YF T L D  LT++     PA+     
Subjt:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----

Query:  -----------LNHDTSGRPAVDF---RGEVHHGDVLLLSRRADDNKSRGRRDEKA--PSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLR
                   L    +GRP  +    R     G     SR    + S  R D +    S  +   ++ +TP    I EI    ++T ++ L   P++LR
Subjt:  -----------LNHDTSGRPAVDF---RGEVHHGDVLLLSRRADDNKSRGRRDEKA--PSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLR

Query:  QPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGR
          P KR+   YCRFH+DHGH+TS  + LK Q+EDLI+ GY KK+VG+  +   E    E KR ++PP  R++DRPA+IN              +K  L R
Subjt:  QPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGR

Query:  QHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREA
        + + E               + +++ +R                              PT  + F    +                              
Subjt:  QHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREA

Query:  AEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGW
                                                              EGVH+PH DALVIAPLIDLV VRR+ +DGGAS NIL  STY  LGW
Subjt:  AEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGW

Query:  ERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYA
         R  LK SPTPLVGF+GES+S EGC+ LPV+I + D QVT++AEFVVID   AYNAI GRP+IH  R V ST HQVLKY T  G+ TVRGE KTSRECYA
Subjt:  ERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYA

Query:  TAMK
        +  K
Subjt:  TAMK

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia] (e value: 3.6e-108; % identity: 44.59)
Query:  IDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNH----------------DTSGRPAVDFRGEVHHG--------DVLLLSRRADDNKSRGR
        +DEHVKVVSCTDDIAMMYFTT LNDRNLTIEF SR PASLN                     R +   R   H          D    SRRADD+KSR R
Subjt:  IDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNH----------------DTSGRPAVDFRGEVHHG--------DVLLLSRRADDNKSRGR

Query:  RDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEA
        RDE+  S+RRG KFD+FTPLNASIAEIYA  +DTD++ LFA+P++LR+P GKR+KRLYCRFHKDHGHDTSR FHLKEQVEDLIR GYLKKYVG RE AE 
Subjt:  RDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEA

Query:  EGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERK
        EGSAREEKR +S PPR KEDRPA+INTIHGGPSG +SG++ K  L R+                                                    
Subjt:  EGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERK

Query:  HLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHK
                        V+ E   S P       +W  +                                 P + +                     +  
Subjt:  HLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHK

Query:  SEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYA
         E VHMPH DALVIAPLID VKVRRV +DGGAS NI  FSTYT LGWER+HLK   T LVGFA ESVSTEGC+SLPVTISEG+ QVT+VAEFVVIDRS A
Subjt:  SEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYA

Query:  YNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKGIATYAAVMDAAKPCADEPDQSRGTPAEELELIPLLGSEKQVSVGSRLG
        Y             +VS                        S+ C  T  +G A              +P  +    +   EL+PLLG ++QVS+GSRL 
Subjt:  YNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKGIATYAAVMDAAKPCADEPDQSRGTPAEELELIPLLGSEKQVSVGSRLG

Query:  A----EVPRSENSNADLLA
        A    E+ R   SN+D+ A
Subjt:  A----EVPRSENSNADLLA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C7X5 uncharacterized protein LOC111008813 (e value: 1.3e-76; % identity: 38)
Query:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------
        R E D LR Q   ++E LK +C   + P    +  E PF+  +L+APIPP+FKAP +  YDG                                      
Subjt:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------

Query:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----
                     Y QLRR F+  FS+R   K   +HL T++Q + E+L EY+ RF +E +KV  C+DD AM YF T L D  LT++ G   PA+     
Subjt:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----

Query:  -----------LNHDTSGRPAVDF-RG----EVHHGDVLLLSRRA-DDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRL
                   L    +GRP     RG    ++ + D     + +    ++  RR E  P+  R   ++RFTP    I+EI    +++ ++ L   P++L
Subjt:  -----------LNHDTSGRPAVDF-RG----EVHHGDVLLLSRRA-DDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRL

Query:  RQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREK----
        R  P +R K  YCRFH++HGH+TS  + LK Q+E+LI+ GY KK+VG+   + AE   ++E+R +S  P R+ DRPA+INTI GGPSGGQSGR+ K    
Subjt:  RQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREK----

Query:  -----LWLGRQHKS-----------EGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLP
             + + R+ +            E VH+PH DALVIAPLID V V RV +DGG S NIL   TY  LGW R  LK SPTPLVGF+GESV  EG++ LP
Subjt:  -----LWLGRQHKS-----------EGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLP

A0A6J1D5T3 uncharacterized protein LOC111017548 (e value: 1.2e-96; % identity: 69.12)
Query:  SYDGYQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNHDTS-GRPAVD-
        S D YQQLRRLFINQFSARQLLKLPPSHL TVKQ DNESLTEYIAR +DEHVKVVSCTDDIAMMYFTT LNDRNLTIEFGSR PASLN   +  R  +D 
Subjt:  SYDGYQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNHDTS-GRPAVD-

Query:  ------------FRG----------EVHHGDVLLLSRRADDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKR
                     RG          +  H D    SR+A D++SRG+ DE+  SDR G KFD+FTPLNAS+AEIYA  ++TD+KALF APK+L +P GKR
Subjt:  ------------FRG----------EVHHGDVLLLSRRADDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKR

Query:  DKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSG
        DKRLYCRFHKDHGH++SR FHLKEQV+DLIRRGYLKKYVG RE A+ EGS REEKR +S PP RKEDRPA+INTIHGGPSG +SG
Subjt:  DKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSG

A0A6J1D9E1 uncharacterized protein LOC111018823 (e value: 4.7e-74; % identity: 32.19)
Query:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG------YQQLRRLFINQFSARQLLKLPPSHLR---TVK
        R E D LR +   ++E LK +C   + P    +  E PF+  +L+        AP + SYDG      Y ++    ++  +A   +K     +    + +
Subjt:  RSEVDLLRDQFQREIEDLKRRCRPVD-PHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG------YQQLRRLFINQFSARQLLKLPPSHLR---TVK

Query:  QWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS----------------LNHDTSGRP--AVDFRGEVHHGDVLLLSRRA
         W           F ++ +KV   +DD AM YF T L D  LT++ G   PA+                L    +GRP   +D RG     +   L  + 
Subjt:  QWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS----------------LNHDTSGRP--AVDFRGEVHHGDVLLLSRRA

Query:  DDNKSRGRRD--EKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKK
          + S GR +         R   ++RFTP    I+EI    +++ ++ L   P++LR  P +R+K  YCRFH++H H+TS  + LK Q+EDLI+  Y KK
Subjt:  DDNKSRGRRD--EKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKK

Query:  YVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFS
        +VG+   + AE   ++E+R  S  P R+ DRPA+INTI GGPSGGQSG + K         E      ++  +I                          
Subjt:  YVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFS

Query:  TYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRR
                                                                                     +E RP    T             
Subjt:  TYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRR

Query:  EKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVA
                   E VH+PH DALVIAPLID V VRRV +D G S NI+   TY  LGW R  LK S TPLVGF+ ESV  EGC+ LPVT+     QVT++A
Subjt:  EKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVA

Query:  EFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKG-----IATYAAVMDAAKPCADEPDQSRGTPAEELELIP
        EFVVID   AYNAI GRP+IH  R + ST HQVLKY T  G+  VRGEQ  SRECYA+A+KG     + T  +     +  A+ P +    P EELEL+P
Subjt:  EFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKG-----IATYAAVMDAAKPCADEPDQSRGTPAEELELIP

Query:  LL
        LL
Subjt:  LL

A0A6J1DHB3 uncharacterized protein LOC111020479 (e value: 1.1e-86; % identity: 33.81)
Query:  RSEVDLLRDQFQREIEDLKRRC-RPVDPHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------
        R E D L+ +F  ++E LK RC +        +  E  FS  IL+A IPP+FK P M  YDG                                      
Subjt:  RSEVDLLRDQFQREIEDLKRRC-RPVDPHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDG--------------------------------------

Query:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----
                     Y QLR+ FI+QFS+R   +  P+HL T++Q + E+L EY+ RF +E +KV  C+DD AM YF T L D  LT++     PA+     
Subjt:  -------------YQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPAS-----

Query:  -----------LNHDTSGRPAVDF---RGEVHHGDVLLLSRRADDNKSRGRRDEKA--PSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLR
                   L    +GRP  +    R     G     SR    + S  R D +    S  +   ++ +TP    I EI    ++T ++ L   P++LR
Subjt:  -----------LNHDTSGRPAVDF---RGEVHHGDVLLLSRRADDNKSRGRRDEKA--PSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLR

Query:  QPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGR
          P KR+   YCRFH+DHGH+TS  + LK Q+EDLI+ GY KK+VG+  +   E    E KR ++PP  R++DRPA+IN              +K  L R
Subjt:  QPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGR

Query:  QHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREA
        + + E               + +++ +R                              PT  + F    +                              
Subjt:  QHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREA

Query:  AEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGW
                                                              EGVH+PH DALVIAPLIDLV VRR+ +DGGAS NIL  STY  LGW
Subjt:  AEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGW

Query:  ERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYA
         R  LK SPTPLVGF+GES+S EGC+ LPV+I + D QVT++AEFVVID   AYNAI GRP+IH  R V ST HQVLKY T  G+ TVRGE KTSRECYA
Subjt:  ERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYA

Query:  TAMK
        +  K
Subjt:  TAMK

A0A6J1E0L8 uncharacterized protein LOC111025310 (e value: 1.7e-108; % identity: 44.59)
Query:  IDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNH----------------DTSGRPAVDFRGEVHHG--------DVLLLSRRADDNKSRGR
        +DEHVKVVSCTDDIAMMYFTT LNDRNLTIEF SR PASLN                     R +   R   H          D    SRRADD+KSR R
Subjt:  IDEHVKVVSCTDDIAMMYFTTCLNDRNLTIEFGSRLPASLNH----------------DTSGRPAVDFRGEVHHG--------DVLLLSRRADDNKSRGR

Query:  RDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEA
        RDE+  S+RRG KFD+FTPLNASIAEIYA  +DTD++ LFA+P++LR+P GKR+KRLYCRFHKDHGHDTSR FHLKEQVEDLIR GYLKKYVG RE AE 
Subjt:  RDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGKRDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEA

Query:  EGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERK
        EGSAREEKR +S PPR KEDRPA+INTIHGGPSG +SG++ K  L R+                                                    
Subjt:  EGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERK

Query:  HLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHK
                        V+ E   S P       +W  +                                 P + +                     +  
Subjt:  HLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHK

Query:  SEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYA
         E VHMPH DALVIAPLID VKVRRV +DGGAS NI  FSTYT LGWER+HLK   T LVGFA ESVSTEGC+SLPVTISEG+ QVT+VAEFVVIDRS A
Subjt:  SEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGCVSLPVTISEGDQQVTKVAEFVVIDRSYA

Query:  YNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKGIATYAAVMDAAKPCADEPDQSRGTPAEELELIPLLGSEKQVSVGSRLG
        Y             +VS                        S+ C  T  +G A              +P  +    +   EL+PLLG ++QVS+GSRL 
Subjt:  YNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKGIATYAAVMDAAKPCADEPDQSRGTPAEELELIPLLGSEKQVSVGSRLG

Query:  A----EVPRSENSNADLLA
        A    E+ R   SN+D+ A
Subjt:  A----EVPRSENSNADLLA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTGCCACTTCCCGTGCCCATCCTCGACTCGCGTACTCTCAGGTGGCCGGGACTCCTGTCATCGAACGACGACCCCAGGCGGGGTGGTCAAGGAAAATGGAGG
TCGGCCGGCCACATCCGATCCCATAGCAATCCGGGACTTCCACCTCGCCTCAGATCAGTTTCCGCCACTGCAGCCTCAGAGGAACGGGTTGCCGCCCCGTGCAAC
TCGCCTTCGCGGCTGGGGGAACACAGTAAAGGCAAGGCTCGAAGCGGTCGAGAGAGGCAGCGAGGTGTCCGGCTCTTCCGTCTCCAGGGATCCCATTCGAGGAAA
AGGGTTTATGCATCCGACCCAAATAACGGAGTATCAGTTCTGACCTCCCAAGGATGCCCGAGCTGGAGCACCCTCGCGAAGGCCACAACGGGTGGGGACAGGCGA
TGCCTTGGGAGCACAGGCCGAGGTGAGGGGAGGCCAAGGCGACGAGTGGTGGCGCCCGGAGATCGGGAGTACCTAGTCGGCGACGAGGAGGGTAGCCCAGAGGTC
GACGATCGAGAGAGGTCCTCCCACGGTAACCATTCATTCCGGTCTGAAGTGGACCTCCTCCGGGATCAGTTTCAGAGGGAGATAGAAGATCTCAAGCGGCGGTGC
AGGCCTGTAGATCCGCACCGCGTGAACGAGCAAGAGGAACCGCCTTTCTCCCAAGCAATCTTGGACGCACCTATCCCACCGAGGTTCAAGGCTCCGATCATGAGT
TCCTACGACGGTTATCAACAGCTGAGGAGGTTGTTCATCAACCAGTTCTCAGCTCGGCAGTTGTTGAAATTGCCGCCTTCTCACCTCAGAACAGTGAAGCAATGG
GACAATGAGTCTCTGACAGAATACATCGCTCGGTTCATAGACGAGCATGTCAAAGTGGTAAGTTGCACCGATGACATCGCCATGATGTACTTCACCACGTGTTTG
AACGACAGGAACTTGACAATAGAGTTCGGAAGTCGACTGCCGGCCTCCCTGAACCACGACACTTCAGGGAGGCCGGCAGTCGACTTCCGAGGTGAAGTACATCAT
GGCGATGTACTCTTGTTGTCTCGGCGGGCCGACGACAACAAGAGTAGAGGCCGTCGTGACGAGAAAGCTCCTTCAGACCGTCGAGGGTCGAAGTTCGACAGGTTC
ACCCCGCTGAACGCCTCAATCGCGGAGATCTACGCAGCGGCCAAAGACACCGACCTGAAGGCGCTTTTCGCAGCCCCAAAGAGGCTCCGTCAACCTCCAGGGAAG
CGAGACAAGCGACTCTACTGCCGATTCCACAAGGATCACGGCCACGACACTTCACGCGATTTCCACCTGAAGGAGCAGGTTGAGGATCTGATCCGGCGGGGTTAT
TTGAAAAAGTATGTCGGCAGGCGTGAAGCGGCAGAGGCAGAGGGGTCGGCTCGGGAGGAGAAGCGAGCGAAGTCACCACCGCCGAGGCGGAAGGAAGATCGCCCC
GCCATTATAAATACCATCCATGGGGGCCCGAGTGGGGGACAGTCGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACAAGAGTGAAGGAGTGCACATGCCC
CATAAGGACGCGCTAGTGATCGCCCCACTCATAGATCTTGTGAAGGTGAGAAGAGTTCCTATCGACGGTGGAGCGTCGACCAATATCTTGTTATTCTCGACCTAC
ACGACTCTGGGGTGGGAGAGGAAGCACTTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTCGCAGGGGAGTCAGTCAGCACGGAAGGATATGTCTCGCTCCCTGAG
CAGGTTGAAGATCTGATCCGGTGGGGTTATTTGAAAAAGTATGTCGGCAGGCGTGAAGCGGCAGAGCCAGAGGGGTCAGCTCGGGAGGAGAAGCGAGCGAAGTCA
CCACCGCCGAGGCGGAAGGAAGATCGCCCCGCCATTATAAATACCATCCATGGGGGCCCGAGTGGGGGACAGTCGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGG
CAGCACAAGAGTGAAGGAGTGCACATGCCCCATAAGGACGCGCTAGTGATCGCCCCACTCATAGATCTTGTGAAGGTGAGAAGAGTTCCTATCGACGGTGGAGCG
TCGACCAATATCTTGTTATTCTCGACCTACACGACTCTGGGGTGGGAGAGGAAGCACTTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTCGCAGGGGAGTCAGTC
AGCACGGAAGGATGTGTCTCCCTCCCTGTTACCATCAGCGAGGGAGATCAACAAGTAACTAAGGTTGCAGAATTTGTTGTGATAGATCGGAGCTATGCGTATAAC
GCCATAATTGGTCGGCCCTTGATTCATGATCTCAGGGAAGTTTCGTCCACTTACCACCAGGTCTTGAAGTACCCCACCTCGGCCGGAATTGCGACAGTCCGGGGT
GAGCAAAAAACGTCCAGAGAATGCTATGCCACTGCGATGAAGGGAATAGCCACTTATGCAGCGGTCATGGACGCGGCAAAGCCATGCGCCGACGAACCAGACCAG
AGCCGCGGTACCCCAGCTGAAGAGCTAGAACTTATCCCCCTGCTGGGGTCAGAAAAGCAGGTCAGCGTCGGCAGCAGATTGGGGGCCGAGGTGCCGAGGTCCGAA
AACTCCAACGCCGACTTACTGGCTCGCCTAGCCTCGGCATACGAGACCGACCTACCGAGAACAGTTCCAGTTGAAATACTCGTCGAGTCGTCCATCGACCAGCCT
GAGATAATGGAGGTCCAGTCAGCTCAGCCTCCATGGATGGACCCGATTAAGGACTTTCTGGTCAGTGGCTCAGTCCCTGCTGATCAGAGCCAGGCCAGAAAGCTC
CGACGCCAAGCTGCTCACTACTTGATACAAGAAGGCAAGCTCTTCAAGAGGGGATATTCCCTACCATTGTTGCGAGTTGTTACTACTACCGAGCTACGTCAAATA
GGGGCTAAGGAAGACTACGTGGTATGA
mRNA sequence
ATGCTGCCACTTCCCGTGCCCATCCTCGACTCGCGTACTCTCAGGTGGCCGGGACTCCTGTCATCGAACGACGACCCCAGGCGGGGTGGTCAAGGAAAATGGAGG
TCGGCCGGCCACATCCGATCCCATAGCAATCCGGGACTTCCACCTCGCCTCAGATCAGTTTCCGCCACTGCAGCCTCAGAGGAACGGGTTGCCGCCCCGTGCAAC
TCGCCTTCGCGGCTGGGGGAACACAGTAAAGGCAAGGCTCGAAGCGGTCGAGAGAGGCAGCGAGGTGTCCGGCTCTTCCGTCTCCAGGGATCCCATTCGAGGAAA
AGGGTTTATGCATCCGACCCAAATAACGGAGTATCAGTTCTGACCTCCCAAGGATGCCCGAGCTGGAGCACCCTCGCGAAGGCCACAACGGGTGGGGACAGGCGA
TGCCTTGGGAGCACAGGCCGAGGTGAGGGGAGGCCAAGGCGACGAGTGGTGGCGCCCGGAGATCGGGAGTACCTAGTCGGCGACGAGGAGGGTAGCCCAGAGGTC
GACGATCGAGAGAGGTCCTCCCACGGTAACCATTCATTCCGGTCTGAAGTGGACCTCCTCCGGGATCAGTTTCAGAGGGAGATAGAAGATCTCAAGCGGCGGTGC
AGGCCTGTAGATCCGCACCGCGTGAACGAGCAAGAGGAACCGCCTTTCTCCCAAGCAATCTTGGACGCACCTATCCCACCGAGGTTCAAGGCTCCGATCATGAGT
TCCTACGACGGTTATCAACAGCTGAGGAGGTTGTTCATCAACCAGTTCTCAGCTCGGCAGTTGTTGAAATTGCCGCCTTCTCACCTCAGAACAGTGAAGCAATGG
GACAATGAGTCTCTGACAGAATACATCGCTCGGTTCATAGACGAGCATGTCAAAGTGGTAAGTTGCACCGATGACATCGCCATGATGTACTTCACCACGTGTTTG
AACGACAGGAACTTGACAATAGAGTTCGGAAGTCGACTGCCGGCCTCCCTGAACCACGACACTTCAGGGAGGCCGGCAGTCGACTTCCGAGGTGAAGTACATCAT
GGCGATGTACTCTTGTTGTCTCGGCGGGCCGACGACAACAAGAGTAGAGGCCGTCGTGACGAGAAAGCTCCTTCAGACCGTCGAGGGTCGAAGTTCGACAGGTTC
ACCCCGCTGAACGCCTCAATCGCGGAGATCTACGCAGCGGCCAAAGACACCGACCTGAAGGCGCTTTTCGCAGCCCCAAAGAGGCTCCGTCAACCTCCAGGGAAG
CGAGACAAGCGACTCTACTGCCGATTCCACAAGGATCACGGCCACGACACTTCACGCGATTTCCACCTGAAGGAGCAGGTTGAGGATCTGATCCGGCGGGGTTAT
TTGAAAAAGTATGTCGGCAGGCGTGAAGCGGCAGAGGCAGAGGGGTCGGCTCGGGAGGAGAAGCGAGCGAAGTCACCACCGCCGAGGCGGAAGGAAGATCGCCCC
GCCATTATAAATACCATCCATGGGGGCCCGAGTGGGGGACAGTCGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACAAGAGTGAAGGAGTGCACATGCCC
CATAAGGACGCGCTAGTGATCGCCCCACTCATAGATCTTGTGAAGGTGAGAAGAGTTCCTATCGACGGTGGAGCGTCGACCAATATCTTGTTATTCTCGACCTAC
ACGACTCTGGGGTGGGAGAGGAAGCACTTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTCGCAGGGGAGTCAGTCAGCACGGAAGGATATGTCTCGCTCCCTGAG
CAGGTTGAAGATCTGATCCGGTGGGGTTATTTGAAAAAGTATGTCGGCAGGCGTGAAGCGGCAGAGCCAGAGGGGTCAGCTCGGGAGGAGAAGCGAGCGAAGTCA
CCACCGCCGAGGCGGAAGGAAGATCGCCCCGCCATTATAAATACCATCCATGGGGGCCCGAGTGGGGGACAGTCGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGG
CAGCACAAGAGTGAAGGAGTGCACATGCCCCATAAGGACGCGCTAGTGATCGCCCCACTCATAGATCTTGTGAAGGTGAGAAGAGTTCCTATCGACGGTGGAGCG
TCGACCAATATCTTGTTATTCTCGACCTACACGACTCTGGGGTGGGAGAGGAAGCACTTGAAGCTCAGCCCGACGCCTTTGGTCGGTTTCGCAGGGGAGTCAGTC
AGCACGGAAGGATGTGTCTCCCTCCCTGTTACCATCAGCGAGGGAGATCAACAAGTAACTAAGGTTGCAGAATTTGTTGTGATAGATCGGAGCTATGCGTATAAC
GCCATAATTGGTCGGCCCTTGATTCATGATCTCAGGGAAGTTTCGTCCACTTACCACCAGGTCTTGAAGTACCCCACCTCGGCCGGAATTGCGACAGTCCGGGGT
GAGCAAAAAACGTCCAGAGAATGCTATGCCACTGCGATGAAGGGAATAGCCACTTATGCAGCGGTCATGGACGCGGCAAAGCCATGCGCCGACGAACCAGACCAG
AGCCGCGGTACCCCAGCTGAAGAGCTAGAACTTATCCCCCTGCTGGGGTCAGAAAAGCAGGTCAGCGTCGGCAGCAGATTGGGGGCCGAGGTGCCGAGGTCCGAA
AACTCCAACGCCGACTTACTGGCTCGCCTAGCCTCGGCATACGAGACCGACCTACCGAGAACAGTTCCAGTTGAAATACTCGTCGAGTCGTCCATCGACCAGCCT
GAGATAATGGAGGTCCAGTCAGCTCAGCCTCCATGGATGGACCCGATTAAGGACTTTCTGGTCAGTGGCTCAGTCCCTGCTGATCAGAGCCAGGCCAGAAAGCTC
CGACGCCAAGCTGCTCACTACTTGATACAAGAAGGCAAGCTCTTCAAGAGGGGATATTCCCTACCATTGTTGCGAGTTGTTACTACTACCGAGCTACGTCAAATA
GGGGCTAAGGAAGACTACGTGGTATGA
Protein sequence
MLPLPVPILDSRTLRWPGLLSSNDDPRRGGQGKWRSAGHIRSHSNPGLPPRLRSVSATAASEERVAAPCNSPSRLGEHSKGKARSGRERQRGVRLFRLQGSHSRK
RVYASDPNNGVSVLTSQGCPSWSTLAKATTGGDRRCLGSTGRGEGRPRRRVVAPGDREYLVGDEEGSPEVDDRERSSHGNHSFRSEVDLLRDQFQREIEDLKRRC
RPVDPHRVNEQEEPPFSQAILDAPIPPRFKAPIMSSYDGYQQLRRLFINQFSARQLLKLPPSHLRTVKQWDNESLTEYIARFIDEHVKVVSCTDDIAMMYFTTCL
NDRNLTIEFGSRLPASLNHDTSGRPAVDFRGEVHHGDVLLLSRRADDNKSRGRRDEKAPSDRRGSKFDRFTPLNASIAEIYAAAKDTDLKALFAAPKRLRQPPGK
RDKRLYCRFHKDHGHDTSRDFHLKEQVEDLIRRGYLKKYVGRREAAEAEGSAREEKRAKSPPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMP
HKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESVSTEGYVSLPEQVEDLIRWGYLKKYVGRREAAEPEGSAREEKRAKS
PPPRRKEDRPAIINTIHGGPSGGQSGRREKLWLGRQHKSEGVHMPHKDALVIAPLIDLVKVRRVPIDGGASTNILLFSTYTTLGWERKHLKLSPTPLVGFAGESV
STEGCVSLPVTISEGDQQVTKVAEFVVIDRSYAYNAIIGRPLIHDLREVSSTYHQVLKYPTSAGIATVRGEQKTSRECYATAMKGIATYAAVMDAAKPCADEPDQ
SRGTPAEELELIPLLGSEKQVSVGSRLGAEVPRSENSNADLLARLASAYETDLPRTVPVEILVESSIDQPEIMEVQSAQPPWMDPIKDFLVSGSVPADQSQARKL
RRQAAHYLIQEGKLFKRGYSLPLLRVVTTTELRQIGAKEDYVV