; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g26510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g26510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr9:19792804..19795172
RNA-Seq ExpressionMoc09g26510
SyntenyMoc09g26510
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]2.0e-11045.71Show/hide
Query:  KAQREIEDPKRQCRPVDSH-RVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR
        K   ++E  K +C   +      +  E PF+  +L+        AP + SYDGS DP  YV VFE  MDF AASDA+KCRAFQIAL GSARLW+++ + +
Subjt:  KAQREIEDPKRQCRPVDSH-RVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR

Query:  SIDSYQQLRRLFI-----NQFSAWQLLKLPPSHLGIVKQQ------DNESLTEYIAR----------FKDEHVKVKQRCNGWGSSQRADDNQGKGRRDEK
           S       +      ++    +L K  P+    V Q+        E L     R           KDE   +K +  G  SS RA+      RR   
Subjt:  SIDSYQQLRRLFI-----NQFSAWQLLKLPPSHLGIVKQQ------DNESLTEYIAR----------FKDEHVKVKQRCNGWGSSQRADDNQGKGRRDEK

Query:  APSNRRGPKFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSA
         P+  R   +++FTP    I +I    E++ +E L   PEKL   P +R+K  YCRFH++HD +TS  + LK Q+E LI+  Y KK+VG+     P+ S+
Subjt:  APSNRRGPKFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSA

Query:  REAKREK--SAPPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGT
         E K E+  S  P RR DRPA+INTI GGP+G Q G KRK LAR A  EVC    + P  PI FD  D EEVH+PHNDA+VIAPLIDHV VRRVL+D G 
Subjt:  REAKREK--SAPPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGT

Query:  SANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGI
        SANI+S  TY ALGW R  LK + TPLV F+ ESV  EGC+ LPVT+G    +VT++AEFVVID  SAYNAI GRP+IH  +A+PST HQVLKY T  G+
Subjt:  SANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGI

Query:  ATVQGEQKTSRECYAAAMEGTTTCATVT-----NAAEPCADEPEPNRGTPAEELELVPLL
          V+GEQ  SRECYA+A++G++ CA  T        E  A+ P      P EELELVPLL
Subjt:  ATVQGEQKTSRECYAAAMEGTTTCATVT-----NAAEPCADEPEPNRGTPAEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]8.8e-11444.5Show/hide
Query:  KAQREIEDPKRQC-RPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR
        K   ++E  K +C +   S    +  E  FS  IL+A IPP+FK P M  YDGS DP  YV VFE  MDF AA+DA+KC AFQIAL GSARLWYR+L  R
Subjt:  KAQREIEDPKRQC-RPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR

Query:  SIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKV----------------------------------------KQRCNG
         I +Y QLR+ FI+QFS+    +  P+HL  ++Q++ E+L EY+ RF +E +KV                                        K+  +G
Subjt:  SIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKV----------------------------------------KQRCNG

Query:  -------WGSSQRADDNQGKGRRDEKAPSNRR--GPK--------------------FDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKR
                G  ++  D    G+   KA S  R  GP                     ++ +TP    I +I    E+T +E L   PEKL   P KR+  
Subjt:  -------WGSSQRADDNQGKGRRDEKAPSNRR--GPK--------------------FDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKR

Query:  LYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKS--APPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCT
         YCRFH+DH  +TS  + LK Q+E LI+ GY KK+VG+     P+ ++ E K E+     P RR+DRPA+IN             K+K LAREA  EVC 
Subjt:  LYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKS--APPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCT

Query:  SYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQ
           + P   I F+  D E VH+PHNDA+VIAPLID V VRR+L+DGG SANILS STY ALGW R  LK +PTPLV F+GES+S EGC+ LPV+I + D 
Subjt:  SYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQ

Query:  RVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCA
        +VT++AEFVVID  SAYNAI GRP+IH  +AVPST HQVLKY T  G+ TV+GE KTSRECYA+  + ++ CA
Subjt:  RVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCA

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]1.5e-14257.09Show/hide
Query:  YRQLKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRC-NGWGSSQRADDNQGKGRRDEKAPSNRRGPKFDKF
        +R   P S++      R +I+    W+      S  G  + +D++S               K+RC +   SS+RADD++ + RRDE+  SNRRGPKFDKF
Subjt:  YRQLKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRC-NGWGSSQRADDNQGKGRRDEKAPSNRRGPKFDKF

Query:  TPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRR
        TPLNASI +IY   EDTD+E LFA+PEKL RP GKR+KRLYCRFHKDH  DTSRCFHLKEQVE LIR GYLKKYVG RE+AE +GSARE KRE+S PPR 
Subjt:  TPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRR

Query:  REDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGW
        +EDRPA+INTI GGP+G + GQKRKALARE AHEVCTSYPK PVMPILFD+QDGE VHMPHNDA+VIAPLIDHVKVRRV +DGG SANI SFSTYTALGW
Subjt:  REDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGW

Query:  ERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYA
        ER+HLK   T LV FA ESVS EGC+ LPVTI EG+ +VT+VAEFVVIDRSSAY                                         R+C  
Subjt:  ERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYA

Query:  AAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLGPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDP
             + +C T            +P     +   ELVPLLGP++QVSIGS L A+ KEEL+ FL++N++VFAWSHDDM +IDP+IMVHRLN++P
Subjt:  AAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLGPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDP

XP_030955724.1 uncharacterized protein LOC115977839 [Quercus lobata]7.5e-10535.99Show/hide
Query:  VDDGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQ
        +D  +K   E+++  R+  P++   +  + + PF+ +I   P+P +FK P + SYDG+ DP  ++A F+  M      D + CRAF   L+G AR+W+ +
Subjt:  VDDGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQ

Query:  LKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQ---------RCNGWGSS-------------------QRAD
        + P S+ S+++L +LF+N F   Q  K   S L  ++Q +NESL  +I RF  E + V +           NG  S                    +RA+
Subjt:  LKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQ---------RCNGWGSS-------------------QRAD

Query:  DNQGKG-RRDEKAPSNRRGPKFDK----------------FTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLK
          +    R  E+AP  ++G   D+                +TPLNA +  + +  +D   +     PEK+   P KR+K  YCRFH+DH  DT  C+ LK
Subjt:  DNQGKG-RRDEKAPSNRRGPKFDK----------------FTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLK

Query:  EQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRRREDRPAIINTILGG-PTGRQLGQKRKALAREAAHEVCTSYPKEPVM---PILFDDQDGE
        +Q+E LIR+G LK +VG R+R + K    + K E+S+ P   E     I  I+GG P G+    K+  L      ++    P+   M    I F D+D E
Subjt:  EQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRRREDRPAIINTILGG-PTGRQLGQKRKALAREAAHEVCTSYPKEPVM---PILFDDQDGE

Query:  EVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYN
         +H PH+DAIVI  LI     RRVL+D G+SA+IL + T+  +   R  L+   +PL+ F G  V   G + LPV +G   Q++TK   F+V+D +S+YN
Subjt:  EVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYN

Query:  AIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLL--GPEKQVSIGSGLG
        AIIGRP ++  KA+ STYH  +K+PT  GI   QG+Q  +RECY A M        +    +  + E       P E LE V L    PEK   IG+G+ 
Subjt:  AIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLL--GPEKQVSIGSGLG

Query:  AEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLV
         + +++LI FL+ + +VFAWSHDDM  IDPS++ HRLN+ P ++ +RQK+R    ER   I +EV++L  AKFI+EV+Y  WL+NVV++
Subjt:  AEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLV

XP_030963307.1 uncharacterized protein LOC115984421 [Quercus lobata]9.8e-10537.89Show/hide
Query:  VDDGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQ
        +D  +K   E+++  R+  P++   +  + + PF+ +I   P+P +FK P + SYDG+ DP  ++A F+  M      D + CRAF   L+G AR+W+ +
Subjt:  VDDGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQ

Query:  LKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRCNGWGSSQRADDNQGKGRRDEKAPSNRRGPKFDKFTPLN
        + P SI S+++L +LF+N F   Q  K   S L  ++Q +NESL  +I RF  E + V +           DD            S+    K  +  P  
Subjt:  LKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRCNGWGSSQRADDNQGKGRRDEKAPSNRRGPKFDKFTPLN

Query:  ASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRRREDR
         + + +    +D  L+     PEK+   P KR+K  YCRFH+DH  DT  C+ LK+Q+E LIR+G LK +VG R+R E K    + K E+S+ P   E  
Subjt:  ASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRRREDR

Query:  PAIINTILGG-PTGRQLGQKRKALAREAAHEVCTSYPKEPVM---PILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGW
           I  I+GG P GR    K+  L      ++    P+   M    I F D+D E +H PH+DAIVI  LI     RRVL+D G+SA+IL + T+  +  
Subjt:  PAIINTILGG-PTGRQLGQKRKALAREAAHEVCTSYPKEPVM---PILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGW

Query:  ERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYA
         R  L+   +PL+ F G  V   G V LPV +G   Q++TK   F+V+D +S+YNAIIGRP ++  KA+ STYH  +K+PT  GI   QG+Q  +RECY 
Subjt:  ERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYA

Query:  AAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLL--GPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDPSYRS
        A M        +    +  + E       P EELE V L    PEK   IG+G+  + +E+LI FL+ + +VFAWSHDDM  IDPS++ HRLN+ P ++ 
Subjt:  AAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLL--GPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDPSYRS

Query:  VRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLV
        +RQK+R    ER   I +EV++L  AKFI+EV+Y  WL+NVV++
Subjt:  VRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLV

TrEMBL top hitse value%identityAlignment
A0A2N9EL41 Reverse transcriptase3.7e-11038.42Show/hide
Query:  DGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLK
        D    Q ++ D  +     +   +  + + P   +I D P+P RFK P++ ++DG+ DP  Y+  F+  M   A  + + CRAF + L GSAR+W+ +L+
Subjt:  DGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLK

Query:  PRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKV-KQRCNGWGSSQRADDNQGKGRRD----------EKAPSNRRGP
          SI S+ QL R FI+ F   Q    PP HL  VKQ + ESL  ++ RF +E +K+ + + N   + +  DD   K R++          +K P     P
Subjt:  PRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKV-KQRCNGWGSSQRADDNQGKGRRD----------EKAPSNRRGP

Query:  ----------KFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEP-K
                  KF  FTPLN  I  + +  +D   +     P K+   P  R K LYCRFH+DH   T  C  LKEQVE LIR+G L+KYV R     P K
Subjt:  ----------KFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEP-K

Query:  GSAREAKREKSAPPRRREDRPAIINTILGGP-TGRQLGQKRKALAREAAHEVCTSYPKEPV----MPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRV
          A+  + E + P    E     I TI+GGP +G      RKA AR+  + +    P + V      I F ++D    H PH+DA+VI   I     RRV
Subjt:  GSAREAKREKSAPPRRREDRPAIINTILGGP-TGRQLGQKRKALAREAAHEVCTSYPKEPV----MPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRV

Query:  LIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKY
        ++D G+SA+IL   TY  +  ++  L+    PLV F  + V   G V LP+T+G   + V+K  +F+V++  SAYNAIIGRP ++ L+AV STYH +LK+
Subjt:  LIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKY

Query:  PTSAGIATVQGEQKTSRECYAAAM--EGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLG--PEKQVSIGSGLGAEVKEELIGFLQANANVFAWSH
        PT  GI  V+G+Q  SRECY A++  EG     T+         E       P+EEL+ + L    PE+   IG+ L  ++KE L+ FL++N +VFAWSH
Subjt:  PTSAGIATVQGEQKTSRECYAAAM--EGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLG--PEKQVSIGSGLGAEVKEELIGFLQANANVFAWSH

Query:  DDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLVKM
        +DM  IDPSI+ H+LN+DPS R V+QKRR    ER+N I +E+++LL A FIREV Y  WL+NVV++ M
Subjt:  DDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLVKM

A0A2N9IJR2 Ribonuclease H3.9e-10737.15Show/hide
Query:  RKAQREIEDPKRQCRPVDSH---RVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQL
        R+ ++++ D K   R   +     +  + + PF  +I D P+P RFK P++ ++DG+ DP  Y+  F+  M   A  + + CRAF + L GSAR+W+ +L
Subjt:  RKAQREIEDPKRQCRPVDSH---RVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQL

Query:  KPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQ--------RCNGWGSSQRADDNQGKGRRD----EKAPSNRR
        +  SI S+ QL R FI+ F   Q    PP+HL  VKQ + ESL  ++ RF  E +K+ +              + +  DD   K R++    +  PS ++
Subjt:  KPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQ--------RCNGWGSSQRADDNQGKGRRD----EKAPSNRR

Query:  GP----------------KFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGR
         P                KF  FTPLN  I  + +  +D   +     P K+   P  R K LYCRFH+DH   T  C  LKEQVE LIR+G L+KYV R
Subjt:  GP----------------KFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGR

Query:  RERAEPKGSAREAKREKSAPPRRREDRPAIINTILGGP-TGRQLGQKRKALAREAAHEVCTSYPKEPV----MPILFDDQDGEEVHMPHNDAIVIAPLID
             P  +    +RE++ P   R      I TI+GGP +G      RKA AR+  + +    P + +      I F ++D    H PH+DA+VI   I 
Subjt:  RERAEPKGSAREAKREKSAPPRRREDRPAIINTILGGP-TGRQLGQKRKALAREAAHEVCTSYPKEPV----MPILFDDQDGEEVHMPHNDAIVIAPLID

Query:  HVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPST
            RRV++D G+SA+IL    Y  +  ++  L+    PLV F G+ +   G V LP+ +G   + V+K  +F+V++  SAYNAIIGRP ++ L+AV ST
Subjt:  HVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPST

Query:  YHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLG--PEKQVSIGSGLGAEVKEELIGFLQANANV
        YH +LK+PT  GI  V+G+Q  +RECY A++       T+       + E +     P+ EL  + L    PE+   IG+ L  ++KE L+ FL+ N +V
Subjt:  YHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLG--PEKQVSIGSGLGAEVKEELIGFLQANANV

Query:  FAWSHDDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLVK
        FAWSH+DM  I+PSI+ H+LN+DPS R ++QKRR    ER+N I +EV++LL A FIREV Y  WL+NVV+VK
Subjt:  FAWSHDDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVICKEVEQLLRAKFIREVHYLAWLSNVVLVK

A0A6J1D9E1 uncharacterized protein LOC1110188239.9e-11145.71Show/hide
Query:  KAQREIEDPKRQCRPVDSH-RVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR
        K   ++E  K +C   +      +  E PF+  +L+        AP + SYDGS DP  YV VFE  MDF AASDA+KCRAFQIAL GSARLW+++ + +
Subjt:  KAQREIEDPKRQCRPVDSH-RVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR

Query:  SIDSYQQLRRLFI-----NQFSAWQLLKLPPSHLGIVKQQ------DNESLTEYIAR----------FKDEHVKVKQRCNGWGSSQRADDNQGKGRRDEK
           S       +      ++    +L K  P+    V Q+        E L     R           KDE   +K +  G  SS RA+      RR   
Subjt:  SIDSYQQLRRLFI-----NQFSAWQLLKLPPSHLGIVKQQ------DNESLTEYIAR----------FKDEHVKVKQRCNGWGSSQRADDNQGKGRRDEK

Query:  APSNRRGPKFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSA
         P+  R   +++FTP    I +I    E++ +E L   PEKL   P +R+K  YCRFH++HD +TS  + LK Q+E LI+  Y KK+VG+     P+ S+
Subjt:  APSNRRGPKFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSA

Query:  REAKREK--SAPPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGT
         E K E+  S  P RR DRPA+INTI GGP+G Q G KRK LAR A  EVC    + P  PI FD  D EEVH+PHNDA+VIAPLIDHV VRRVL+D G 
Subjt:  REAKREK--SAPPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGT

Query:  SANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGI
        SANI+S  TY ALGW R  LK + TPLV F+ ESV  EGC+ LPVT+G    +VT++AEFVVID  SAYNAI GRP+IH  +A+PST HQVLKY T  G+
Subjt:  SANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGI

Query:  ATVQGEQKTSRECYAAAMEGTTTCATVT-----NAAEPCADEPEPNRGTPAEELELVPLL
          V+GEQ  SRECYA+A++G++ CA  T        E  A+ P      P EELELVPLL
Subjt:  ATVQGEQKTSRECYAAAMEGTTTCATVT-----NAAEPCADEPEPNRGTPAEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204794.3e-11444.5Show/hide
Query:  KAQREIEDPKRQC-RPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR
        K   ++E  K +C +   S    +  E  FS  IL+A IPP+FK P M  YDGS DP  YV VFE  MDF AA+DA+KC AFQIAL GSARLWYR+L  R
Subjt:  KAQREIEDPKRQC-RPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGDPISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR

Query:  SIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKV----------------------------------------KQRCNG
         I +Y QLR+ FI+QFS+    +  P+HL  ++Q++ E+L EY+ RF +E +KV                                        K+  +G
Subjt:  SIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKV----------------------------------------KQRCNG

Query:  -------WGSSQRADDNQGKGRRDEKAPSNRR--GPK--------------------FDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKR
                G  ++  D    G+   KA S  R  GP                     ++ +TP    I +I    E+T +E L   PEKL   P KR+  
Subjt:  -------WGSSQRADDNQGKGRRDEKAPSNRR--GPK--------------------FDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKR

Query:  LYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKS--APPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCT
         YCRFH+DH  +TS  + LK Q+E LI+ GY KK+VG+     P+ ++ E K E+     P RR+DRPA+IN             K+K LAREA  EVC 
Subjt:  LYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKS--APPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCT

Query:  SYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQ
           + P   I F+  D E VH+PHNDA+VIAPLID V VRR+L+DGG SANILS STY ALGW R  LK +PTPLV F+GES+S EGC+ LPV+I + D 
Subjt:  SYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQ

Query:  RVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCA
        +VT++AEFVVID  SAYNAI GRP+IH  +AVPST HQVLKY T  G+ TV+GE KTSRECYA+  + ++ CA
Subjt:  RVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAMEGTTTCA

A0A6J1E0L8 uncharacterized protein LOC1110253104.4e-14357.29Show/hide
Query:  YRQLKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRC-NGWGSSQRADDNQGKGRRDEKAPSNRRGPKFDKF
        +R   P S++      R +I+    W+      S  G  + +D++S               K+RC +   SS+RADD++ + RRDE+  SNRRGPKFDKF
Subjt:  YRQLKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRC-NGWGSSQRADDNQGKGRRDEKAPSNRRGPKFDKF

Query:  TPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRR
        TPLNASI +IY   EDTD+E LFA+PEKL RP GKR+KRLYCRFHKDH  DTSRCFHLKEQVE LIR GYLKKYVG RE+AE +GSARE KRE+S PPR 
Subjt:  TPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPKGSAREAKREKSAPPRR

Query:  REDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGW
        +EDRPA+INTI GGP+G + GQKRKALARE AHEVCTSYPK PVMPILFD+QDGE VHMPHNDA+VIAPLIDHVKVRRV +DGG SANI SFSTYTALGW
Subjt:  REDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFSTYTALGW

Query:  ERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYA
        ER+HLK   T LV FA ESVS EGC+ LPVTI EG+ +VT+VAEFVVIDRSSAY                                         R+C  
Subjt:  ERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYA

Query:  AAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLGPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDP
             + +C T            +P     +   ELVPLLGP++QVSIGS L A+ KEEL+ FL++N++VFAWSHDDM +IDP+IMVHRLN+DP
Subjt:  AAMEGTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLGPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGATGCTGCCACTTCTCGTGCCCAACCTCCACTCACGTACTCTCAGGTGGCAGGAACTCCCGTCATCAAACAACGATCCCAGGCGGGGGTGGTCAAGGAGAATGG
AGGTCGACGAGTAATGGCACCCGGAGATCGGGAGTATCCGGTTGACGATGGGAGGAAAGCCCAGAGGGAGATAGAAGATCCCAAGCGGCAGTGCAGGCCTGTAGACTCGC
ATCGCGTAGCCGAGCAAGATGAACCGCCTTTCTCCCAAGCGATCTTGGACGCACCTATCCCACCAAGGTTCAAGGCTCCGGTCATGAGTTCTTACGACGGATCTGGAGAT
CCGATCTCCTACGTGGCAGTGTTCGAGAGGAAGATGGATTTCCTGGCCGCGAGCGACGCCATGAAGTGCCGAGCATTTCAAATAGCCTTGGAAGGCTCAGCAAGATTGTG
GTACCGACAGTTGAAGCCCCGATCCATCGATAGTTATCAACAGCTGAGAAGATTGTTCATCAACCAATTCTCAGCTTGGCAGTTGTTGAAGTTGCCGCCCTCTCACCTCG
GAATAGTAAAGCAACAGGACAATGAGTCCCTGACAGAGTACATCGCTCGGTTCAAGGACGAGCATGTCAAAGTGAAGCAACGCTGCAATGGTTGGGGCTCGTCTCAGCGG
GCCGACGACAACCAAGGTAAAGGCCGCCGCGACGAAAAAGCCCCTTCAAACCGACGAGGGCCGAAGTTCGACAAGTTCACTCCGTTGAACGCCTCAATCGTAGATATCTA
CGTGGCGGCTGAAGATACCGACCTGGAGGCGCTTTTCGCGGCCCCAGAAAAGCTCCTCCGACCTCCAGGGAAACGAGACAAGCGACTTTACTGCCGATTCCACAAGGATC
ACGACCAGGACACTTCACGCTGTTTCCACCTGAAGGAGCAGGTCGAGGGTCTGATCCGGAGGGGTTATCTGAAAAAATACGTCGGCAGGCGTGAAAGGGCAGAGCCAAAG
GGGTCGGCTCGGGAGGCGAAGCGAGAGAAGTCAGCACCGCCGAGACGGAGGGAAGATCGGCCCGCCATTATAAATACCATCCTTGGGGGCCCAACTGGGCGACAGTTGGG
GCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACACGAGGTCTGTACCTCATACCCCAAGGAGCCTGTTATGCCGATCTTATTCGACGACCAAGACGGCGAAGAAGTGC
ACATGCCTCATAATGACGCCATAGTAATTGCCCCACTCATAGATCACGTGAAGGTGAGAAGAGTTCTTATCGACGGTGGAACGTCGGCCAACATCTTATCGTTCTCGACC
TACACGGCCCTGGGTTGGGAGAGGAAGCACTTGAAGCTCAACCCGACGCCTTTGGTCGATTTTGCAGGGGAGTCAGTTAGCGCGGAAGGGTGTGTCTTGCTCCCTGTCAC
CATCGGCGAGGGAGATCAACGAGTAACTAAGGTCGCAGAATTTGTTGTGATAGATCGGAGCTCTGCGTACAACGCCATAATTGGTCGGCCTTTGATTCATGATCTCAAGG
CAGTTCCATCCACTTATCACCAGGTCTTGAAGTACCCCACCTCGGCCGGAATTGCGACAGTCCAGGGTGAGCAAAAGACGTCCAGAGAATGCTATGCAGCCGCGATGGAG
GGAACAACCACTTGTGCAACGGTCACGAACGCAGCAGAGCCATGTGCCGACGAACCAGAGCCGAACCGTGGTACCCCAGCTGAAGAGCTAGAACTTGTCCCCCTGCTGGG
GCCAGAAAAGCAGGTCAGCATCGGCAGCGGACTGGGGGCCGAGGTAAAAGAAGAGCTCATCGGTTTTCTGCAAGCAAATGCTAACGTGTTCGCATGGTCTCATGACGACA
TGTCGAGCATAGACCCTAGCATAATGGTGCATAGACTAAATATAGATCCTAGCTATAGGTCTGTGAGGCAAAAGCGTCGGCCTGTCGACGCCGAGCGAAGCAATGTAATT
TGTAAAGAAGTCGAGCAGTTGCTACGAGCTAAATTCATAAGAGAAGTTCATTACCTCGCGTGGCTATCTAATGTAGTTTTAGTTAAAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGATGCTGCCACTTCTCGTGCCCAACCTCCACTCACGTACTCTCAGGTGGCAGGAACTCCCGTCATCAAACAACGATCCCAGGCGGGGGTGGTCAAGGAGAATGG
AGGTCGACGAGTAATGGCACCCGGAGATCGGGAGTATCCGGTTGACGATGGGAGGAAAGCCCAGAGGGAGATAGAAGATCCCAAGCGGCAGTGCAGGCCTGTAGACTCGC
ATCGCGTAGCCGAGCAAGATGAACCGCCTTTCTCCCAAGCGATCTTGGACGCACCTATCCCACCAAGGTTCAAGGCTCCGGTCATGAGTTCTTACGACGGATCTGGAGAT
CCGATCTCCTACGTGGCAGTGTTCGAGAGGAAGATGGATTTCCTGGCCGCGAGCGACGCCATGAAGTGCCGAGCATTTCAAATAGCCTTGGAAGGCTCAGCAAGATTGTG
GTACCGACAGTTGAAGCCCCGATCCATCGATAGTTATCAACAGCTGAGAAGATTGTTCATCAACCAATTCTCAGCTTGGCAGTTGTTGAAGTTGCCGCCCTCTCACCTCG
GAATAGTAAAGCAACAGGACAATGAGTCCCTGACAGAGTACATCGCTCGGTTCAAGGACGAGCATGTCAAAGTGAAGCAACGCTGCAATGGTTGGGGCTCGTCTCAGCGG
GCCGACGACAACCAAGGTAAAGGCCGCCGCGACGAAAAAGCCCCTTCAAACCGACGAGGGCCGAAGTTCGACAAGTTCACTCCGTTGAACGCCTCAATCGTAGATATCTA
CGTGGCGGCTGAAGATACCGACCTGGAGGCGCTTTTCGCGGCCCCAGAAAAGCTCCTCCGACCTCCAGGGAAACGAGACAAGCGACTTTACTGCCGATTCCACAAGGATC
ACGACCAGGACACTTCACGCTGTTTCCACCTGAAGGAGCAGGTCGAGGGTCTGATCCGGAGGGGTTATCTGAAAAAATACGTCGGCAGGCGTGAAAGGGCAGAGCCAAAG
GGGTCGGCTCGGGAGGCGAAGCGAGAGAAGTCAGCACCGCCGAGACGGAGGGAAGATCGGCCCGCCATTATAAATACCATCCTTGGGGGCCCAACTGGGCGACAGTTGGG
GCAGAAGAGAAAAGCTCTGGCTCGGGAGGCAGCACACGAGGTCTGTACCTCATACCCCAAGGAGCCTGTTATGCCGATCTTATTCGACGACCAAGACGGCGAAGAAGTGC
ACATGCCTCATAATGACGCCATAGTAATTGCCCCACTCATAGATCACGTGAAGGTGAGAAGAGTTCTTATCGACGGTGGAACGTCGGCCAACATCTTATCGTTCTCGACC
TACACGGCCCTGGGTTGGGAGAGGAAGCACTTGAAGCTCAACCCGACGCCTTTGGTCGATTTTGCAGGGGAGTCAGTTAGCGCGGAAGGGTGTGTCTTGCTCCCTGTCAC
CATCGGCGAGGGAGATCAACGAGTAACTAAGGTCGCAGAATTTGTTGTGATAGATCGGAGCTCTGCGTACAACGCCATAATTGGTCGGCCTTTGATTCATGATCTCAAGG
CAGTTCCATCCACTTATCACCAGGTCTTGAAGTACCCCACCTCGGCCGGAATTGCGACAGTCCAGGGTGAGCAAAAGACGTCCAGAGAATGCTATGCAGCCGCGATGGAG
GGAACAACCACTTGTGCAACGGTCACGAACGCAGCAGAGCCATGTGCCGACGAACCAGAGCCGAACCGTGGTACCCCAGCTGAAGAGCTAGAACTTGTCCCCCTGCTGGG
GCCAGAAAAGCAGGTCAGCATCGGCAGCGGACTGGGGGCCGAGGTAAAAGAAGAGCTCATCGGTTTTCTGCAAGCAAATGCTAACGTGTTCGCATGGTCTCATGACGACA
TGTCGAGCATAGACCCTAGCATAATGGTGCATAGACTAAATATAGATCCTAGCTATAGGTCTGTGAGGCAAAAGCGTCGGCCTGTCGACGCCGAGCGAAGCAATGTAATT
TGTAAAGAAGTCGAGCAGTTGCTACGAGCTAAATTCATAAGAGAAGTTCATTACCTCGCGTGGCTATCTAATGTAGTTTTAGTTAAAATGTAA
Protein sequenceShow/hide protein sequence
MRDAATSRAQPPLTYSQVAGTPVIKQRSQAGVVKENGGRRVMAPGDREYPVDDGRKAQREIEDPKRQCRPVDSHRVAEQDEPPFSQAILDAPIPPRFKAPVMSSYDGSGD
PISYVAVFERKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPRSIDSYQQLRRLFINQFSAWQLLKLPPSHLGIVKQQDNESLTEYIARFKDEHVKVKQRCNGWGSSQR
ADDNQGKGRRDEKAPSNRRGPKFDKFTPLNASIVDIYVAAEDTDLEALFAAPEKLLRPPGKRDKRLYCRFHKDHDQDTSRCFHLKEQVEGLIRRGYLKKYVGRRERAEPK
GSAREAKREKSAPPRRREDRPAIINTILGGPTGRQLGQKRKALAREAAHEVCTSYPKEPVMPILFDDQDGEEVHMPHNDAIVIAPLIDHVKVRRVLIDGGTSANILSFST
YTALGWERKHLKLNPTPLVDFAGESVSAEGCVLLPVTIGEGDQRVTKVAEFVVIDRSSAYNAIIGRPLIHDLKAVPSTYHQVLKYPTSAGIATVQGEQKTSRECYAAAME
GTTTCATVTNAAEPCADEPEPNRGTPAEELELVPLLGPEKQVSIGSGLGAEVKEELIGFLQANANVFAWSHDDMSSIDPSIMVHRLNIDPSYRSVRQKRRPVDAERSNVI
CKEVEQLLRAKFIREVHYLAWLSNVVLVKM