; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025528 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025528
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome locationchr10:14442877..14456500
RNA-Seq ExpressionLag0025528
SyntenyLag0025528
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]1.3e-8336.47Show/hide
Query:  NGGRIPPVPPVPPVPQQENPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-----
        N G IP VP            +I +D++RAIR YA P F+ LN GI+ P I A QFELKPVMFQML T+GQFSG+P EDPH H+ LF+ + +SFK     
Subjt:  NGGRIPPVPPVPPVPQQENPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-----

Query:  --------------GDAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLD
                        A TWL+S P  S+T+WNDL EKFL K+FP N NAK + EI SF+Q  +E L  AWERF+ L++KCPHHG   CI +E FY+GL+
Subjt:  --------------GDAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLD

Query:  QASKALVNASANGSFLKKSGPQGSQ---------------------------------------------------------------------------
          +K +V+ASANG+ L KS  Q  +                                                                           
Subjt:  QASKALVNASANGSFLKKSGPQGSQ---------------------------------------------------------------------------

Query:  ---------------------NQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNF-QQVPVQNQESNLETLIKEYMARNDCAFTNKED
                             NQN+  PY  TYN  WRQHPNFSW  Q  +S ++T          F QQ P   Q ++LE ++KEY+ +N+ + +  E 
Subjt:  ---------------------NQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNF-QQVPVQNQESNLETLIKEYMARNDCAFTNKED

Query:  KIE--------------------RTKEHEALMEKEEKSK----------------------------------------LEKRQGNATAKPQKFVIDPDY
         ++                    R + H  L    EK K                                         +K   N    P K       
Subjt:  KIE--------------------RTKEHEALMEKEEKSK----------------------------------------LEKRQGNATAKPQKFVIDPDY

Query:  KPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK
        +P PP+PQRF+   Q+VQ KKFLDVLKQLHINIPLVEALEQMP YVKF+KDIL+KKRRLGE+ETVALT+  S+ +++ +P K+K
Subjt:  KPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]7.2e-7933.33Show/hide
Query:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE
        NPI++A+ R+RA+RDYA  + + LN  ++       +FE KP+M QMLN +GQF GL +EDP  H+  F++V N+F+                   G A 
Subjt:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE

Query:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK
         WL++FP ++IT+W+D+ +KFL K+FP  +NA  + EIISFRQ  NE ++VAWERF+ L+  CP+ G PAC+ +EHF+ G D  +K ++N +ANG F  K
Subjt:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK

Query:  SGPQ------------------------------------------------------------------------------------------------
        S  +                                                                                                
Subjt:  SGPQ------------------------------------------------------------------------------------------------

Query:  --------GSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQ-----------FGKQP-----------QKNFQQVPVQNQESNLETLIKE---
                G  NQ ++NPY  TYNPGW+QHPNFSW GQ GSSN+  H            F   P           QKN+ Q P Q   SN+E L+KE   
Subjt:  --------GSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQ-----------FGKQP-----------QKNFQQVPVQNQESNLETLIKE---

Query:  -------------------------YMARNDCAFTNKEDKIERT-----------------------KEH------------EALMEKEEKSKLEKRQGN
                                 YM RND      E ++ +                        KEH            E     +E S    R+ +
Subjt:  -------------------------YMARNDCAFTNKEDKIERT-----------------------KEH------------EALMEKEEKSKLEKRQGN

Query:  ATAKPQKFVIDP-----------DYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLV
          A P K +++P           + +PPPP+PQR    +QD   +KFLD+LKQLHINIP VEALEQMPTY KF+KDI+++K++LGEYETVALTECSS + 
Subjt:  ATAKPQKFVIDP-----------DYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLV

Query:  KNDIPPKLK
        K+ +PPKLK
Subjt:  KNDIPPKLK

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]2.2e-8037.71Show/hide
Query:  QENPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GD
        + NPI +A+DR RAIR+YA P+F  LNPGI+ P I AP FELKPVMFQML T+GQF G P EDPH HI  FL V +SFK                     
Subjt:  QENPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GD

Query:  AETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFL
        A  WL++ PP+S+T+WNDLAEKFL K+FP  +NAK+++EI+SF+QS +E    AWERF+ L++KCPHHG P CI LE FY+GL+ AS+ +++ASANG+ L
Subjt:  AETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFL

Query:  KKSGPQ---------------------------------------------------------------------------------------------G
         KS  +                                                                                             G
Subjt:  KKSGPQ---------------------------------------------------------------------------------------------G

Query:  SQNQNR-WNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNFQQVPVQNQESNLETLIKEYMARNDCAF------------------------
        +QN NR  NPY  +YNP W+ HPNFSWGGQ     S    F +QP+      P  +Q S+LE+L+++YMA+ND                           
Subjt:  SQNQNR-WNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNFQQVPVQNQESNLETLIKEYMARNDCAF------------------------

Query:  ------TNKEDKIERTKEH---------------EALMEKEEKSKLEKRQGNATAKPQKFVIDPD----------------YKPPPPYPQRFKHASQDVQ
              ++ E+     KEH                A    +E S ++K +G    KP    ++                   KPPPP+PQRFK    D Q
Subjt:  ------TNKEDKIERTKEH---------------EALMEKEEKSKLEKRQGNATAKPQKFVIDPD----------------YKPPPPYPQRFKHASQDVQ

Query:  LKKFLDVLKQLHINIPLVEALEQMPTYVKFLKD
         ++FLDVLKQLHINIPLVEALEQMPTYVKFLKD
Subjt:  LKKFLDVLKQLHINIPLVEALEQMPTYVKFLKD

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]2.9e-8035.65Show/hide
Query:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE
        +PI + +DR RAIR+YA P+F  LNPGI+ P I APQFELKPVMFQML T+GQFS +P EDPH H+  FL + +SFK                     A 
Subjt:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE

Query:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK
        +WL++  P+S+T+WND AEKFL K+FP  +NAK+++EI+SF Q  +E    AWERF+ L++KCPHHG P CI +E FY+GL+  S+ +++ASANG+ L K
Subjt:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK

Query:  ------------------------------------------------------------------------------------------SGPQ-----G
                                                                                                  S P+     G
Subjt:  ------------------------------------------------------------------------------------------SGPQ-----G

Query:  SQNQNRWN-PYLATYNPGWRQHPNFSWGGQAG----SSNSNTHQFGKQPQKNFQQVPVQNQESNLETLIKEYMARNDCA---------------------
        +QN NR N  +  +YN  W+ HPN SWG ++           +  G   Q    Q    +Q S+LE+L+++YMA+ND                       
Subjt:  SQNQNRWN-PYLATYNPGWRQHPNFSWGGQAG----SSNSNTHQFGKQPQKNFQQVPVQNQESNLETLIKEYMARNDCA---------------------

Query:  ----------------------------------FTNKEDKIERTKEHEALMEKEEKSK-----------LEKRQGNATAKPQKFVIDPDYKPPPPYPQR
                                            N E++I+ + E  ++   E+ SK           ++   G  +   Q   +    KPP P+PQR
Subjt:  ----------------------------------FTNKEDKIERTKEHEALMEKEEKSK-----------LEKRQGNATAKPQKFVIDPDYKPPPPYPQR

Query:  FKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK
        F+   QD Q KKFLDVLKQLHINIPLVEALEQMP YVKFLKDIL+KKRRLGE+E+  LTE    ++KN IPPKLK
Subjt:  FKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK

XP_030508947.1 uncharacterized protein LOC115723603 [Cannabis sativa]6.7e-8539.96Show/hide
Query:  ENPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFKGD-------------------A
        +NPI +A+DR RA R+YA  VF  LNPG + P I AP FELKPVMFQML  +GQFSG P EDPH HI  F  V +SFK                     A
Subjt:  ENPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFKGD-------------------A

Query:  ETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLK
          WL++ PP+ +TSWNDLAEKFL K+FP  +NA +++EI+SF+Q  +E    AWERF+ L++KCPHHG P CI LE FY+GL+ AS+ +++ASA+G+ L 
Subjt:  ETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLK

Query:  KSGPQG-------SQNQNRWNP------------------------------YLATYNPGW--------------------------------RQHPNFS
        KS  +        ++N  +W+                                L   N G                                   HPNFS
Subjt:  KSGPQG-------SQNQNRWNP------------------------------YLATYNPGW--------------------------------RQHPNFS

Query:  WGGQAGSSNSNTHQ--------FGKQPQKNFQQVPVQNQESNLETLIKEYMARNDC------------------AFTNKEDKIERTK------------E
        WGGQ  SS+    Q        F +QP+      P  +Q S+LE+L+++YMA+ND                     T +  KI  +             +
Subjt:  WGGQAGSSNSNTHQ--------FGKQPQKNFQQVPVQNQESNLETLIKEYMARNDC------------------AFTNKEDKIERTK------------E

Query:  HEALMEKEEKSKLEKRQGNATAKPQKFVIDPD-YKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVA
         E  M+K+    + +   + T   Q   ++    KPPPP+PQR K    D Q ++FLDVLKQL+INIPL EALEQMPTYVKFLKDIL++KRRLGE+ETVA
Subjt:  HEALMEKEEKSKLEKRQGNATAKPQKFVIDPD-YKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVA

Query:  LTECSSTLVKNDIPPKLK
        LTE  S ++K+ IPPKLK
Subjt:  LTECSSTLVKNDIPPKLK

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129454.0e-6733.27Show/hide
Query:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE
        N I +  + NRA+RDY +P+ Q L+  I  P I A  FE+KP   QM+ +  QFSGLP++DP+ H+  FL + ++FK                     A+
Subjt:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE

Query:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK
        +WL+S P  SIT+W DLA+KFL KFFP  K AK + +I SF Q   E L  AWERF+ L+++CPHHG P  + ++ FY+GL  + K +++A+A G+ + K
Subjt:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK

Query:  -----------------------SGPQ--------------------------------------------------------------GSQNQNRWNPY
                               SG +                                                              G+ N+ + NPY
Subjt:  -----------------------SGPQ--------------------------------------------------------------GSQNQNRWNPY

Query:  LATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNFQQVPVQNQESNLETLIKEYMARND-------CAFTNKEDKIERT------------------
          TYNPGWR HPNFSW   AG SN          Q+   Q+P   ++S LE L+ +Y+++ D        +  N E ++ +                   
Subjt:  LATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNFQQVPVQNQESNLETLIKEYMARND-------CAFTNKEDKIERT------------------

Query:  ----------------KEHEALMEKEEKSKLE--KRQGNATAKPQKFVIDPD----------YKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVE
                        KE E + +K  +S++E   ++G    + +    D D            PPPP+PQR +    + Q +KFL+V K+LHINIP  E
Subjt:  ----------------KEHEALMEKEEKSKLE--KRQGNATAKPQKFVIDPD----------YKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVE

Query:  ALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK
        ALEQMP+YVKFLKDILSKKR+LGE+ETV LTE  S +++N +PPKLK
Subjt:  ALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK

A0A6J1CPJ3 uncharacterized protein LOC1110129471.7e-7834.29Show/hide
Query:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE
        NPI++A+ R+RA+RDYA  + + LN  +         FE KP+M QMLN +GQF GL +EDP  H+  F++V N+F+                   G A 
Subjt:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE

Query:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK
         WL++FP ++I + +D+ +KFL K+FP  +NA  + EIISFRQ  NE ++VAWERF+ L++ CP+ G PAC+ +EHF+   D  +  ++N +ANG F  K
Subjt:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK

Query:  SGPQ---------------------------------------------------------------------------------GSQNQNRWNPYLATY
        S  +                                                                                    NQ ++NPY   Y
Subjt:  SGPQ---------------------------------------------------------------------------------GSQNQNRWNPYLATY

Query:  NPGWRQHPNFSWGGQAGSSNSNTHQFGKQ---------------------PQKNFQQVPVQNQESNLETLIKEYMARNDCA----FTNKEDKIERTKEHE
        NPGW+QHPNFSW GQ  SS +  +Q  KQ                      QKN+ Q P Q   SN+E L+KE++ +ND       T  +  I+  KE +
Subjt:  NPGWRQHPNFSWGGQAGSSNSNTHQFGKQ---------------------PQKNFQQVPVQNQESNLETLIKEYMARNDCA----FTNKEDKIERTKEHE

Query:  ALMEKEEKS--KLEKRQG---------------NATAKPQKFV-----------------IDPDY-----------KPPPPYPQRFKHASQDVQLKKFLD
          M + + +   LE + G               ++T +P++ V                 ++P+            + PPP+PQR    +QD   +KFLD
Subjt:  ALMEKEEKS--KLEKRQG---------------NATAKPQKFV-----------------IDPDY-----------KPPPPYPQRFKHASQDVQLKKFLD

Query:  VLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK
        +LKQLHINIP VEALEQMPTY KFLKDI+++K++LGEYETVALTECSS + K+  PPKLK
Subjt:  VLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK

A0A6J1DY39 uncharacterized protein LOC1110256533.5e-7933.33Show/hide
Query:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE
        NPI++A+ R+RA+RDYA  + + LN  ++       +FE KP+M QMLN +GQF GL +EDP  H+  F++V N+F+                   G A 
Subjt:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFK-------------------GDAE

Query:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK
         WL++FP ++IT+W+D+ +KFL K+FP  +NA  + EIISFRQ  NE ++VAWERF+ L+  CP+ G PAC+ +EHF+ G D  +K ++N +ANG F  K
Subjt:  TWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKK

Query:  SGPQ------------------------------------------------------------------------------------------------
        S  +                                                                                                
Subjt:  SGPQ------------------------------------------------------------------------------------------------

Query:  --------GSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQ-----------FGKQP-----------QKNFQQVPVQNQESNLETLIKE---
                G  NQ ++NPY  TYNPGW+QHPNFSW GQ GSSN+  H            F   P           QKN+ Q P Q   SN+E L+KE   
Subjt:  --------GSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQ-----------FGKQP-----------QKNFQQVPVQNQESNLETLIKE---

Query:  -------------------------YMARNDCAFTNKEDKIERT-----------------------KEH------------EALMEKEEKSKLEKRQGN
                                 YM RND      E ++ +                        KEH            E     +E S    R+ +
Subjt:  -------------------------YMARNDCAFTNKEDKIERT-----------------------KEH------------EALMEKEEKSKLEKRQGN

Query:  ATAKPQKFVIDP-----------DYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLV
          A P K +++P           + +PPPP+PQR    +QD   +KFLD+LKQLHINIP VEALEQMPTY KF+KDI+++K++LGEYETVALTECSS + 
Subjt:  ATAKPQKFVIDP-----------DYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLV

Query:  KNDIPPKLK
        K+ +PPKLK
Subjt:  KNDIPPKLK

A0A6J1E1F3 uncharacterized protein LOC1110250651.6e-6338.14Show/hide
Query:  MFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFKG-------------------DAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQ
        MFQML T+ QF G   EDPH+H+  F+ V NSFK                    +A TWL+S P  SITSW+DLAEKFL K+FP +KNAKY++EI +F+Q
Subjt:  MFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFKG-------------------DAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQ

Query:  SYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQAS-----KALVNASANG-----------------------SFLKKS--------------
           E +  +WE F+RL+Q CPHHG P CI +E +Y  L+ A+     +A+   S+ G                       S +++S              
Subjt:  SYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQAS-----KALVNASANG-----------------------SFLKKS--------------

Query:  -------------------GPQ-----GSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNT-------HQFGKQPQKNFQQVPVQNQES-----NL
                            P+     G+   NR N Y  TYNPGWR HPNFSW G  G  N+ T       H+    P    Q   V  ++S     +L
Subjt:  -------------------GPQ-----GSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNT-------HQFGKQPQKNFQQVPVQNQES-----NL

Query:  ETLIKEYMARNDCAFTNKEDKIERTKEHEALMEKEEKSKLEKRQGNATAKPQKFVIDPDYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALE
        E L+K+YMA ND    ++   +   K     +  + KSK   R      + +      +Y P PPYP+R +   ++VQ  KFLDVLKQLH+NIPLVEALE
Subjt:  ETLIKEYMARNDCAFTNKEDKIERTKEHEALMEKEEKSKLEKRQGNATAKPQKFVIDPDYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALE

Query:  QMPTYVKFLKDILSKKRRLGEYETVALTEC
        QMP YV+FLK+IL KKR LGEY+T+ L  C
Subjt:  QMPTYVKFLKDILSKKRRLGEYETVALTEC

A0A6J1EQ90 uncharacterized protein LOC1114364111.3e-7333.51Show/hide
Query:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSF------------------------
        NPI++A+DR RAIR YA P  + LNP I+ P I    FELKPVMFQML T+GQF GLP EDPH H+  FL V +SF                        
Subjt:  NPIYIAEDRNRAIRDYALPVFQTLNPGILEPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSF------------------------

Query:  --KGDAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASA
          +  A++WL++  P +I SWN LAE FL K+FP  +NA++K EI++F+Q  +E L  A ERF+ +++KCPHHG P CI +E FY+GL+  +K +V+ASA
Subjt:  --KGDAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQRLVQKCPHHGFPACIILEHFYSGLDQASKALVNASA

Query:  NGSFLKKS--------------------------------------------------------------------------------------------
        NG+ L K+                                                                                            
Subjt:  NGSFLKKS--------------------------------------------------------------------------------------------

Query:  -----------GPQGSQNQNRWNPYLATYNPGWRQHPNFSWGGQA--------GSSNSNTHQFGKQPQKNFQQVPVQNQ---------ESNLETLIKEYM
                   G Q SQ   + NP+  TYNPGWR HPNFSW GQ+         ++  +  +   Q   + QQV  Q +         E+++E+LIKEYM
Subjt:  -----------GPQGSQNQNRWNPYLATYNPGWRQHPNFSWGGQA--------GSSNSNTHQFGKQPQKNFQQVPVQNQ---------ESNLETLIKEYM

Query:  ARNDCAFTNKEDKIE-------------------------RTKEHEALMEKE------EKSKLEKRQGNATAKPQKFVIDPDYKPPPPYPQRFKHASQDV
        A+ND    +++  +                          + +  EA ++KE      E  +  K Q  A+++ +       Y P PP+PQR K   ++ 
Subjt:  ARNDCAFTNKEDKIE-------------------------RTKEHEALMEKE------EKSKLEKRQGNATAKPQKFVIDPDYKPPPPYPQRFKHASQDV

Query:  QLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK
          +KF+D+LK++HINIPLVEAL+QMP YVKFLKD+L  +R+  E++ V+L E  S ++KN IP K K
Subjt:  QLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDILSKKRRLGEYETVALTECSSTLVKNDIPPKLK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.1e-0524.05Show/hide
Query:  ILWSMWNNRNTEVFQGKRSYLHLSYDPIDWASVYILEFMETQAKQHHEDHRDRERLRDAEDSRLTAWIPPPYNSFKLNVDAA--LANGEMGSGMIIRNSK
        ++W +W + N  VF   R+    +   ++ A     E+++       ++       R+A+ SR T W PP  +  K N DA+    N   G G I+RNS+
Subjt:  ILWSMWNNRNTEVFQGKRSYLHLSYDPIDWASVYILEFMETQAKQHHEDHRDRERLRDAEDSRLTAWIPPPYNSFKLNVDAA--LANGEMGSGMIIRNSK

Query:  GESMATAKYFRRTSCSVEWAEAQALVDGIQLAMESGLSPIWAETDSKIVWNLIQDRDN
        G  +       +   + E AE   L+  IQ +   G   +  E D++ +  +I  + +
Subjt:  GESMATAKYFRRTSCSVEWAEAQALVDGIQLAMESGLSPIWAETDSKIVWNLIQDRDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGAGCCACCGAAAAGGAAGGATCCCTCTCATTCAGATTTGGCCTCGGTTCATTCATGTACCCCTCCGAACTCGGGTGTTACAATTGCCATGGAGCCACCTTTTGA
AGGCCCTCTCGGTGCACAGCAAAGAGGAGCACAACCCAATAATGTGCAGAAACAAGCTGCAGGAGCACATAATCCTATTCTTATTGCAGATGAAAGGGATAAAGCCATTA
GGGATTATTCGACCGTAGCTTTGTATAATCTCAATACCGAAATTGTTGAGCCAGTCATTGATGCAAATCATTTCGAGCTCAAACCAATCATGTTCCAAATGCTTCAGACT
ATATGCCAATTTTCAAGACTTCCTAGTGAAGATCCTCACAAGCACATGAAGAAATTCTTGAGAATATGCAACTCGTTCAAGCTTAATGGAGTATCCAATGAAACTTTGCG
TTTGAAAATTTTTCAATTTTCTTTGCAGGAGCATTTTTACAACGGATTGACTCAAGCTTCCAAAGCAATGGTCAATGTATCAGCTAATGGTTCGTTGCTTAAAAATACTT
TCAATAAGGTAAACAAGATTCTTGACGCAATTGTTTCAAACAATAGTCAATGTGGAGAAGAGAAAACACCATCTAGAAGCAGTGTCAAGGTTCAAGAAGTCAATCATCTA
GAGACACTATCGGAGCAGATGTTAACTATGAATGATATGCTTATGAGTCTTACTTTAGGAAATCAAGTTCAAGTTGCAGCCTGTATACTTGAACTCGAGCAGGGTGTTGC
ACTTGCAACGCGCACATGCACTCGTCAGGTCTTGAATATGGGGCTCCACCCTTTCATTGACTCGAGCGGGACTCGGTTTGGTGGTTGGACCACAAACCAATTGTTCATTA
AAGGATTTGTGGGACTTAAGGAACGAGAGTCCTATGTGGGAACTCAGTTCATGCTTGGAGGAGAAGGCGAGGAAGAAAACGATCCGTTCATGCTGGAGGAGAAGACAAGA
ACTCAGCCGGGCCTTGTAACGAGGGCCTGTGACCCATGTGGGGGCATTAAGGAGGATCTTACTGCTAATGATTTGGCAACCTTTGTGGCAATCCTATGGTCTATGTGGAA
CAACAGAAATACAGAGGTTTTTCAGGGCAAACGAAGCTATCTCCATCTGAGCTACGACCCTATTGATTGGGCATCTGTGTACATATTAGAGTTCATGGAAACCCAAGCTA
AGCAGCATCATGAAGACCATCGAGATCGAGAGCGTCTGAGAGATGCTGAAGATTCCAGGTTGACTGCGTGGATTCCGCCACCCTATAACAGTTTCAAGCTGAATGTGGAT
GCAGCACTAGCAAATGGAGAGATGGGATCAGGCATGATCATCAGAAATTCAAAGGGAGAATCGATGGCGACTGCAAAATATTTCAGGAGAACGAGTTGCTCAGTAGAGTG
GGCAGAAGCACAGGCTCTTGTAGACGGTATTCAGCTAGCGATGGAGTCGGGGCTATCACCAATCTGGGCAGAGACGGATTCCAAAATCGTATGGAATCTAATTCAGGACC
GCGACAATTATCATAATGAGATTGCTCCTCTAATTCACCATCTCAAGCTTTTGGGCACAACAAAGCATATCAGCGGGTTTTTACTAACAAAAAGGGACGACAATAAAGTA
GCGCACCAACTGGCTACTCATGCTCGCACAACAAAGCATTCTGAAGTTTGGTTGGAAGACCATCCTTCTTGGATTCACAACCTTGTTGCTAAAGAAATGTCACAAGTGGA
CGTTTCGACCTTTTCTTCTGTTTCTGGTTCCTTGCATCAAGCAGCCATTTGGAGTCTGAGAGGAGAGTTTAGAAAGTGGAAGGTTTTGGGCATTCTTAAGGGAGGATCGG
GAGCTTGTTCGGGTTCCGATCGAGGCTTAGATCGTGGGGGATTGAGGGCAAAAACAGAGGAAAAGCTGGAATTTCCCAGAAATGCGACCGCATTTCTGGAAAAACTGAAG
CCGTTCCGAGTCATCCGCGGAGATAGAAAGACTCTTTCGTCGTCGAAGGAACAAAGACGAAAGAAAAAGGAACAACAAGAGTTGAGCGCACGAGAATCTCTAGAGGAAGC
ATCTTACATTCAAGAGTTTCTAATGGAACCTCCTGGAGTCGATCCTCACGTTGATCCACAAGATCGTGGAAGGGAGCAGAATGGCGGGAGAATTCCTCCTGTTCCTCCAG
TTCCACCGGTGCCACAACAAGAAAATCCGATCTACATTGCTGAAGATCGCAATAGAGCAATAAGGGACTACGCTTTACCAGTTTTCCAAACATTAAATCCAGGAATTTTG
GAGCCACCGATTGGTGCTCCACAATTTGAACTTAAACCAGTGATGTTCCAGATGTTGAATACAATGGGCCAATTCTCTGGACTTCCAAACGAAGATCCACACAAACATAT
TAACCTATTTTTGAGAGTTTACAATTCTTTTAAAGGAGATGCTGAAACCTGGTTGGATTCATTTCCTCCAAACTCCATCACCTCTTGGAATGATTTGGCAGAGAAATTTT
TGGAGAAGTTTTTTCCCTCCAATAAAAATGCCAAGTACAAAGCAGAAATTATTTCTTTCAGACAATCTTACAATGAACCTTTGGATGTAGCTTGGGAAAGATTTCAAAGG
CTGGTTCAGAAGTGTCCACACCATGGATTCCCTGCTTGCATTATTTTGGAGCATTTTTATAGTGGATTAGATCAAGCTTCAAAGGCACTAGTAAATGCATCTGCAAACGG
ATCTTTTCTGAAGAAGTCTGGTCCACAAGGAAGTCAGAACCAGAATAGATGGAATCCTTATTTGGCCACGTACAACCCAGGATGGAGACAACATCCGAATTTTTCTTGGG
GAGGACAAGCCGGATCAAGTAATTCCAACACTCATCAGTTTGGGAAGCAGCCGCAGAAAAACTTCCAACAAGTTCCAGTGCAGAATCAAGAGTCAAATCTAGAGACTCTT
ATAAAGGAGTACATGGCAAGGAACGATTGTGCATTTACAAATAAAGAAGATAAAATAGAAAGGACCAAGGAGCATGAAGCCTTGATGGAAAAAGAAGAAAAATCAAAATT
GGAGAAAAGACAGGGAAATGCGACCGCAAAACCCCAAAAATTTGTGATAGATCCAGATTACAAACCACCTCCTCCGTACCCTCAGAGATTCAAACATGCCTCACAAGATG
TACAGCTCAAGAAATTTTTAGATGTATTGAAGCAGTTGCATATCAACATACCATTAGTGGAGGCACTTGAACAAATGCCTACCTATGTGAAGTTCCTGAAAGATATCCTA
TCAAAAAAGAGAAGGTTGGGAGAATACGAAACGGTTGCACTCACGGAATGTTCCAGTACACTGGTGAAGAACGATATCCCTCCCAAGCTTAAANNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGATCCAGGAAGCTTCACCATTCCATGC
TCCATCGGAGGCAGGGATGTAGGCAGAGCTCTATGCGATTTTGGAGCAAACATCAACTTAATGCCATATTCGGTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATTGAGCCACCGAAAAGGAAGGATCCCTCTCATTCAGATTTGGCCTCGGTTCATTCATGTACCCCTCCGAACTCGGGTGTTACAATTGCCATGGAGCCACCTTTTGA
AGGCCCTCTCGGTGCACAGCAAAGAGGAGCACAACCCAATAATGTGCAGAAACAAGCTGCAGGAGCACATAATCCTATTCTTATTGCAGATGAAAGGGATAAAGCCATTA
GGGATTATTCGACCGTAGCTTTGTATAATCTCAATACCGAAATTGTTGAGCCAGTCATTGATGCAAATCATTTCGAGCTCAAACCAATCATGTTCCAAATGCTTCAGACT
ATATGCCAATTTTCAAGACTTCCTAGTGAAGATCCTCACAAGCACATGAAGAAATTCTTGAGAATATGCAACTCGTTCAAGCTTAATGGAGTATCCAATGAAACTTTGCG
TTTGAAAATTTTTCAATTTTCTTTGCAGGAGCATTTTTACAACGGATTGACTCAAGCTTCCAAAGCAATGGTCAATGTATCAGCTAATGGTTCGTTGCTTAAAAATACTT
TCAATAAGGTAAACAAGATTCTTGACGCAATTGTTTCAAACAATAGTCAATGTGGAGAAGAGAAAACACCATCTAGAAGCAGTGTCAAGGTTCAAGAAGTCAATCATCTA
GAGACACTATCGGAGCAGATGTTAACTATGAATGATATGCTTATGAGTCTTACTTTAGGAAATCAAGTTCAAGTTGCAGCCTGTATACTTGAACTCGAGCAGGGTGTTGC
ACTTGCAACGCGCACATGCACTCGTCAGGTCTTGAATATGGGGCTCCACCCTTTCATTGACTCGAGCGGGACTCGGTTTGGTGGTTGGACCACAAACCAATTGTTCATTA
AAGGATTTGTGGGACTTAAGGAACGAGAGTCCTATGTGGGAACTCAGTTCATGCTTGGAGGAGAAGGCGAGGAAGAAAACGATCCGTTCATGCTGGAGGAGAAGACAAGA
ACTCAGCCGGGCCTTGTAACGAGGGCCTGTGACCCATGTGGGGGCATTAAGGAGGATCTTACTGCTAATGATTTGGCAACCTTTGTGGCAATCCTATGGTCTATGTGGAA
CAACAGAAATACAGAGGTTTTTCAGGGCAAACGAAGCTATCTCCATCTGAGCTACGACCCTATTGATTGGGCATCTGTGTACATATTAGAGTTCATGGAAACCCAAGCTA
AGCAGCATCATGAAGACCATCGAGATCGAGAGCGTCTGAGAGATGCTGAAGATTCCAGGTTGACTGCGTGGATTCCGCCACCCTATAACAGTTTCAAGCTGAATGTGGAT
GCAGCACTAGCAAATGGAGAGATGGGATCAGGCATGATCATCAGAAATTCAAAGGGAGAATCGATGGCGACTGCAAAATATTTCAGGAGAACGAGTTGCTCAGTAGAGTG
GGCAGAAGCACAGGCTCTTGTAGACGGTATTCAGCTAGCGATGGAGTCGGGGCTATCACCAATCTGGGCAGAGACGGATTCCAAAATCGTATGGAATCTAATTCAGGACC
GCGACAATTATCATAATGAGATTGCTCCTCTAATTCACCATCTCAAGCTTTTGGGCACAACAAAGCATATCAGCGGGTTTTTACTAACAAAAAGGGACGACAATAAAGTA
GCGCACCAACTGGCTACTCATGCTCGCACAACAAAGCATTCTGAAGTTTGGTTGGAAGACCATCCTTCTTGGATTCACAACCTTGTTGCTAAAGAAATGTCACAAGTGGA
CGTTTCGACCTTTTCTTCTGTTTCTGGTTCCTTGCATCAAGCAGCCATTTGGAGTCTGAGAGGAGAGTTTAGAAAGTGGAAGGTTTTGGGCATTCTTAAGGGAGGATCGG
GAGCTTGTTCGGGTTCCGATCGAGGCTTAGATCGTGGGGGATTGAGGGCAAAAACAGAGGAAAAGCTGGAATTTCCCAGAAATGCGACCGCATTTCTGGAAAAACTGAAG
CCGTTCCGAGTCATCCGCGGAGATAGAAAGACTCTTTCGTCGTCGAAGGAACAAAGACGAAAGAAAAAGGAACAACAAGAGTTGAGCGCACGAGAATCTCTAGAGGAAGC
ATCTTACATTCAAGAGTTTCTAATGGAACCTCCTGGAGTCGATCCTCACGTTGATCCACAAGATCGTGGAAGGGAGCAGAATGGCGGGAGAATTCCTCCTGTTCCTCCAG
TTCCACCGGTGCCACAACAAGAAAATCCGATCTACATTGCTGAAGATCGCAATAGAGCAATAAGGGACTACGCTTTACCAGTTTTCCAAACATTAAATCCAGGAATTTTG
GAGCCACCGATTGGTGCTCCACAATTTGAACTTAAACCAGTGATGTTCCAGATGTTGAATACAATGGGCCAATTCTCTGGACTTCCAAACGAAGATCCACACAAACATAT
TAACCTATTTTTGAGAGTTTACAATTCTTTTAAAGGAGATGCTGAAACCTGGTTGGATTCATTTCCTCCAAACTCCATCACCTCTTGGAATGATTTGGCAGAGAAATTTT
TGGAGAAGTTTTTTCCCTCCAATAAAAATGCCAAGTACAAAGCAGAAATTATTTCTTTCAGACAATCTTACAATGAACCTTTGGATGTAGCTTGGGAAAGATTTCAAAGG
CTGGTTCAGAAGTGTCCACACCATGGATTCCCTGCTTGCATTATTTTGGAGCATTTTTATAGTGGATTAGATCAAGCTTCAAAGGCACTAGTAAATGCATCTGCAAACGG
ATCTTTTCTGAAGAAGTCTGGTCCACAAGGAAGTCAGAACCAGAATAGATGGAATCCTTATTTGGCCACGTACAACCCAGGATGGAGACAACATCCGAATTTTTCTTGGG
GAGGACAAGCCGGATCAAGTAATTCCAACACTCATCAGTTTGGGAAGCAGCCGCAGAAAAACTTCCAACAAGTTCCAGTGCAGAATCAAGAGTCAAATCTAGAGACTCTT
ATAAAGGAGTACATGGCAAGGAACGATTGTGCATTTACAAATAAAGAAGATAAAATAGAAAGGACCAAGGAGCATGAAGCCTTGATGGAAAAAGAAGAAAAATCAAAATT
GGAGAAAAGACAGGGAAATGCGACCGCAAAACCCCAAAAATTTGTGATAGATCCAGATTACAAACCACCTCCTCCGTACCCTCAGAGATTCAAACATGCCTCACAAGATG
TACAGCTCAAGAAATTTTTAGATGTATTGAAGCAGTTGCATATCAACATACCATTAGTGGAGGCACTTGAACAAATGCCTACCTATGTGAAGTTCCTGAAAGATATCCTA
TCAAAAAAGAGAAGGTTGGGAGAATACGAAACGGTTGCACTCACGGAATGTTCCAGTACACTGGTGAAGAACGATATCCCTCCCAAGCTTAAANNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGATCCAGGAAGCTTCACCATTCCATGC
TCCATCGGAGGCAGGGATGTAGGCAGAGCTCTATGCGATTTTGGAGCAAACATCAACTTAATGCCATATTCGGTTTTTAA
Protein sequenceShow/hide protein sequence
MIEPPKRKDPSHSDLASVHSCTPPNSGVTIAMEPPFEGPLGAQQRGAQPNNVQKQAAGAHNPILIADERDKAIRDYSTVALYNLNTEIVEPVIDANHFELKPIMFQMLQT
ICQFSRLPSEDPHKHMKKFLRICNSFKLNGVSNETLRLKIFQFSLQEHFYNGLTQASKAMVNVSANGSLLKNTFNKVNKILDAIVSNNSQCGEEKTPSRSSVKVQEVNHL
ETLSEQMLTMNDMLMSLTLGNQVQVAACILELEQGVALATRTCTRQVLNMGLHPFIDSSGTRFGGWTTNQLFIKGFVGLKERESYVGTQFMLGGEGEEENDPFMLEEKTR
TQPGLVTRACDPCGGIKEDLTANDLATFVAILWSMWNNRNTEVFQGKRSYLHLSYDPIDWASVYILEFMETQAKQHHEDHRDRERLRDAEDSRLTAWIPPPYNSFKLNVD
AALANGEMGSGMIIRNSKGESMATAKYFRRTSCSVEWAEAQALVDGIQLAMESGLSPIWAETDSKIVWNLIQDRDNYHNEIAPLIHHLKLLGTTKHISGFLLTKRDDNKV
AHQLATHARTTKHSEVWLEDHPSWIHNLVAKEMSQVDVSTFSSVSGSLHQAAIWSLRGEFRKWKVLGILKGGSGACSGSDRGLDRGGLRAKTEEKLEFPRNATAFLEKLK
PFRVIRGDRKTLSSSKEQRRKKKEQQELSARESLEEASYIQEFLMEPPGVDPHVDPQDRGREQNGGRIPPVPPVPPVPQQENPIYIAEDRNRAIRDYALPVFQTLNPGIL
EPPIGAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHINLFLRVYNSFKGDAETWLDSFPPNSITSWNDLAEKFLEKFFPSNKNAKYKAEIISFRQSYNEPLDVAWERFQR
LVQKCPHHGFPACIILEHFYSGLDQASKALVNASANGSFLKKSGPQGSQNQNRWNPYLATYNPGWRQHPNFSWGGQAGSSNSNTHQFGKQPQKNFQQVPVQNQESNLETL
IKEYMARNDCAFTNKEDKIERTKEHEALMEKEEKSKLEKRQGNATAKPQKFVIDPDYKPPPPYPQRFKHASQDVQLKKFLDVLKQLHINIPLVEALEQMPTYVKFLKDIL
SKKRRLGEYETVALTECSSTLVKNDIPPKLKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSRKLHHSMLHRRQGCRQSSMRFWSKHQLNAIFGF