; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040812 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040812
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRibonuclease H
Genome locationchr13:8584167..8588932
RNA-Seq ExpressionLag0040812
SyntenyLag0040812
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GFS31826.1 hypothetical protein Acr_00g0019390 [Actinidia rufa]6.2e-7037.07Show/hide
Query:  VEKRRSAKTAESEAKAVEAEARAAEAEARAAEA-EAKKDDLP-WKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVP
        V  RR+         A EA++ A +++A+   A E   +  P W  ++     +L    G     K  G  ++EEL ++ D PFT  VM+  +P KFK+P
Subjt:  VEKRRSAKTAESEAKAVEAEARAAEAEARAAEA-EAKKDDLP-WKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVP

Query:  TFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITR
        T + +DG KDP+ HL  Y++ MD   V D I CRAF  TL GS R WF RL   +IS F +L++ F+  ++G + Q+KP  +L  VKQ+  E+LR+++T 
Subjt:  TFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITR

Query:  FNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDIKRTK-----------GS
        FN E L V+   +  A+ A  AG+   + L S+ K  P+T A  + RAQKYM+AE+ LK++       S  + KR      D K TK             
Subjt:  FNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDIKRTK-----------GS

Query:  GPTKEAGAEQTM----AEADQTMAEADQTKEHLLSVSLRNTPQLLFR------------KSKNCIQLKDEIEALIQNGYLKEFV--GEPKAEADHG---W
          TK A   Q         DQ + E  +         +R+ P+   +            ++++C  LK ++E LI+ G L  F+  G P  + DH     
Subjt:  GPTKEAGAEQTM----AEADQTMAEADQTKEHLLSVSLRNTPQLLFR------------KSKNCIQLKDEIEALIQNGYLKEFV--GEPKAEADHG---W

Query:  PRPSLTKDGRDKEEPLREIRTIFGGPAGGG-SSRKRKAMAREARSEPEYRAFKAMRLGMELLKPS--LTPLVGFGRERVTPEGSIELPVTFGDGQNAVTK
         R SL K+ +  ++P+ EI+ I GG AGGG SS  RK  ARE R++ E      M +G    +P    + LVGF  +RVTP GSI LPVT GD     TK
Subjt:  PRPSLTKDGRDKEEPLREIRTIFGGPAGGG-SSRKRKAMAREARSEPEYRAFKAMRLGMELLKPS--LTPLVGFGRERVTPEGSIELPVTFGDGQNAVTK

Query:  MINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKE
        M+ FLVVDC SAYN I+GR  L+E KAI STYH  +KFPT  GVG  K   +E
Subjt:  MINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKE

XP_022158830.1 uncharacterized protein LOC111025293 [Momordica charantia]2.2e-7535.57Show/hide
Query:  DRHQRRSRDDDSIRGSPRQAGRAEDADIKIAALEDEVKGMNRSLSRIL---QILDKPDPSTK---------VHERSLIRDPRKGKEPMEHTAESGTRSRG
        +R  RRS DD S +       R    D ++   ED     N+     L     L  P+P  K           E  L+RDP+KGK P E   E  T S G
Subjt:  DRHQRRSRDDDSIRGSPRQAGRAEDADIKIAALEDEVKGMNRSLSRIL---QILDKPDPSTK---------VHERSLIRDPRKGKEPMEHTAESGTRSRG

Query:  KKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLN
         K      ++ G       I   R++     + H    P+ G +  D RN             SE  +++               + K  D P  +E  +
Subjt:  KKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLN

Query:  ALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFER
        + KE G               +LEEL DQ D PFTEE+M+ +VP KFK+PT KQ+D   DPV HL+AYR WMD +GVS+A+RCR F  TL GSAR WF +
Subjt:  ALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFER

Query:  LKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQK
        LKR SIS FK LA+AF+ QF+G R + +P   LLT+KQ+  ESLRDY+ RFN+E LQVEG ++  +L+A  +G+ DE L  S GK  P T++E +SRAQ+
Subjt:  LKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQK

Query:  YMSAEELLKSKRS----------EREYKRFSSSSYDIKRTKGS---GPTKEAGAEQTMAEADQTMAEADQTK-----EHLLSVSLRNTP--QLLFRK---
        YMSA E   SKR           ER   +   S ++ KR + S    P K      T    +Q + E    +     E + + S + +     LF +   
Subjt:  YMSAEELLKSKRS----------EREYKRFSSSSYDIKRTKGS---GPTKEAGAEQTMAEADQTMAEADQTK-----EHLLSVSLRNTP--QLLFRK---

Query:  --SKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE----------------
          +++C  LK+E+E LI+ GYLKE+V EPKA           T++G   + P REIRTI GGP    S RKRKA  REAR+  E                
Subjt:  --SKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE----------------

Query:  --------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFG
                                                    + A+KAM L  ++LK S  PLVGFG ERV PEG IELPVTFG
Subjt:  --------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFG

XP_024041095.1 uncharacterized protein LOC112098853 [Citrus clementina]8.7e-8035.29Show/hide
Query:  LEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMG
        L+++  + +PPFT ++M A+ P +F +P  + YDG++DP +HL  YR+ M+  G S AI CRAF  TL G+AR WF RL+  SIS F +L++ F + F  
Subjt:  LEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMG

Query:  AREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSER-EYKRFS
        AR++ KP   LLTVKQQ GE+LRDYI R+N+E  QV+GY +G AL  I  GL   +L  S+ K  P +Y+E ++RA+KY +AEE  K++  E+ E  +  
Subjt:  AREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSER-EYKRFS

Query:  SSSYDIKRTKGSGPTKEAGAEQ--------------TMAEADQTMAEADQTKEHLL----SVSLRNTPQLL--------------FRK-----SKNCIQL
            D +  +   P +     Q               +A       E +  +E +L    + +L  TP  +              F K     +  C +L
Subjt:  SSSYDIKRTKGSGPTKEAGAEQ--------------TMAEADQTMAEADQTKEHLL----SVSLRNTPQLL--------------FRK-----SKNCIQL

Query:  KDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKE---EPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE----------------------
        K++IE+L++ G L+E+V  P+       P  S  + G+ KE   E + ++  I+GGPA G S + RK +AR+AR EP                       
Subjt:  KDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKE---EPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE----------------------

Query:  ------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVD
                                                     F  M L    LKP  TPL GF    V PEG IEL V+FG     VT M+NF+VVD
Subjt:  ------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVD

Query:  CMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTALK---------AVKPQRAEADQRMS
          S+YNA+LGRPTL+ +KA  S YH  LKFPT  GVGVV+GEQKE+RECY  A +         ++ P+  + DQR S
Subjt:  CMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTALK---------AVKPQRAEADQRMS

XP_024047974.1 uncharacterized protein LOC112101548 [Citrus clementina]6.0e-7333.97Show/hide
Query:  EELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGA
        +++  + +PPFT+E+M+A  P  F++P+ + YDG+K P++H+  YRS M+  GVS AI CRAF  TL+ +AR WF  L+  SIS F EL + F   F  A
Subjt:  EELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGA

Query:  REQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRS-EREYKRFSS
        R++ KP   LLTVKQ  GESLR+YI R+N E  QV+GY +G AL  +  GL+  RL  S+ K+ P TY+E +SRA+KY +AEE  +SK+  E+     + 
Subjt:  REQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRS-EREYKRFSS

Query:  SSYDIKRTKGSGPTKEAGAEQTMAEADQ---------------TMAEADQTKEHLL-----SVSLRNTPQLLFRKSK------------------NCIQL
         + + +R    G   +A  +      DQ                  E +  +EH+L     S   +  P L   +S+                   C  L
Subjt:  SSYDIKRTKGSGPTKEAGAEQTMAEADQ---------------TMAEADQTKEHLL-----SVSLRNTPQLLFRKSK------------------NCIQL

Query:  KDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKE----EPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE---------------------
        K++IE+L++   L+ +V    +  +   P  S    G+ KE    E +  +  I+GGP  G S + RK +AR+AR EP                      
Subjt:  KDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKE----EPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE---------------------

Query:  -------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVV
                                                     AF  M L    L P  T L GF    V PEG IEL V+FG     VT M+ F+VV
Subjt:  -------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVV

Query:  DCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTALK---------AVKPQRAEADQR
        D  SAYN++LGRPTL+ +KA  S YH  LKFPT   +GVV+G QKE+RECY  + K         ++ P+    DQR
Subjt:  DCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTALK---------AVKPQRAEADQR

XP_030963320.1 uncharacterized protein LOC115984434 [Quercus lobata]2.9e-6733.82Show/hide
Query:  LEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMG
        +E+L  + D PF   +    +P KFK+P+   YDG +DP  H+  +++ M   GV D I CRAF  TL G AR WF ++   +++ F+EL++ F+  F+G
Subjt:  LEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMG

Query:  AREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSS
         +  ++   +LLT++Q+  ESLR +ITRFN EAL V+   +   L A   G+  +  ++ + + +P+T AE V  AQ +M+AE+ + +K+     KR   
Subjt:  AREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSS

Query:  SSYDIKRTKGSGPTKEAGAEQTMAEADQTMAEADQTKEHL--------LSVSLRNTPQLLF------------------------RKSKNCIQLKDEIEA
           +  R    GP  + G  +   + D+    + +++++         + + +++ P L +                          +  C  LK +IE 
Subjt:  SSYDIKRTKGSGPTKEAGAEQTMAEADQTMAEADQTKEHL--------LSVSLRNTPQLLF------------------------RKSKNCIQLKDEIEA

Query:  LIQNGYLKEFVG-EPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMAREARSEPEYRAFKAMRLGMELLKPSLTPLVGFGRERVT
        LI+ G L+ F+G + K E   G       K       PL EIR I GG +   SSR+       +     + AF+ M+LG + L+P  +PLVGFG  +V 
Subjt:  LIQNGYLKEFVG-EPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMAREARSEPEYRAFKAMRLGMELLKPSLTPLVGFGRERVT

Query:  PEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL
        P G++ LPV  G     VTK ++FLVVDC S+YNAI+GRPTL+  K + STYH  +KFPT  GVG V+G+Q  +RECY   L
Subjt:  PEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL

TrEMBL top hitse value%identityAlignment
A0A2N9IGX8 Ribonuclease H2.6e-6932.35Show/hide
Query:  GTRSRGKKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPW
        G RS  ++ DS     R     +RT  RSR  +   G   + + PS   T    R+      +  +  +  +K +E E R      R    E    DL  
Subjt:  GTRSRGKKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPW

Query:  KT-ELLNALKELGNPQGDQQRS-KNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTG
        KT  L   L+E+    GD + S +    +NL+ L  + D PF   +    +P +FKVP  + +DG KDP  +L A++  M    VS+ + CRAF   L G
Subjt:  KT-ELLNALKELGNPQGDQQRS-KNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTG

Query:  SARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYA
        SAR WF +L+  SI  F +L++AF+  F+G++ + +P  +LL+VKQ  GESLR ++ RFN+EA++++   E   + A  AGL     L  + K  P T +
Subjt:  SARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYA

Query:  EFVSRAQKYMSAEELLKS-------KRSEREYKRFSSSSYDI---------KRTKGSGPTKEAGAEQTMAEADQTMAEADQTKEHLLSVSLRNTPQ----
        E +  A K+M+AE+ L++       +R E E ++   +   +         KRT  + P K +         D+ + +            +R+ P     
Subjt:  EFVSRAQKYMSAEELLKS-------KRSEREYKRFSSSSYDI---------KRTKGSGPTKEAGAEQTMAEADQTMAEADQTKEHLLSVSLRNTPQ----

Query:  ---LLFRKS-----KNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEE-----PLREIRTIFGGPAGGGSSR-KRKAMAR-EARSE
             F +      + C+ LK++IE LI+ G L+++V  P         RP+     R++ E     P+ EIRTI GGPA GG+SR  RKA AR +AR  
Subjt:  ---LLFRKS-----KNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEE-----PLREIRTIFGGPAGGGSSR-KRKAMAR-EARSE

Query:  PE--------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAI
         +                                  A++ MRL  + L+P   PLVGF  ++V P G + LP+T G     V+K ++FLVV+C SAYNAI
Subjt:  PE--------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAI

Query:  LGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL
        +GRPT + ++A+ STYH LLKFPT +G+G V+G+Q  +RECY  +L
Subjt:  LGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL

A0A2N9IPA1 Ribonuclease H5.1e-7033.39Show/hide
Query:  RHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVD
        R+ T    S  H + D       + R        ++  +    AA+A A   E E K        E+   +KE+      + + K    +NL+ L  + +
Subjt:  RHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVD

Query:  PPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHI
         PFT+ + +  +P KFKVP  + +DG KDP+ +L+++R+ M  HGVSD I CR F   L GSAR WF +L+  SI  F +L++AF+  F+G R   +P  
Subjt:  PPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHI

Query:  NLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDIKRTK
         LL ++Q+ GESLR Y+ RFN EA+Q++  +E  AL A  AGL     L  + K  P++ +E +  AQK+++AE+  ++ R E++Y +    +      +
Subjt:  NLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDIKRTK

Query:  GSGPTKEAGAEQTMAEADQTMAEADQTKEHLLSVSLRNTP-----QLLFR-------KSKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTK
           P+       ++   DQ + +     +      LR+ P      L  R        +++C  LK +IEALI+ G L +FV + K E  H   RP    
Subjt:  GSGPTKEAGAEQTMAEADQTMAEADQTKEHLLSVSLRNTP-----QLLFR-------KSKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTK

Query:  DGRDKEEP-----LREIRTIFGGPAGGGSSR-KRKAMAREA-------RSEPEYR-------------------------AFKAMRLGMELLKPSLTPLV
        + +D++E      + EIRTI GG A GG+SR  RKA AR+A       RS    +                         A++ M++  E LKP   PLV
Subjt:  DGRDKEEP-----LREIRTIFGGPAGGGSSR-KRKAMAREA-------RSEPEYR-------------------------AFKAMRLGMELLKPSLTPLV

Query:  GFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL
        GF  ++V P G + L +  G     V   + FLVVDC SAYN I+GRPTL++++A+ STYH L++FPT +G+G +KG+Q  +RECY+ +L
Subjt:  GFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL

A0A2N9J5E0 Reverse transcriptase6.1e-7134.58Show/hide
Query:  AEARAAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGV
        A+ARAA+ E          E L  ++E       Q ++K    +NL+ L  + + PFT+ + +  +P KFKVP  + +DG KDP+ +L+++R+ M  HGV
Subjt:  AEARAAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGV

Query:  SDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDE
        SD I CRAF   L GSAR WF +L+  SI  F +L++AF+  F+G R   +P   LL V+Q+ GESLR Y+ RFN EA+Q++  +E  AL A  AGL   
Subjt:  SDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDE

Query:  RLLNSIGKSQPRTYAEFVSRAQKYMSAE-------ELLKSKRSEREYKRF-----SSSSYDIKRTKGSGPTKEAGAEQTMA-------EADQTMAEADQT
          L  + K  P++ +E +  AQK+++AE       E L  KR E E +RF      SS  D  R +    +     E+  +         DQ + +    
Subjt:  RLLNSIGKSQPRTYAEFVSRAQKYMSAE-------ELLKSKRSEREYKRF-----SSSSYDIKRTKGSGPTKEAGAEQTMA-------EADQTMAEADQT

Query:  KEHLLSVSLRNTP-----QLLFR-------KSKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEP-----LREIRTIFGGPAGG
         +      LR+ P      L  R        +++C  LK +IEALI+ G L +FV + K E  H  PRP    + +D++E      + EIRTI GG A G
Subjt:  KEHLLSVSLRNTP-----QLLFR-------KSKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEP-----LREIRTIFGGPAGG

Query:  GSSR-KRKAMAREAR---------------------SEPEYR--------------------------------------AFKAMRLGMELLKPSLTPLV
        G+SR  RKA AR+A                      SE + R                                      A++ M++  E LKP   PLV
Subjt:  GSSR-KRKAMAREAR---------------------SEPEYR--------------------------------------AFKAMRLGMELLKPSLTPLV

Query:  GFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL
        GF  ++V P G + L +  G     V   + FLVVDC SAYN I+GRPTL++++A+ STYH L++FPT +G+G +KG+Q  +RECY+ +L
Subjt:  GFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKESRECYWTAL

A0A6J1DWY0 uncharacterized protein LOC1110252931.1e-7535.57Show/hide
Query:  DRHQRRSRDDDSIRGSPRQAGRAEDADIKIAALEDEVKGMNRSLSRIL---QILDKPDPSTK---------VHERSLIRDPRKGKEPMEHTAESGTRSRG
        +R  RRS DD S +       R    D ++   ED     N+     L     L  P+P  K           E  L+RDP+KGK P E   E  T S G
Subjt:  DRHQRRSRDDDSIRGSPRQAGRAEDADIKIAALEDEVKGMNRSLSRIL---QILDKPDPSTK---------VHERSLIRDPRKGKEPMEHTAESGTRSRG

Query:  KKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLN
         K      ++ G       I   R++     + H    P+ G +  D RN             SE  +++               + K  D P  +E  +
Subjt:  KKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLN

Query:  ALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFER
        + KE G               +LEEL DQ D PFTEE+M+ +VP KFK+PT KQ+D   DPV HL+AYR WMD +GVS+A+RCR F  TL GSAR WF +
Subjt:  ALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFER

Query:  LKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQK
        LKR SIS FK LA+AF+ QF+G R + +P   LLT+KQ+  ESLRDY+ RFN+E LQVEG ++  +L+A  +G+ DE L  S GK  P T++E +SRAQ+
Subjt:  LKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQK

Query:  YMSAEELLKSKRS----------EREYKRFSSSSYDIKRTKGS---GPTKEAGAEQTMAEADQTMAEADQTK-----EHLLSVSLRNTP--QLLFRK---
        YMSA E   SKR           ER   +   S ++ KR + S    P K      T    +Q + E    +     E + + S + +     LF +   
Subjt:  YMSAEELLKSKRS----------EREYKRFSSSSYDIKRTKGS---GPTKEAGAEQTMAEADQTMAEADQTK-----EHLLSVSLRNTP--QLLFRK---

Query:  --SKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE----------------
          +++C  LK+E+E LI+ GYLKE+V EPKA           T++G   + P REIRTI GGP    S RKRKA  REAR+  E                
Subjt:  --SKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMAREARSEPE----------------

Query:  --------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFG
                                                    + A+KAM L  ++LK S  PLVGFG ERV PEG IELPVTFG
Subjt:  --------------------------------------------YRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFG

A0A7J0DBQ5 Retrotrans_gag domain-containing protein3.0e-7037.07Show/hide
Query:  VEKRRSAKTAESEAKAVEAEARAAEAEARAAEA-EAKKDDLP-WKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVP
        V  RR+         A EA++ A +++A+   A E   +  P W  ++     +L    G     K  G  ++EEL ++ D PFT  VM+  +P KFK+P
Subjt:  VEKRRSAKTAESEAKAVEAEARAAEAEARAAEA-EAKKDDLP-WKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVP

Query:  TFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITR
        T + +DG KDP+ HL  Y++ MD   V D I CRAF  TL GS R WF RL   +IS F +L++ F+  ++G + Q+KP  +L  VKQ+  E+LR+++T 
Subjt:  TFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFLAQFMGAREQRKPHINLLTVKQQPGESLRDYITR

Query:  FNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDIKRTK-----------GS
        FN E L V+   +  A+ A  AG+   + L S+ K  P+T A  + RAQKYM+AE+ LK++       S  + KR      D K TK             
Subjt:  FNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKR------SEREYKRFSSSSYDIKRTK-----------GS

Query:  GPTKEAGAEQTM----AEADQTMAEADQTKEHLLSVSLRNTPQLLFR------------KSKNCIQLKDEIEALIQNGYLKEFV--GEPKAEADHG---W
          TK A   Q         DQ + E  +         +R+ P+   +            ++++C  LK ++E LI+ G L  F+  G P  + DH     
Subjt:  GPTKEAGAEQTM----AEADQTMAEADQTKEHLLSVSLRNTPQLLFR------------KSKNCIQLKDEIEALIQNGYLKEFV--GEPKAEADHG---W

Query:  PRPSLTKDGRDKEEPLREIRTIFGGPAGGG-SSRKRKAMAREARSEPEYRAFKAMRLGMELLKPS--LTPLVGFGRERVTPEGSIELPVTFGDGQNAVTK
         R SL K+ +  ++P+ EI+ I GG AGGG SS  RK  ARE R++ E      M +G    +P    + LVGF  +RVTP GSI LPVT GD     TK
Subjt:  PRPSLTKDGRDKEEPLREIRTIFGGPAGGG-SSRKRKAMAREARSEPEYRAFKAMRLGMELLKPS--LTPLVGFGRERVTPEGSIELPVTFGDGQNAVTK

Query:  MINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKE
        M+ FLVVDC SAYN I+GR  L+E KAI STYH  +KFPT  GVG  K   +E
Subjt:  MINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKFPTPNGVGVVKGEQKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATCGGGTGACAAGAATCAAAGAAGAAGCAGACAGCAAGGCAAGAAGAAGAAGAAGAAGAAGAATCAACACACTCAATTAAAGTGGTTCGGCAACCAGGGCCTACG
TCCACTATCGAAGGGAGCTATCACTTTTATTTCAAAGGAGAACAGAGATGGAGAAGATACAAGACAATCGGGGAGAAATCGGGCTGGAAACCGGGCCAAGGAGGCGGAGC
CGGTAAGCGGGACGGGCCAAGGTCGAAGGGGTCGGGTTTTTGACCCGACCCCATGCTCGGCCTCGGCGACCCTCGGCCCGCTCGTGCGGGCCGAGCTCGTTCGGTCTTGT
CTGGTCCCCACCGCCTCTGGCTGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTTCGAATACCTAAAAACCCTAGGAGGATGAGCATGTATTTATATCCCTCTTCGCC
ACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTCTACACTCTATTCCCTGCTTTCTCCTCTTGCTCATACTTTTCCACGCCCTACCGTTCTGTTTGCTGACTTAAGCA
TCGGAGCCGGTGTGGCGAGCACCACACCGGTGTGCAGGTTAGGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAAT
CTGCACATCGGTTATTCCACGAGTAAGGGTATGGAGAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGATGACAG
CATCCGGGGGTCACCGAGACAAGCAGGCCGAGCAGAGGATGCCGACATCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAGAATACTCC
AAATCCTGGATAAACCCGACCCTAGCACCAAAGTCCATGAGAGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGG
TCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGATTTTGAGGAGCCGAGAGTCAAGCACACTTAAGGGGCGTCACCA
CACAGTTTCTACCCCAAGTTACGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGCCGAGTCCGAGGCCAAAGCCGTCGAAG
CTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTAAATGCACTAAAGGAGCTCGGAAAT
CCTCAGGGAGACCAGCAGAGGTCAAAGAATTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCC
TCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAA
TCAGGTGCCGTGCATTCTTTTTCACCCTGACAGGATCAGCTAGGCACTGGTTTGAGAGGCTGAAAAGGAGGTCCATCAGCTGTTTCAAAGAGTTAGCCCAAGCATTCCTT
GCACAGTTCATGGGAGCTAGAGAGCAGCGCAAGCCTCACATCAATCTCTTGACGGTCAAACAACAGCCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGA
GGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCT
ATGCGGAGTTCGTCTCCCGGGCACAAAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACATA
AAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCATCTCCTTTC
GGTAAGTTTGAGAAATACACCCCAACTGCTGTTCCGCAAGAGCAAGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCG
TCGGTGAGCCCAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGA
CCAGCTGGAGGAGGTTCGAGCAGGAAGAGGAAAGCTATGGCCAGGGAAGCAAGGTCCGAACCAGAATATCGAGCCTTTAAAGCAATGCGGTTAGGGATGGAACTCTTAAA
GCCAAGCCTTACCCCACTTGTAGGATTTGGAAGAGAGAGAGTGACCCCAGAAGGTAGCATAGAACTTCCTGTGACCTTTGGGGATGGACAGAATGCAGTGACAAAGATGA
TAAACTTCCTAGTTGTGGATTGCATGTCAGCCTACAATGCAATCCTAGGAAGACCAACCTTGCATGAGATGAAGGCCATAGCTTCAACTTACCATCAGTTACTGAAGTTT
CCAACTCCGAACGGTGTGGGAGTCGTCAAGGGAGAACAGAAGGAATCAAGGGAGTGTTACTGGACCGCACTAAAAGCTGTCAAGCCACAGAGGGCCGAGGCCGACCAGAG
AATGAGCCGAGGCCGAGCAGCACACACTTCCACCAACACTGAATCTGAACTTATGGAGTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAATCGGGTGACAAGAATCAAAGAAGAAGCAGACAGCAAGGCAAGAAGAAGAAGAAGAAGAAGAATCAACACACTCAATTAAAGTGGTTCGGCAACCAGGGCCTACG
TCCACTATCGAAGGGAGCTATCACTTTTATTTCAAAGGAGAACAGAGATGGAGAAGATACAAGACAATCGGGGAGAAATCGGGCTGGAAACCGGGCCAAGGAGGCGGAGC
CGGTAAGCGGGACGGGCCAAGGTCGAAGGGGTCGGGTTTTTGACCCGACCCCATGCTCGGCCTCGGCGACCCTCGGCCCGCTCGTGCGGGCCGAGCTCGTTCGGTCTTGT
CTGGTCCCCACCGCCTCTGGCTGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTTCGAATACCTAAAAACCCTAGGAGGATGAGCATGTATTTATATCCCTCTTCGCC
ACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTCTACACTCTATTCCCTGCTTTCTCCTCTTGCTCATACTTTTCCACGCCCTACCGTTCTGTTTGCTGACTTAAGCA
TCGGAGCCGGTGTGGCGAGCACCACACCGGTGTGCAGGTTAGGTGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAAT
CTGCACATCGGTTATTCCACGAGTAAGGGTATGGAGAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGATGACAG
CATCCGGGGGTCACCGAGACAAGCAGGCCGAGCAGAGGATGCCGACATCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAGAATACTCC
AAATCCTGGATAAACCCGACCCTAGCACCAAAGTCCATGAGAGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGG
TCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCGCACGATTTTGAGGAGCCGAGAGTCAAGCACACTTAAGGGGCGTCACCA
CACAGTTTCTACCCCAAGTTACGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGCCGAGTCCGAGGCCAAAGCCGTCGAAG
CTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTAAATGCACTAAAGGAGCTCGGAAAT
CCTCAGGGAGACCAGCAGAGGTCAAAGAATTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCC
TCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAA
TCAGGTGCCGTGCATTCTTTTTCACCCTGACAGGATCAGCTAGGCACTGGTTTGAGAGGCTGAAAAGGAGGTCCATCAGCTGTTTCAAAGAGTTAGCCCAAGCATTCCTT
GCACAGTTCATGGGAGCTAGAGAGCAGCGCAAGCCTCACATCAATCTCTTGACGGTCAAACAACAGCCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGA
GGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCT
ATGCGGAGTTCGTCTCCCGGGCACAAAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACATA
AAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCATCTCCTTTC
GGTAAGTTTGAGAAATACACCCCAACTGCTGTTCCGCAAGAGCAAGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCG
TCGGTGAGCCCAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGA
CCAGCTGGAGGAGGTTCGAGCAGGAAGAGGAAAGCTATGGCCAGGGAAGCAAGGTCCGAACCAGAATATCGAGCCTTTAAAGCAATGCGGTTAGGGATGGAACTCTTAAA
GCCAAGCCTTACCCCACTTGTAGGATTTGGAAGAGAGAGAGTGACCCCAGAAGGTAGCATAGAACTTCCTGTGACCTTTGGGGATGGACAGAATGCAGTGACAAAGATGA
TAAACTTCCTAGTTGTGGATTGCATGTCAGCCTACAATGCAATCCTAGGAAGACCAACCTTGCATGAGATGAAGGCCATAGCTTCAACTTACCATCAGTTACTGAAGTTT
CCAACTCCGAACGGTGTGGGAGTCGTCAAGGGAGAACAGAAGGAATCAAGGGAGTGTTACTGGACCGCACTAAAAGCTGTCAAGCCACAGAGGGCCGAGGCCGACCAGAG
AATGAGCCGAGGCCGAGCAGCACACACTTCCACCAACACTGAATCTGAACTTATGGAGTTCTAG
Protein sequenceShow/hide protein sequence
MQSGDKNQRRSRQQGKKKKKKKNQHTQLKWFGNQGLRPLSKGAITFISKENRDGEDTRQSGRNRAGNRAKEAEPVSGTGQGRRGRVFDPTPCSASATLGPLVRAELVRSC
LVPTASGCPGFAWFDLKRLRIPKNPRRMSMYLYPSSPLKRGSRILSLNSTLYSLLSPLAHTFPRPTVLFADLSIGAGVASTTPVCRLGEILWPKSTINSWRRLWGRKPVN
LHIGYSTSKGMEKKDQDVNIENSDGDRHQRRSRDDDSIRGSPRQAGRAEDADIKIAALEDEVKGMNRSLSRILQILDKPDPSTKVHERSLIRDPRKGKEPMEHTAESGTR
SRGKKTDSMTSKVRGLKPTDRTILRSRESSTLKGRHHTVSTPSYGHTKTDLRNLIVEKRRSAKTAESEAKAVEAEARAAEAEARAAEAEAKKDDLPWKTELLNALKELGN
PQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRCRAFFFTLTGSARHWFERLKRRSISCFKELAQAFL
AQFMGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDI
KRTKGSGPTKEAGAEQTMAEADQTMAEADQTKEHLLSVSLRNTPQLLFRKSKNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGG
PAGGGSSRKRKAMAREARSEPEYRAFKAMRLGMELLKPSLTPLVGFGRERVTPEGSIELPVTFGDGQNAVTKMINFLVVDCMSAYNAILGRPTLHEMKAIASTYHQLLKF
PTPNGVGVVKGEQKESRECYWTALKAVKPQRAEADQRMSRGRAAHTSTNTESELMEF