; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017875 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017875
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr5:10587197..10594850
RNA-Seq ExpressionLag0017875
SyntenyLag0017875
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022929949.1 uncharacterized protein LOC111436411 [Cucurbita moschata]2.1e-12238.96Show/hide
Query:  VRFELDPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQAT
        + F LDPEIERTFRRR ++++   +  N+ ++  G +      NR   +NP M  N  Q     NPI +A+DR RAIRAYA P  + LN  I RP+IQ T
Subjt:  VRFELDPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQAT

Query:  NFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGV-------SDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPN
         FE+KPVMFQMLQT+G+FHGL  EDPHL+LKSFLGV       SDSF  QGV +D +RL+LFPY LRDGAK+WLN+ APG+I +W+ LAE  L KYFPP 
Subjt:  NFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGV-------SDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPN

Query:  RNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKK
        RNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+KT+NEAYEILERI++N+ QW+DVR    +K
Subjt:  RNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKK

Query:  VKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTE---------------------H
         + VLEVD +S+I   LA + N L+N+ +       A V  AA +NQ   E+CVYCGE+H ++ CP+NPAS+F+V  +                     H
Subjt:  VKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTE---------------------H

Query:  PRREGKEQ-------------------------VKAVTLRSGKPLEEPRKTQDIE-----------------------RNSDKNVVAEKELESG-----Q
        P    K Q                                 GK   + + T +                         RN +  +  EK  E G     +
Subjt:  PRREGKEQ-------------------------VKAVTLRSGKPLEEPRKTQDIE-----------------------RNSDKNVVAEKELESG-----Q

Query:  GAGGSNKNAGA------SKSVPDVEP---------------PYVSPPPY---------------------------------------------------
         A    +N  A      SK   +VE                 Y   PP+                                                   
Subjt:  GAGGSNKNAGA------SKSVPDVEP---------------PYVSPPPY---------------------------------------------------

Query:  ---------------VLVKLDLP--------------------------------------------------QFTLQLADRSIKYPKGKIEDVLVKVDK
                        ++K  +P                                                    TLQLADRSI YP+GKIED+L++VDK
Subjt:  ---------------VLVKLDLP--------------------------------------------------QFTLQLADRSIKYPKGKIEDVLVKVDK

Query:  FIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI
        FIF  DFIILDYE D DVPIILG PFL  GR L DV K  +T+ +  ++V+FN+   MKYP  +E+CS +
Subjt:  FIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]5.6e-12348.15Show/hide
Query:  PNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-N
        P+   K+ ++I      EDET SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN  T+ +VDASA GA+L+KT+NEAYEILERI++N+ QW+DVR    
Subjt:  PNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-N

Query:  KKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTEH--PRREGKEQVKAVTLRSG
        +K + VLEVD +S+I   LA + N L+N+ +       A V  AA++NQ   E+CVYCGE+H ++ CP+NPAS+F+V  +     R   E+ K V     
Subjt:  KKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTEH--PRREGKEQVKAVTLRSG

Query:  KPLEEPRKTQDIERNSDKNVVAEKELESGQ-----GAGGSNKNAGASKSVPDVEPPYVSPPPYVLVKLDLPQ-----FTLQLADRSIKYPKGKIEDVLVK
         PL E            KN +  KE + G        GG        +++ D+       P  +  KL + +      TLQLADRS  YP+GKIED+L++
Subjt:  KPLEEPRKTQDIERNSDKNVVAEKELESGQ-----GAGGSNKNAGASKSVPDVEPPYVSPPPYVLVKLDLPQ-----FTLQLADRSIKYPKGKIEDVLVK

Query:  VDKFIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQDSADKHLEDHGEVG
        VDKFIFP DFIILDYEAD DVPIILG PFL TGR L DV K  +T+ + +++V+FN+   MKYP   E+C       S + E   Q + ++   D GE G
Subjt:  VDKFIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQDSADKHLEDHGEVG

Query:  VEDIEVCLLERKNEKELFRCEDVFESLDLDQRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLAD
         E+        +    L      FESL+ + RK+ P+ PS+ EAP LDLKPL  +LKY YLG+ KTLPII+++ L    E+ L+  L++++ AIGWTLAD
Subjt:  VEDIEVCLLERKNEKELFRCEDVFESLDLDQRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLAD

Query:  IQGISPSFCMHKITLDEGSFRSIEQQRMLNPAMKEVVKKE
        I+GISPS CMHKI L+EG  +SIEQQR LNP MKEVV+KE
Subjt:  IQGISPSFCMHKITLDEGSFRSIEQQRMLNPAMKEVVKKE

XP_030443636.1 uncharacterized protein LOC115665966 [Syzygium oleosum]1.6e-11439.52Show/hide
Query:  MGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRL
        M +N+Q     ENP        R ++ YA P      S I RP IQA NFE+KP + QMLQ   +F GL ++DP+++L +FL + D+    GV  DA+RL
Subjt:  MGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRL

Query:  TLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGM
         LFP+SLRD AKTWL S   GSI TW+++A+K LSKYFPP ++AK+R++I  F Q++ E+  EAWERFKELLR+CPHHGLP  +Q+ TFYNG+    +  
Subjt:  TLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGM

Query:  VDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVSTIRVDLAM---IANALK-------NVTMISHQQ------PPAVEPAALV
        +DA+A G L  K+  EA+++LE ++ NS+QW   R + +K   + EV   +TI  +       +N          N +  +  Q      PP+  P    
Subjt:  VDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVSTIRVDLAM---IANALK-------NVTMISHQQ------PPAVEPAALV

Query:  NQVTEEACVYCGEDHNYEFC-------PNNPASVFFVDTEH-----PRREGKEQVKAVTLRSGKPLEEPRKTQDIER---NSDKNVVAEKELESGQGAGG
             E  V        +F         N  AS+  ++ +      P  +   Q+ +      + L    K +D E    N + + + + +L       G
Subjt:  NQVTEEACVYCGEDHNYEFC-------PNNPASVFFVDTEH-----PRREGKEQVKAVTLRSGKPLEEPRKTQDIER---NSDKNVVAEKELESGQGAGG

Query:  SNK------NAGASKSVPDVEPPYVSPPPYVLVKLDLPQ-----FTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATG
        S        N+   K++ D+       P  V  KL L +      +LQLADRSIKYPKG +EDVLVKVDKFIFP DFI+L+ E D +VPIIL  PFLATG
Subjt:  SNK------NAGASKSVPDVEPPYVSPPPYVLVKLDLPQ-----FTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATG

Query:  RALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI----RILESTIVETAIQDSADKHLEDHGEVGVEDIEVCLLERKNEKELFRCEDVFESLDL
        RAL DVQ+ +L + V ++ V F+VFK MKYP +  +C  +    R++ES   +  I+DS +  L        + IEV         E FR    FE L  
Subjt:  RALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI----RILESTIVETAIQDSADKHLEDHGEVGVEDIEVCLLERKNEKELFRCEDVFESLDL

Query:  DQRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRML
        + +K     PS+ E P L+LK L  HLKY +L    +LP+I++S L    EE L+R L+ ++ AIGWT+ADI+GISPS CMHKI ++E     I+ QR L
Subjt:  DQRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRML

Query:  NPAMKEVVKKE
        NP M+EVVK E
Subjt:  NPAMKEVVKKE

XP_030443756.1 uncharacterized protein LOC115666104 [Syzygium oleosum]3.0e-11639.8Show/hide
Query:  MGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRL
        M +N+Q     ENP        R ++ YA P      S I RP IQA NFE+KP + QMLQ   +F GL ++DP+++L +FL + D+    GV  DA+RL
Subjt:  MGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRL

Query:  TLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGM
         LFP+SLRD AKTWL S   GSI TW+++A+K LSKYFPP ++AK+R++I  F Q++ E+  EAWERFKELLR+CPHHGLP  +Q+ TFYNG+    +  
Subjt:  TLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGM

Query:  VDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVSTIRVDLAM---IANALK-------NVTMISHQQ------PPAVEPAALV
        +DA+A G L  K+  EA+++LE ++ NS+QW   R + +K   + EV   +TI  +       +N          N +  +  Q      PP+  P    
Subjt:  VDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVSTIRVDLAM---IANALK-------NVTMISHQQ------PPAVEPAALV

Query:  NQVTEEACVYCGEDHNYEFC-------PNNPASVFFVDTEH-----PRREGKEQVKAVTLRSGKPLEEPRKTQDIER---NSDKNVVAEKELESGQGAGG
             E  V        +F         N  AS+  ++ +      P  +   Q+ +      + L   RK +D E    N + + + + +L       G
Subjt:  NQVTEEACVYCGEDHNYEFC-------PNNPASVFFVDTEH-----PRREGKEQVKAVTLRSGKPLEEPRKTQDIER---NSDKNVVAEKELESGQGAGG

Query:  SNK------NAGASKSVPDVEPPYVSPPPYVLVKLDLPQ-----FTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATG
        S        N+   K++ D+       P  V  KL L +      +LQLADRSIKYPKG +EDVLVKVDKFIFP DFI+L+ E D +VPIILG PFLATG
Subjt:  SNK------NAGASKSVPDVEPPYVSPPPYVLVKLDLPQ-----FTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATG

Query:  RALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI----RILESTIVETAIQDSADKHLEDHGEVGVEDIEVCLLERKNEKELFRCEDVFESLDL
        RAL DVQ+ +L + V ++ V F+VFK MKYP +  +C  +    R++ES   +  I+DS +  L    +   + IEV         E FR    FE L  
Subjt:  RALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI----RILESTIVETAIQDSADKHLEDHGEVGVEDIEVCLLERKNEKELFRCEDVFESLDL

Query:  DQRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRML
        + +K     PS+ E P L+LK L  HLKY +L    +LP+I++S L    EE L+R L+ ++ AIGWT+ADI+GISPS CMHKI ++E     I+ QR L
Subjt:  DQRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRML

Query:  NPAMKEVVKKE
        NP M+EVVK E
Subjt:  NPAMKEVVKKE

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]6.4e-11938.54Show/hide
Query:  QAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFPYSLRD
        Q  +PI++ +DR RAIR YA PMF+ LN GI RP+IQA  FE+KPVMFQMLQTVG+F  + +EDPHL+L+SFL +SDSF IQGV  +  RL LFP+SLRD
Subjt:  QAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFPYSLRD

Query:  GAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGAL
         A++WLN+ +P S+  W++ AEK L KYFPP RNAK RSEI+ F QLEDE+ S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q ++DASA GA+
Subjt:  GAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGAL

Query:  LAKTFNEAYEILERISTNSFQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNP
        L+K++NEA+EILE I++N++QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+++ + +    ++PAA + Q  + +CV+C E H +E CP+NP
Subjt:  LAKTFNEAYEILERISTNSFQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNP

Query:  ASVFFV----------------------------------------------------------------------------------------------
         SV ++                                                                                              
Subjt:  ASVFFV----------------------------------------------------------------------------------------------

Query:  ------------------DTEHPRREGKEQVKAVTLRSGKPLE----------EPRKTQDIERNSDKNVVAEKELESGQGAGGSNKNAGASKSV-PDVEP
                          DTE+PRR+GKEQ K++ LRSGK L+          EP   Q  E+ S K      +      A G   N+  S  V    +P
Subjt:  ------------------DTEHPRREGKEQVKAVTLRSGKPLE----------EPRKTQDIERNSDKNVVAEKELESGQGAGGSNKNAGASKSV-PDVEP

Query:  PYVSP-----------------------------------PPYV--------------------------LVKLDLP-----------------------
        P   P                                   P YV                          ++K  +P                       
Subjt:  PYVSP-----------------------------------PPYV--------------------------LVKLDLP-----------------------

Query:  -----QFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCS
               TLQLADRS+ +P GKIEDVLV+VDKFIFP DFIILDYE D++VPIIL  PFLATGR L DV+K ELTM   +E+  F VF+ ++ PD + +C 
Subjt:  -----QFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCS

Query:  FIRILESTIVE
         I  ++  +VE
Subjt:  FIRILESTIVE

TrEMBL top hitse value%identityAlignment
A0A2G9HWF8 Reverse transcriptase3.6e-10735.1Show/hide
Query:  DPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVAND--RTRAIRAYAFPMFDVLNSGIARPQIQATNFE
        DPEIERTFR RR +                                L    EQ+    EN I+V  D      +R  A P      S +  P++ A   +
Subjt:  DPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVAND--RTRAIRAYAFPMFDVLNSGIARPQIQATNFE

Query:  MKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIV
        ++  M QM+Q   +F GLS E+P+ ++ +FL + D+   +GV +DALRL LF +SL   A  W  S    SI TW +L E+ +SK+F P + A LR+EI+
Subjt:  MKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIV

Query:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVS
         FRQ   ET  EAW RF+++LR CP+H +P  IQ+ TFY+GL    +  +D     + L+ T  E + +L  +  N ++    R T  K   V+EVD V+
Subjt:  GFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVS

Query:  TIRVDLAMIANALKNVTM----ISHQQPPAVEPAALVNQVTEEACVYCGEDHN------YEFCPNN-----PASVFFVDTEH-----PRREGKEQVKAVT
         +   +  +  ++KN        S Q P +VE    V+   +         +N        F  NN      A  F    +H     PR++GK Q +AVT
Subjt:  TIRVDLAMIANALKNVTM----ISHQQPPAVEPAALVNQVTEEACVYCGEDHN------YEFCPNN-----PASVFFVDTEH-----PRREGKEQVKAVT

Query:  LRSGKPLE----EPRKTQDIERNSDKNVVAEKELESGQGAG--------------------------------------------------------GSN
        LR+G+ L+    EP K+++ E  S++    EKE+E+                                                                
Subjt:  LRSGKPLE----EPRKTQDIERNSDKNVVAEKELESGQGAG--------------------------------------------------------GSN

Query:  KNAGASKSVPDVE----------PPYVSPP---------------PYV------LVKLDLPQFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDY
        +  G  ++V   E          PP +  P               PY       LV+      TLQLADRS+ YPKG IED+LVKVDKFIFP DF++LD 
Subjt:  KNAGASKSVPDVE----------PPYVSPP---------------PYV------LVKLDLPQFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDY

Query:  EADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQ----DSADKHLED-HGEVGVEDIEVCLLE
        E D +VPIILG PFLATGR L DVQK ELTM V ++++ FNVFK MK+P++ ++C  + + ++   + +I     D  ++ L D   E   ED EV  ++
Subjt:  EADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQ----DSADKHLED-HGEVGVEDIEVCLLE

Query:  RKNEKELFRCEDVFESLDLDQRKAPP--INPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSF
          +  + F+   V ESL   +R AP   + PS+ E PTL+LKPL  HL Y YLGE  TLP+I++S L     E L+R L+ ++ AIGWT+ADI+GISPSF
Subjt:  RKNEKELFRCEDVFESLDLDQRKAPP--INPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSF

Query:  CMHKITLDEGSFRSIEQQRMLNPAMKEVVKKE
        CMHKI L++    S+E QR LNP MKEVVKKE
Subjt:  CMHKITLDEGSFRSIEQQRMLNPAMKEVVKKE

A0A6A2ZSK6 Integrase catalytic domain-containing protein7.4e-10534.07Show/hide
Query:  EQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFP
        E  +++  NP      R RAIR +   + + LN GI  P IQA  FE+KPVMF +L ++                            GV +D L+L LFP
Subjt:  EQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFP

Query:  YSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS
        YS+ D A+ WL+    GS+ +W  L +  L +Y PPN N +LR++I  FRQ +DE+  E W+R+K LLRKC +HG     Q+  FYNG+N  T+ M+DAS
Subjt:  YSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDAS

Query:  ARGALLAKTFNEAYEILERISTNSFQWSDVR-GTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPAVEPAALVNQVTEEACVYCGEDHNYEF
        A G LL K+  +A++IL+RI+TN +Q+   R G  +K    LE+D    +   LA I N LKN+     Q+P  V       +    AC+ C  +HN   
Subjt:  ARGALLAKTFNEAYEILERISTNSFQWSDVR-GTNKKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPAVEPAALVNQVTEEACVYCGEDHNYEF

Query:  CPNNPASVFFV--------------------------------------------DTEHPRREGKEQVKAVTLRS-----------GKPLEEPRKTQDIE
        CP N  S+ FV                                            DTE  +   KE+   +TLRS           GKP ++   T    
Subjt:  CPNNPASVFFV--------------------------------------------DTEHPRREGKEQVKAVTLRS-----------GKPLEEPRKTQDIE

Query:  RNSDKNV--VAEKELESGQG-----AGGSNKNAGA------SKSVPDVE-----------------------------------------PP-------Y
          +DK V   A KE +  +G     A G+N+NA A      S +VP +E                                         PP       +
Subjt:  RNSDKNV--VAEKELESGQG-----AGGSNKNAGA------SKSVPDVE-----------------------------------------PP-------Y

Query:  VSP----------------------PPYVLVKLDL-----PQFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATGRA
        ++P                      P  + VKL +         LQLAD S  +P+G+IEDV+V+VDKF+FPVDF+ILD E D   PIILG PFLATGR 
Subjt:  VSP----------------------PPYVLVKLDL-----PQFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATGRA

Query:  LTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQDSADKHLE-DHGEVGVEDIEV----CLLERKNEKELFRCEDVFESLDLD
        L D ++ ELTM V ++ V  N+F+ +KY DD E+C  I  +++T+ + A Q   D  ++ D  E   E ++          + +    +         L+
Subjt:  LTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQDSADKHLE-DHGEVGVEDIEV----CLLERKNEKELFRCEDVFESLDLD

Query:  QRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRMLN
         ++  P  PSL  APTL+LK L   LKY YLG  +TLP+I++S L P  E +L+  L Q++KA+GWT+ D++GIS +  MHKI L+E    SIE QR LN
Subjt:  QRKAPPINPSLIEAPTLDLKPLSDHLKYVYLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRMLN

Query:  PAMKEVVKKE
        P MK+VV KE
Subjt:  PAMKEVVKKE

A0A6A2ZSK6 Integrase catalytic domain-containing protein2.1e-0641.25Show/hide
Query:  WLGFIKLRLLPTTHDSTVSRDQVILIFAILRSLSIDVGKIISNEILSCWRKKVGKLFFPNTITMLCSRAGVPTVLEDVIL
        W  F+K +LLPT+H++TVS  +++L+ +I+    IDVGKII      C +++   L FPN IT LC +  V   + D IL
Subjt:  WLGFIKLRLLPTTHDSTVSRDQVILIFAILRSLSIDVGKIISNEILSCWRKKVGKLFFPNTITMLCSRAGVPTVLEDVIL

A0A6J1EEI2 uncharacterized protein LOC1114333942.5e-10858.65Show/hide
Query:  QNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRD
        +NP M  N  Q     N I +A+DR RAIRAYA P  + LN  I RP++QAT FE+KPVMFQMLQT+G+FHGL SEDPHL+LKSFLGVSDSF  Q V +D
Subjt:  QNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRD

Query:  ALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGV
         +RL+LFPYSLRDGAK+WLN+ A G+I +W+ L EK L KYFPP RNA+ R+EIV F+Q ED+T SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN  
Subjt:  ALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGV

Query:  TQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVY
        T+ +VDASA GA+L+KT+NEAYEILERI++N+ QW+DVR    +K + VLEVD +S+I   LA + N L+N+ +       A V   A++NQ   E+CVY
Subjt:  TQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVY

Query:  CGEDHNYEFCPNNPASVFFVDTEHPRREGKEQVKAVTLRSG
        CGE+H ++ CP+NPAS+F+V  +  +   K    + T   G
Subjt:  CGEDHNYEFCPNNPASVFFVDTEHPRREGKEQVKAVTLRSG

A0A6J1EQ90 uncharacterized protein LOC1114364111.0e-12238.96Show/hide
Query:  VRFELDPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQAT
        + F LDPEIERTFRRR ++++   +  N+ ++  G +      NR   +NP M  N  Q     NPI +A+DR RAIRAYA P  + LN  I RP+IQ T
Subjt:  VRFELDPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQAT

Query:  NFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGV-------SDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPN
         FE+KPVMFQMLQT+G+FHGL  EDPHL+LKSFLGV       SDSF  QGV +D +RL+LFPY LRDGAK+WLN+ APG+I +W+ LAE  L KYFPP 
Subjt:  NFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGV-------SDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPN

Query:  RNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKK
        RNA+ ++EIV F+Q EDET SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN VT+ +VDASA GA+L+KT+NEAYEILERI++N+ QW+DVR    +K
Subjt:  RNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKK

Query:  VKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTE---------------------H
         + VLEVD +S+I   LA + N L+N+ +       A V  AA +NQ   E+CVYCGE+H ++ CP+NPAS+F+V  +                     H
Subjt:  VKSVLEVDGVSTIRVDLAMIANALKNVTMISHQQPPA-VEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTE---------------------H

Query:  PRREGKEQ-------------------------VKAVTLRSGKPLEEPRKTQDIE-----------------------RNSDKNVVAEKELESG-----Q
        P    K Q                                 GK   + + T +                         RN +  +  EK  E G     +
Subjt:  PRREGKEQ-------------------------VKAVTLRSGKPLEEPRKTQDIE-----------------------RNSDKNVVAEKELESG-----Q

Query:  GAGGSNKNAGA------SKSVPDVEP---------------PYVSPPPY---------------------------------------------------
         A    +N  A      SK   +VE                 Y   PP+                                                   
Subjt:  GAGGSNKNAGA------SKSVPDVEP---------------PYVSPPPY---------------------------------------------------

Query:  ---------------VLVKLDLP--------------------------------------------------QFTLQLADRSIKYPKGKIEDVLVKVDK
                        ++K  +P                                                    TLQLADRSI YP+GKIED+L++VDK
Subjt:  ---------------VLVKLDLP--------------------------------------------------QFTLQLADRSIKYPKGKIEDVLVKVDK

Query:  FIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI
        FIF  DFIILDYE D DVPIILG PFL  GR L DV K  +T+ +  ++V+FN+   MKYP  +E+CS +
Subjt:  FIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVCNEEVKFNVFKDMKYPDDMEDCSFI

A0A6J1H7E4 uncharacterized protein LOC1114611685.9e-11060Show/hide
Query:  QNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRD
        +NP+M  N  Q     N I +A+DR RAIRAYA P  D LN  I RP++QAT FE+KPVMFQMLQT+G+FHGL SEDPHL+LKSFLGVSDSF  QGV +D
Subjt:  QNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFDVLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRD

Query:  ALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGV
         +RL+LFPYSLRDGAK+WLN+ AP +I +W+ LAEK L KYFPP RNA+ R+EIV F+Q EDET SEAWERFKE+LRKCPHHGLPHCIQMETFYNGLN  
Subjt:  ALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNAKLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGV

Query:  TQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNV-----TMISHQQPPAVEPAALVNQVTEE
        T+ +VDASA GA+L+KT+NEAYEILERI++N+ QW+DVR    KK + VLEVD +S+I   LA + N L+N+     TMI   + PA   AA++ Q   E
Subjt:  TQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGT-NKKVKSVLEVDGVSTIRVDLAMIANALKNV-----TMISHQQPPAVEPAALVNQVTEE

Query:  ACVYCGEDHNYEFCPNNPASVFFVDTEHPRREGKEQVKAVTLRSG
        +CVYCGE+H ++ CP NPAS+ +V  +  +   K    + T   G
Subjt:  ACVYCGEDHNYEFCPNNPASVFFVDTEHPRREGKEQVKAVTLRSG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCGAGAGATGAGAGCTAAGAGACGTGCGACCATTGAAGAAGAGGCACGGTTGCGTGAAGCTGAAAACTCATTCCCTAAGGTAGGCAAAAGCTCACGGCAAGGGGA
GGCTTCAACGGATGATTTGGCTAGAAAAAGAGATGTGGATGATGAGAAGAAAAAGGAAGAAGTTGGGAAAAGGAGAGAAGAAGAAGAACAAGAGGCCGAGAGGGCCCTTA
GAGAAGAAAAAGAAAAACTTGAGGCTGAAAAAGCAAGAAAAGAGGATGAAGAACTAAGGAGATTGGCTACTGACCTCCAACTCCTTGAGGAAGAAAAGACAAGAAGAGAA
ATTGTTGGTGGTGCGATCCGCAAGCGTACGGTTGCCACAAGTGTGAGTTTGGTGCATGAGCGATCCGCCTGGGTGAGGTTTGAGCTTGATCCAGAAATCGAGAGGACATT
CAGGAGAAGGAGAGAGCAGCGTAGGAACCAGAACCAAATGGATAACGTGTCGCGTCTCTCGCAGGGTCCTGAAGATCCAGCAGACCCCCAGAATCGCTTGCTGCAGCAAA
ATCCGCTGATGGGGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTTTTCCAATGTTTGAT
GTGTTAAATTCAGGGATTGCACGTCCTCAAATTCAGGCGACAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGAAATTCCATGGTTTGTCATC
TGAAGATCCTCATTTATATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCTTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTA
GAGATGGAGCAAAGACATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGTTAGCTGAAAAAAATTTGAGTAAATATTTCCCACCTAATAGAAATGCT
AAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTACGAAAGTGCCCACACCATGGTTTACC
TCATTGTATCCAAATGGAAACATTTTACAATGGGTTAAACGGAGTAACCCAGGGTATGGTTGATGCCTCGGCTAGAGGGGCCCTTTTGGCAAAAACTTTTAATGAAGCCT
ATGAAATTTTAGAGAGAATATCTACTAATAGTTTTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGG
GTTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAATGATTAGTCATCAGCAGCCACCAGCTGTGGAGCCTGCTGCATTGGTGAACCAAGTCACAGAGGAAGC
ATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAA
AGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAAAGACCCAGGATATAGAAAGAAATAGTGATAAAAATGTTGTTGCTGAGAAAGAGTTGGAGTCTGGT
CAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTAAATCTGTTCCAGATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTATTGGTGAAGCTAGACCTACC
ACAGTTCACACTCCAACTAGCTGATAGGTCTATCAAATATCCAAAGGGTAAAATTGAGGATGTCTTAGTGAAAGTGGACAAATTCATATTTCCAGTTGATTTTATTATTT
TAGACTATGAGGCTGATAAAGATGTCCCAATTATTTTAGGTCATCCATTTTTGGCTACTGGTAGGGCATTAACAGATGTTCAAAAAAGGGAATTAACAATGGGAGTCTGT
AATGAGGAAGTGAAATTTAATGTGTTTAAAGACATGAAGTATCCAGACGACATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAGCACAATTGTTGAGACAGCAATACA
GGATTCGGCTGACAAACATTTGGAAGATCATGGAGAGGTTGGTGTAGAGGACATAGAAGTTTGTTTGTTAGAAAGAAAAAATGAAAAAGAATTGTTTAGGTGCGAGGATG
TTTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCTATTAATCCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAGTATGTG
TATCTTGGGGAAGGTAAGACGTTGCCCATTATTGTTGCATCAGATTTAATACCAAAGCATGAAGAGGCCTTAATAAGGTTTCTGCAGCAATACCGCAAGGCTATAGGTTG
GACATTGGCTGACATTCAGGGAATTAGCCCATCTTTTTGTATGCATAAAATCACTCTAGATGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAATGCTTAACCCTGCAA
TGAAAGAGGTTGTTAAAAAGGAGGACTTCCCACACGCGGGCTTTAATGAAATGGTCGTGGCACCGTCCAGTGTTCAGTTGAATGCAGCAGTCCGAGAGGTTGGAATTGAT
GGGGCTCAGTGGAGGCTATCTAAGACGGAGAAACACGCGTTTCAAGCTGCCTATCTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATCAAGCTGCGTTTGCTGCCGAC
TACGCATGATTCAACAGTGTCTCGCGACCAAGTGATTCTGATATTCGCTATTCTTCGATCCCTGAGTATTGATGTTGGAAAAATCATTTCAAATGAGATTCTAAGTTGCT
GGCGCAAGAAGGTGGGGAAGCTGTTTTTCCCAAACACGATCACGATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCTAGAGGATGTGATTCTGCTTGATAAGGGAATC
ATCGACACGCCTAATCTGTCGGTAAGTAGGCAAGAGTTTGCTGAAAGCTTGGACTGGTTAAGCTTAATTAGATCAAGAGCTAGGCTATGGCGAGTTCTTAGAATTGAGTT
AAAAGTGGTGATTAATTGTTCATGCCGGAAGAATTATTTTGCTGCAGCAAAGCTCGGTTTTGTAGAGTGCTCAGAATGCATTGCTGATCGACTGGAGAGAGCAAATTCTG
TGTTGCAGCAAAGCTGTGAGCAAAACTGCCACGTCACAGTTCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCCGAGAGATGAGAGCTAAGAGACGTGCGACCATTGAAGAAGAGGCACGGTTGCGTGAAGCTGAAAACTCATTCCCTAAGGTAGGCAAAAGCTCACGGCAAGGGGA
GGCTTCAACGGATGATTTGGCTAGAAAAAGAGATGTGGATGATGAGAAGAAAAAGGAAGAAGTTGGGAAAAGGAGAGAAGAAGAAGAACAAGAGGCCGAGAGGGCCCTTA
GAGAAGAAAAAGAAAAACTTGAGGCTGAAAAAGCAAGAAAAGAGGATGAAGAACTAAGGAGATTGGCTACTGACCTCCAACTCCTTGAGGAAGAAAAGACAAGAAGAGAA
ATTGTTGGTGGTGCGATCCGCAAGCGTACGGTTGCCACAAGTGTGAGTTTGGTGCATGAGCGATCCGCCTGGGTGAGGTTTGAGCTTGATCCAGAAATCGAGAGGACATT
CAGGAGAAGGAGAGAGCAGCGTAGGAACCAGAACCAAATGGATAACGTGTCGCGTCTCTCGCAGGGTCCTGAAGATCCAGCAGACCCCCAGAATCGCTTGCTGCAGCAAA
ATCCGCTGATGGGGCAAAATGAGCAGCAAAATAATCAGGCTGAGAATCCTATCTTGGTAGCGAACGATAGGACCAGAGCCATTCGAGCATATGCTTTTCCAATGTTTGAT
GTGTTAAATTCAGGGATTGCACGTCCTCAAATTCAGGCGACAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTGCAAACCGTGGGGAAATTCCATGGTTTGTCATC
TGAAGATCCTCATTTATATCTTAAGTCTTTTCTAGGAGTTAGTGATTCTTTTGTAATTCAAGGAGTGCTTAGAGATGCTCTTAGATTAACTTTGTTCCCGTATTCTCTTA
GAGATGGAGCAAAGACATGGTTAAATTCTTTTGCTCCAGGATCAATTAGGACATGGGATGAGTTAGCTGAAAAAAATTTGAGTAAATATTTCCCACCTAATAGAAATGCT
AAATTAAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTACGAAAGTGCCCACACCATGGTTTACC
TCATTGTATCCAAATGGAAACATTTTACAATGGGTTAAACGGAGTAACCCAGGGTATGGTTGATGCCTCGGCTAGAGGGGCCCTTTTGGCAAAAACTTTTAATGAAGCCT
ATGAAATTTTAGAGAGAATATCTACTAATAGTTTTCAGTGGTCAGATGTTAGAGGCACAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCCACCATTAGG
GTTGATCTTGCAATGATTGCTAACGCTCTTAAGAATGTGACAATGATTAGTCATCAGCAGCCACCAGCTGTGGAGCCTGCTGCATTGGTGAACCAAGTCACAGAGGAAGC
ATGTGTCTATTGTGGTGAAGATCACAACTACGAGTTTTGCCCCAACAATCCAGCTTCTGTGTTTTTTGTAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCAGGTAA
AGGCAGTAACTCTTAGGAGTGGTAAGCCACTAGAAGAGCCTAGAAAGACCCAGGATATAGAAAGAAATAGTGATAAAAATGTTGTTGCTGAGAAAGAGTTGGAGTCTGGT
CAGGGTGCTGGAGGCAGCAATAAAAATGCTGGAGCATCTAAATCTGTTCCAGATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTATTGGTGAAGCTAGACCTACC
ACAGTTCACACTCCAACTAGCTGATAGGTCTATCAAATATCCAAAGGGTAAAATTGAGGATGTCTTAGTGAAAGTGGACAAATTCATATTTCCAGTTGATTTTATTATTT
TAGACTATGAGGCTGATAAAGATGTCCCAATTATTTTAGGTCATCCATTTTTGGCTACTGGTAGGGCATTAACAGATGTTCAAAAAAGGGAATTAACAATGGGAGTCTGT
AATGAGGAAGTGAAATTTAATGTGTTTAAAGACATGAAGTATCCAGACGACATGGAAGATTGCTCTTTCATTAGGATTCTGGAGAGCACAATTGTTGAGACAGCAATACA
GGATTCGGCTGACAAACATTTGGAAGATCATGGAGAGGTTGGTGTAGAGGACATAGAAGTTTGTTTGTTAGAAAGAAAAAATGAAAAAGAATTGTTTAGGTGCGAGGATG
TTTTTGAGTCTTTAGATTTAGATCAAAGGAAGGCTCCTCCTATTAATCCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGTCGGATCATCTAAAGTATGTG
TATCTTGGGGAAGGTAAGACGTTGCCCATTATTGTTGCATCAGATTTAATACCAAAGCATGAAGAGGCCTTAATAAGGTTTCTGCAGCAATACCGCAAGGCTATAGGTTG
GACATTGGCTGACATTCAGGGAATTAGCCCATCTTTTTGTATGCATAAAATCACTCTAGATGAGGGATCCTTTAGGAGTATTGAGCAACAAAGAATGCTTAACCCTGCAA
TGAAAGAGGTTGTTAAAAAGGAGGACTTCCCACACGCGGGCTTTAATGAAATGGTCGTGGCACCGTCCAGTGTTCAGTTGAATGCAGCAGTCCGAGAGGTTGGAATTGAT
GGGGCTCAGTGGAGGCTATCTAAGACGGAGAAACACGCGTTTCAAGCTGCCTATCTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATCAAGCTGCGTTTGCTGCCGAC
TACGCATGATTCAACAGTGTCTCGCGACCAAGTGATTCTGATATTCGCTATTCTTCGATCCCTGAGTATTGATGTTGGAAAAATCATTTCAAATGAGATTCTAAGTTGCT
GGCGCAAGAAGGTGGGGAAGCTGTTTTTCCCAAACACGATCACGATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCTAGAGGATGTGATTCTGCTTGATAAGGGAATC
ATCGACACGCCTAATCTGTCGGTAAGTAGGCAAGAGTTTGCTGAAAGCTTGGACTGGTTAAGCTTAATTAGATCAAGAGCTAGGCTATGGCGAGTTCTTAGAATTGAGTT
AAAAGTGGTGATTAATTGTTCATGCCGGAAGAATTATTTTGCTGCAGCAAAGCTCGGTTTTGTAGAGTGCTCAGAATGCATTGCTGATCGACTGGAGAGAGCAAATTCTG
TGTTGCAGCAAAGCTGTGAGCAAAACTGCCACGTCACAGTTCGTTAG
Protein sequenceShow/hide protein sequence
MIREMRAKRRATIEEEARLREAENSFPKVGKSSRQGEASTDDLARKRDVDDEKKKEEVGKRREEEEQEAERALREEKEKLEAEKARKEDEELRRLATDLQLLEEEKTRRE
IVGGAIRKRTVATSVSLVHERSAWVRFELDPEIERTFRRRREQRRNQNQMDNVSRLSQGPEDPADPQNRLLQQNPLMGQNEQQNNQAENPILVANDRTRAIRAYAFPMFD
VLNSGIARPQIQATNFEMKPVMFQMLQTVGKFHGLSSEDPHLYLKSFLGVSDSFVIQGVLRDALRLTLFPYSLRDGAKTWLNSFAPGSIRTWDELAEKNLSKYFPPNRNA
KLRSEIVGFRQLEDETFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGVTQGMVDASARGALLAKTFNEAYEILERISTNSFQWSDVRGTNKKVKSVLEVDGVSTIR
VDLAMIANALKNVTMISHQQPPAVEPAALVNQVTEEACVYCGEDHNYEFCPNNPASVFFVDTEHPRREGKEQVKAVTLRSGKPLEEPRKTQDIERNSDKNVVAEKELESG
QGAGGSNKNAGASKSVPDVEPPYVSPPPYVLVKLDLPQFTLQLADRSIKYPKGKIEDVLVKVDKFIFPVDFIILDYEADKDVPIILGHPFLATGRALTDVQKRELTMGVC
NEEVKFNVFKDMKYPDDMEDCSFIRILESTIVETAIQDSADKHLEDHGEVGVEDIEVCLLERKNEKELFRCEDVFESLDLDQRKAPPINPSLIEAPTLDLKPLSDHLKYV
YLGEGKTLPIIVASDLIPKHEEALIRFLQQYRKAIGWTLADIQGISPSFCMHKITLDEGSFRSIEQQRMLNPAMKEVVKKEDFPHAGFNEMVVAPSSVQLNAAVREVGID
GAQWRLSKTEKHAFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSRDQVILIFAILRSLSIDVGKIISNEILSCWRKKVGKLFFPNTITMLCSRAGVPTVLEDVILLDKGI
IDTPNLSVSRQEFAESLDWLSLIRSRARLWRVLRIELKVVINCSCRKNYFAAAKLGFVECSECIADRLERANSVLQQSCEQNCHVTVR