CuGenDBv2

Lag0035443 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035443
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:21547512..21552323
RNA-Seq Expression: Lag0035443
Synteny: Lag0035443
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 1.6e-152; % identity: 36.55)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        M  TKAPG DG+ A F+  +W IVG   ++ C+ ILN   +   +N T IALIPK + P ++  +RPISLCNV+Y+IVAKA+ANR+K  L  IISP Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+P RLI+DNVI+G+EC+H     +  ++G +A+KLD+SKAYDRVEW+F+ + M+ +GF   WI+ IM CI +  +SVL+NG P  +  P RGLRQG P+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+F++CAE FS LL + E  +   GLK  +   +++ L FADDSL+F +A   DC  +K I   Y +ASGQ  N  KSS+  S   S + +S + +I 
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
         +    +  +YLG+P    RNK   FK +K +V   + +W  KLFSAGGKEILIKA+ QA+P Y MS F++P  L EDI +  A FWWG  + K   HW 
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
         W+ MS +K +GG                        P+SL+ + ++ RY+K   F +A +GS PSF WRSI+WG ++ KKG RW+IG+GK+V++ +D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILN--TPNYITDDEIIWSRDKRGMFSVKSVYHMAV--SLANSQEASSSSSLDI
        I  P    P   K       V  ++D + +W+   + ++F   + E IL    P+   +DE++W  DK+G +SVKS Y +A+  +  N  E+S+SSS   
Subjt:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILN--TPNYITDDEIIWSRDKRGMFSVKSVYHMAV--SLANSQEASSSSSLDI

Query:  VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETW--------------KNFYPLTNHMYSLNRGG
         ++WK  W L    K KI   +                              E+ SH++ ECK  ++ W              ++F+     M+S +   
Subjt:  VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETW--------------KNFYPLTNHMYSLNRGG

Query:  ---------WTPMTNQTNSRLD-QRQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLG
                 W   + +     + ++   RF+  K + S ++    +SK   + G +   +    +W     N  KLNVDA+   K+ + G+G I+RD+ G
Subjt:  ---------WTPMTNQTNSRLD-QRQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLG

Query:  SSICSGFKRIKRSWNMKMLEMKAIAEGLK
          +  G K+ +    + + E +AI  GL+
Subjt:  SSICSGFKRIKRSWNMKMLEMKAIAEGLK

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 9.6e-161; % identity: 38.31)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        M+ TKAPG DGM A+F+  YW+IVGND   + +++LN+  S  +INKT I L+PK KNP++M ++RPISLCNV+YK+++K LANR+K+ L  IIS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+  RLI+DNV++ FE +H   +K++GK G  A+KLDMSKAYDRVEW FI+++M KMGF E WI  +M CI S+ YS+L+NG       P RGLRQGDPI
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPYIFL+CA+GFS+LL         SG+ + + CP ++ LFFADDSL+FC+A  ++C T+  IL+ YE+ASGQ IN++KSS+  S N   +    +  +L
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G  +     +YLG+PS   ++K ++F  +K+RV + L  WKEKL S GG+EILIKA+ QAIPTYTMSCF+IP +L E+I  M   FWWG+   + K  W+
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
        SW+K+  +K+ GG                       +P+SL+ +  + RY+   D   A LG++PS+TWRSI  G E+ ++G RW++GNG+R++I +D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTP-GNEVPFLTKPDLSGKRVDSILD-EKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI
        + TP   +V    KP     RV +++D E+ RWK+  V + F   EA  IL+ P  +   +D+IIW  +++G FSVKS Y++AV + ++ E   SSS D 
Subjt:  ISTP-GNEVPFLTKPDLSGKRVDSILD-EKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI

Query:  VK-MWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWKNFYPLTNHMYSLNRG--------------
           +W+  W L   PK +I A K                              ES  H+  +C+  K  W+ +      + ++N                
Subjt:  VK-MWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWKNFYPLTNHMYSLNRG--------------

Query:  ---------GWTPMTNQTNSRLDQ-RQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLP-SHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDS
                  W    N+     +   Q   FI+          I Y+ +         + LP S  +W   PP  +K+NVD +         +G I+RD+
Subjt:  ---------GWTPMTNQTNSRLDQ-RQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLP-SHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDS

Query:  LGSSICSGFKRIKRSWNMKMLEMKAIAEGL
         G    +    ++  ++++ +E  A+  GL
Subjt:  LGSSICSGFKRIKRSWNMKMLEMKAIAEGL

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber] (e value: 2.5e-153; % identity: 37.91)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        ++ TKAPG DGM A F+H+YWDIVG   + + + +LN+     +INKT I+LIPKT  P  M  +RPISLCN  YKI++K LANR K+ L +IIS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F P RLI+DNV++ FE +H  N+K +GK  ++++KLDMSKA+DRVEWSFI+ +M K+GF E WI  IM+C+ S+ YSVL+NG       P RG+RQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SP +FL+CAEG SAL+      +  +G+ + + CP ++ LFFADDSL+FC+A+E++C  +  IL  YEEASGQ IN +KSS+  S N S +    +  IL
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G  +     +YLG+PS   ++K +VF  +KDRV K L  WK KL S GG+EILIKA+ QA+PTYTMSCF++P +L +D+  +  +FWWG+   + K  W+
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
        SW KM  SK  GG                       +P+SL+ +  + +YF   D L++  GS PS+ WRSI    ++ +KG RW++GNG+R+ I  D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSG-KRVDSILD-EKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANS-QEASSSSSLD
        + TP        + D      V S++D +  RWK   +   F   EA  IL  P    + +D +IW  +KRG F+VKS Y++A  L +S +E  S+S   
Subjt:  ISTPGNEVPFLTKPDLSG-KRVDSILD-EKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANS-QEASSSSSLD

Query:  IVKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWKNFYPLTNHMYSLNRGGWTPMTNQTNSRLD
           +WK  W+LK  PK KI A +                              E+ +H +  C+  K TW  +      + +        +       L 
Subjt:  IVKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWKNFYPLTNHMYSLNRGGWTPMTNQTNSRLD

Query:  QRQFFRFIYRKI---------ENSEIQPIDYLS---------KSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSSICS
          + F  +   I         E+S   PI             K+  +    S++LP   RW   P   +K+N DA+    E    IG ++RD  G  + +
Subjt:  QRQFFRFIYRKI---------ENSEIQPIDYLS---------KSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSSICS

Query:  GFKRIKRSWNMKMLEMKAIAEGL
          K +  S+  ++ E  A+ EG+
Subjt:  GFKRIKRSWNMKMLEMKAIAEGL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa] (e value: 2.5e-153; % identity: 36.65)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        M+  K+PG DGM  MFY +YW IVG   +   + +LN G      N TL+ LIPK K PS++  YRPISLCNV+YK+V+KA+  R+K  L  +IS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+ +RLI+DN+++ FE +H+  N+++G  G  A+KLDMSKA+DRVEW F+ ++M KMGF    +  I+ C++S+ YS LLNG  Q    P RG+RQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FLICAEG S LL+ EE   +  GLK+++  PS+S LFFADDS++FCRA ++    I R L TY  ASGQ IN  K  L  S+N           +L
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G+       QYLG+PS + +NK+++F  I D++WK L +WKE LFSAGGKE+L+KA+VQAIPTY MSCFR+P +L   I  M A FWWG +      HW 
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
        +W  +  +K +GG                        P+SLL   LR RYF   ++L A LGS PS TWRS++WG+EL  KG RW++G+G+R+    D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDIVK
        +       P+  K       V  ++ E   W    +  NF+  +   +L+ P   Y  DD +IW++   G+++VKS YH AVSLA   +++ S+S++   
Subjt:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDIVK

Query:  MWKSFWKLKAIPKAKIIASK-----------------------------RESTSHLMWECKWIKETWK--NFYPLTNHMYSLNRGGWTPMTNQTNSRLDQ
         W +FWKLK  PK +I   K                              E+  H ++ C   K  W+  NF   +    ++ R          ++ L  
Subjt:  MWKSFWKLKAIPKAKIIASK-----------------------------RESTSHLMWECKWIKETWK--NFYPLTNHMYSLNRGGWTPMTNQTNSRLDQ

Query:  RQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHV---------------------------------RWTQSPPNCWKLNVDASWIAKENRRGI
         +   F+         +   Y   S+R P   +   PS++                                 +WT  P    KLN DA+   + N  GI
Subjt:  RQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHV---------------------------------RWTQSPPNCWKLNVDASWIAKENRRGI

Query:  GWILRDSLGSSICSGFKRIKRSWNMKMLEMKAIAEGLKSLLGN
        G +LR+S G  + +  K  + ++  + +E   +A  L  LL +
Subjt:  GWILRDSLGSSICSGFKRIKRSWNMKMLEMKAIAEGLKSLLGN

XP_030968750.1 uncharacterized protein LOC115989220 [Quercus lobata] (e value: 3.2e-156; % identity: 38.57)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        M+ TKAPG DGM A+FY  YW IVGND + +    LN       IN T IAL+PK KNP+ M ++RPISL NV YK+++K LANR+++ L  IIS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+ +RLI+DNV++ FE +H  ++K+ GK   +A+KLDMSKAYDRVEW F+ K+M K+GF   WI  ++ CI ++ YSVL+NG      YP RGLRQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FL+C + FSAL+ +  + ++ +G+ + + CP ++ LFFADDSL+FCRAE ++C  +  ILK YE ASGQ +N +KS++  S N +P++ S +  IL
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G  +    G+YLG+PS   ++K +VF  IK+RV K L  WKEK+ S GGKEILIKA+ Q IPTYTMSCF +P  L ED+  M  NFWWG+   + K  W+
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
        SW KM  SK  GG                       DP SL+ +  + RYF   D L + L  +PS+ W+SI    E+ +KG RW++GNG+ + +  D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSG-KRVDSILDEKGRWKEKEVIENFSTT-EAEVILNTP-NY-ITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI
        + +P        + ++     V S++DE+ RW + +    F    EAE ILN P +Y + +D+IIW  +KR MFSVKS Y++A+ L    E    S  D 
Subjt:  ISTPGNEVPFLTKPDLSG-KRVDSILDEKGRWKEKEVIENFSTT-EAEVILNTP-NY-ITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI

Query:  -VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWKNFYPLTNHMYS------------LNRGGW
           +WK  W+LK   K +I A K                              ESTSH +  C+   + W N++     + +            LNRG  
Subjt:  -VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWKNFYPLTNHMYS------------LNRGGW

Query:  TPMTNQTNSRLDQRQFFR---FIYRK--IENSEIQPIDYLSK-----SLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGS
        TP   +T        ++R    ++    +  S+I     L++     +L +   R +   S V WT  PP  +K+NVD +         +G I+RD  G 
Subjt:  TPMTNQTNSRLDQRQFFR---FIYRK--IENSEIQPIDYLSK-----SLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGS

Query:  SICSGFKRIKRSWNMKMLEMKAIAEGL
         + +  K +   +  K  E  A+ E +
Subjt:  SICSGFKRIKRSWNMKMLEMKAIAEGL

TrEMBL top hits (e value, % identity, alignment)
A0A7N2L6Z9 Reverse transcriptase domain-containing protein (e value: 3.8e-155; % identity: 39.42)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        ++ TK+PG DGM A+F+  YWDIVG++ S + + +LN G S   INKT I LIPKT NP  M ++RPISLCNVIYK+++K LANR+K+ L  II+  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F   RLI+DNV++ +E +H   +K+ GK   +A KLDMSKA+DRVEW FI ++M KMGF E WI+ IM CI S+ YSV++NG       P RGLRQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FL+CAEG SALL      +  +G+ L + CP ++ LFFADDSL+FC+A  ++C  +K IL+ YE ASGQ +N +KSS+  S N +P+    +  IL
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G  +     +YLG+PS   R+K+ VF  IK+RV   L  WK KL S+GGKEILIKA+ QAIPTYTMSCF +P SL +++ +M  NFWWG+   + K  WI
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
        SW KM   K  GG                       +P SL  + L+ +YF   D L+A LGS PS+TWRSI    E+ KKG RW++GNG+R+ I  D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSG--KRVDSILDEKGR-WKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLD
        + +P +    +T P ++     V S++D   R WK   +   F   +AE IL  P    + DD IIW  +K+G FSVKS Y +AV+L  S E    SS D
Subjt:  ISTPGNEVPFLTKPDLSG--KRVDSILDEKGR-WKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLD

Query:  -IVKMWKSFWKLKAIPKAKIIA-----------------------------SKRESTSHLMWECKWIKETWKNFY--PL---------------------
          + +WK+ WKL    K KI A                              + E  +H +  C +    W  +   PL                     
Subjt:  -IVKMWKSFWKLKAIPKAKIIA-----------------------------SKRESTSHLMWECKWIKETWKNFY--PL---------------------

Query:  TNHMYSLNRGGWTPMTNQTNSRLDQRQFFRFIYRKIENSEI--QPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVD-ASWIAKENRRGIGWILR
          H+       W    N+ N R+            ++  E+  + ID    ++++  P  +S  +   W+  PP  +K+NVD A  I      G+G ++R
Subjt:  TNHMYSLNRGGWTPMTNQTNSRLDQRQFFRFIYRKIENSEI--QPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVD-ASWIAKENRRGIGWILR

Query:  DSLGSSICSGFKRIKRSWNMKMLEMKAIAEGL
        D  G  I +  K +   +  +  E+ AI +GL
Subjt:  DSLGSSICSGFKRIKRSWNMKMLEMKAIAEGL

A0A803NM27 Uncharacterized protein (e value: 4.1e-157; % identity: 39.1)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        MN   +PG DGM A+FY H W IVG+  +R  + +LN G S + +NKTLI LIPKTK P  M ++RPISLCNV+YK+++K+L  R K+ L  +IS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+P RLI+DN+++ FE +H   +K +GK G  A+KLDMSKA+DRVEWSF+  +M KMGF   WI  I++C+++ + S ++NG       P RGLRQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FLIC+EG S LL+ EES+    GL +++  PS+S L FADDSL+FC A+++ C  IKR+L TY +ASGQ +N +KS +  S N S  S      IL
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G+   +    YLG+P+ ++R+K+++F  IK+R+WK L  W +K+FS GGKE+L+KA++Q+IPTY MSCF++P     +I  + +NFWWG +  K K HW 
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
         W+ +  SK +GG                       +P SLL + L+GRYF   DFLSA      S TW+   WGREL KKG R ++GNG  +    DPW
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEV--PFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI
        I  PGN +  P       +    D I  EK  W   ++  +FS+ + E IL+ P  ++   D  +W     G + VKS YH+A  LA+    S SS    
Subjt:  ISTPGNEV--PFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI

Query:  VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWK-----------------NFYPLTNHMYSLN
        V  WKSFW+LK  PK K+ A K                              ES  H M+ CK  +  WK                 +F    +  Y+ +
Subjt:  VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWK-----------------NFYPLTNHMYSLN

Query:  R---------GGWTPMTNQTNSRLDQRQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDS
                    W+   N  + ++ Q+            S  Q    L  SL      +++  +H  WT  PPN  KLNVDA++     R G G I+RDS
Subjt:  R---------GGWTPMTNQTNSRLDQRQFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDS

Query:  LGS
         G+
Subjt:  LGS

A0A803NTN0 Uncharacterized protein (e value: 5.7e-159; % identity: 38.77)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        M+  K+PG+DGM AMFY  YW+IVGN  +++ + +LN G     +NK++I LIPK  NPS M +YRPISLCNVIYK+++KA+  R +  L  +IS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+  RLI+DN+++ FE IH   +K +G+ G  A+KLDMSKA+DRVEW F+  +M KMGF   W+  IM+CI +  +S  LNG       P RGLRQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FLIC+EGFS LL+ EES  N  GL+L +  PS+S L FADDSL+FCRA ++    IKRIL TY +ASGQ +N NKS +  S N SP + +  +  L
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
         +   +   +YLG+PS + R+KQ++F +IK++VWK L  W E++FSAGGKE+L+KA+VQ+IPTY MSCF++       +  M ANFWWG ++   K HW 
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
         W+ +  SK +GG                        P+SLL + L+ RYF    FL A +G +PS+TW+SI WGR+L  KG R+K+GNG  +    DPW
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP-NYITD-DEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDIVK
        I +  N  P ++    S   V + + ++  W    +   F   + E IL  P ++  D D +IW     G+++VKS +H+A +L    ++S+S       
Subjt:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP-NYITD-DEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDIVK

Query:  MWKSFWKLKAIPKAKIIA-----------------------------SKRESTSHLMWECKWIKETWKNFYPLTNHMYSLNRGGWTPMTNQTNSRLDQR-
         WK FW L   PK +I A                             S  ES  H ++ CK  +  WK         +SL+   +T     TN+RL+   
Subjt:  MWKSFWKLKAIPKAKIIA-----------------------------SKRESTSHLMWECKWIKETWKNFYPLTNHMYSLNRGGWTPMTNQTNSRLDQR-

Query:  -QFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHVR------WTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSSICSGFKRIKRSWNMKM
          F   +++    ++       + +   P P     P H R      W+    N +KLNVDA+ I+++ + G+G +LRD  G  I +  K  + S+    
Subjt:  -QFFRFIYRKIENSEIQPIDYLSKSLRIPGPRSESLPSHVR------WTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSSICSGFKRIKRSWNMKM

Query:  LEMKAIAEGL
        +E KA+   L
Subjt:  LEMKAIAEGL

A0A803Q8J4 Uncharacterized protein (e value: 2.3e-152; % identity: 38.39)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        MN   +PG DGM A+FY H WDIVG+  ++  + +LN G S + +NKTLI LIPK K P  M ++RPISLCNV+YK+++K+L  R K+ L  +IS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+P RLI+DN+++ FE +H   +K +G+ G  A+KLDMSKA+DRVEWSFI  +M KMGF   W+A IM+C+++ + S ++NG       P RGLRQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FLIC+EG S LL+ EES+    GL +++  PS+S L FADDSL+FC A+++ C  IKR+L TY +ASGQ +N +KS +  S N +  +      IL
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G+   D    YLG+P+ ++R+K+ +F +IK+R+WK L  W +K+FS GGKE+L+KA++Q+IPTY MSCF++P     +I  M +N+WWG +  K K HW 
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
         W+ +  SK +GG                       + +SLL + L+GRYF   DFLSA      S TW+ I WGREL KKG R K+GNG  +    DPW
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEV--PFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI
        I  PGN +  P     + +    D I  ++  W   ++  +FS+ +   IL+ P  N    D  IW     G + VKS Y  A S  +S + + S S   
Subjt:  ISTPGNEV--PFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTP--NYITDDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDI

Query:  VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWK-----------------NFYPLTNHMYSLN
           WKSFW+LK  PK KI + K                              ES  H ++ CK  +  WK                 +F    +  Y+ +
Subjt:  VKMWKSFWKLKAIPKAKIIASKR-----------------------------ESTSHLMWECKWIKETWK-----------------NFYPLTNHMYSLN

Query:  R---------GGWTPMTNQTNSRLDQRQFFRFIYRKIEN--SEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILR
                    W+   N  + +  Q+     I  K +N  +  +    LS    +  P + S+     W+  PP C KLNVDA++    N+ G G I+R
Subjt:  R---------GGWTPMTNQTNSRLDQRQFFRFIYRKIEN--SEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILR

Query:  DSLGS
        DS G+
Subjt:  DSLGS

A0A803Q9W0 Uncharacterized protein (e value: 2.1e-153; % identity: 36.5)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        +N+ KAPG DGM  +FY  +W+++G D +++C++ILN  K  +++NKTL+ LIPK + P ++G+YRPISLCNV YKI+AK LANRMK SL+++IS  Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
        F+  RLI DN ILGFE +H     R G    +A+KLDMSKAYDRVEW F+  +M  +G+ + W+ KIM+CI+SI +S+LLNG      +P RGLRQGDP+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        SPY+FL+C+EG S L++  E      GL+  +    LS LFFADDS IF  A   DC ++K IL  Y   SGQ IN +KS L   K I+      L+AIL
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI
        G+   D   +YLG+P+   + K++VF++I+ ++   LQ WK  LFS  G+EIL+KAI+QAIPTY MSCFR+P  L +DI+ M A FWWG S  K K HW 
Subjt:  GIAKADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWI

Query:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW
        +W+K+   KEKGG                       +P S+L + L+  Y+   +FL A +G   S+ WRSI+WGR++  KG RW++  G+ V I++D W
Subjt:  SWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSAPLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPW

Query:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTPNYIT--DDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDIVK
        +  P             G  +++I DE G W  +++ E F   +  +IL   +  T  +D++IW     G ++V S Y +  +  N Q A +S+  D  K
Subjt:  ISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTPNYIT--DDEIIWSRDKRGMFSVKSVYHMAVSLANSQEASSSSSLDIVK

Query:  MWKSFWKLKAIPKAKIIA-----------------------------SKRESTSHLMWECKWIKETWKNF--YPLTNHMYSLNRGGWTPMTNQTNSRLDQ
         W+  WK +A PK +                                 + E+  H +W C   K  W+NF  +P   H    +R     M      ++ +
Subjt:  MWKSFWKLKAIPKAKIIA-----------------------------SKRESTSHLMWECKWIKETWKNF--YPLTNHMYSLNRGGWTPMTNQTNSRLDQ

Query:  RQFFRFIY----------------------RKIENSEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSS
          F  F+                       + IE ++++    LS+        + S  S + W   P + + +N DAS    E R G+G ++R+  G+ 
Subjt:  RQFFRFIY----------------------RKIENSEIQPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSS

Query:  ICSGFKRIKRSWNMKMLEMKAI
        I +        ++++  E  A+
Subjt:  ICSGFKRIKRSWNMKMLEMKAI

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 4.8e-30; % identity: 25.94)
Query:  KAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKT-KNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVP
        K+PG DG  A FY  Y + +     ++   I   G       +  I LIPK  ++ ++  N+RPISL N+  KI+ K LANR++  ++ +I   Q  F+P
Subjt:  KAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKT-KNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVP

Query:  KRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPY
              N+      I    N+ K K+ H+ + +D  KA+D+++  F+ K + K+G    ++  I    +    +++LNG   + F    G RQG P+SP 
Subjt:  KRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPY

Query:  IFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAILGIA
        +F I  E  +  + +E+ +K   G++L K    LS   FADD +++          + +++  + + SG  IN+ KS      N +  + S++   L   
Subjt:  IFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAILGIA

Query:  KADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQ----NWKEKLFSAGGKEILIKAIV--QAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGK
         A +  +YLG+  Q  R+ + +FK     + K ++     WK    S  G+  ++K  +  + I  +     ++P + F ++ +    F W + R +
Subjt:  KADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQ----NWKEKLFSAGGKEILIKAIV--QAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGK

P08548 LINE-1 reverse transcriptase homolog (e value: 3.1e-29; % identity: 26.4)
Query:  KAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKT-KNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVP
        K+PG DG  + FY  + + +      +   I   G       +  I LIPK  K+P+   NYRPISL N+  KI+ K L NR++  ++ II   Q  F+P
Subjt:  KAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKT-KNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVP

Query:  KRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPY
              N+      I    NK K K  H+ + +D  KA+D ++  F+ + + K+G   T++  I         +++LNGV    F    G RQG P+SP 
Subjt:  KRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPY

Query:  IFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKS-SLMASKNISPDSVSRLSAILGI
        +F I  E  +  +  E+++K   G+ +      LS   FADD +++          +  ++K Y   SG  IN +KS + + + N   +   + S    +
Subjt:  IFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKS-SLMASKNISPDSVSRLSAILGI

Query:  A--KADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIV--QAIPTYTMSCFRIPSSLFEDINRMCANFWWGESR
           K   LG YL      K   ++ ++ ++  + + +  WK    S  G+  ++K  +  +AI  +     + P S F+D+ ++  +F W + +
Subjt:  A--KADELGQYLGMPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIV--QAIPTYTMSCFRIPSSLFEDINRMCANFWWGESR

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 6.1e-33; % identity: 27.16)
Query:  KAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPK-TKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVP
        K+PG DG  A FY  + + +     ++  +I   G       +  I LIPK  K+P+++ N+RPISL N+  KI+ K LANR++  ++ II P Q  F+P
Subjt:  KAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPK-TKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVP

Query:  KRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPY
              N+      IH   NK K K+ H+ + LD  KA+D+++  F+ K++ + G    ++  I         ++ +NG   +      G RQG P+SPY
Subjt:  KRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPY

Query:  IFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKS-SLMASKNISPDSVSRLSAILGI
        +F I  E  +  + +++ +K   G+++ K    +S L  ADD +++    +     +  ++ ++ E  G  IN NKS + + +KN   +   R +    I
Subjt:  IFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKS-SLMASKNISPDSVSRLSAILGI

Query:  AKADELGQYLG--MPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIV--QAIPTYTMSCFRIPSSLFEDINRMCANFWWGESR
           +   +YLG  +  + K    K FK++K  + + L+ WK+   S  G+  ++K  +  +AI  +     +IP+  F ++      F W   +
Subjt:  AKADELGQYLG--MPSQTKRNKQKVFKNIKDRVWKALQNWKEKLFSAGGKEILIKAIV--QAIPTYTMSCFRIPSSLFEDINRMCANFWWGESR

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 6.5e-35; %identity: 28.85)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV
        M   K+PG DG+   F+  +WD +G D  R+  E    G+      + +++L+PK  +   + N+RP+SL +  YKIVAKA++ R+KS L ++I P Q+ 
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAV

Query:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI
         VP R I DNV L  + +H    +R G S    + LD  KA+DRV+  ++   +    F   ++  +     S E  V +N         GRG+RQG P+
Subjt:  FVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPI

Query:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL
        S  ++ +  E F  LL      K  +GL L +    +    +ADD +I    +  D    +   + Y  AS   IN +KSS +   ++  D +    A  
Subjt:  SPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAIL

Query:  GIAKADELGQYLGM-PSQTKRNKQKVFKNIKDRVWKALQNWK--EKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKA
         I+   ++ +YLG+  S  +    + F  +++ V   L  WK   K+ S  G+ ++I  +V +   Y + C          I R   +F W      GK 
Subjt:  GIAKADELGQYLGM-PSQTKRNKQKVFKNIKDRVWKALQNWK--EKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKA

Query:  HWISWEKMSCSKEKGG
        HW+S    S   ++GG
Subjt:  HWISWEKMSCSKEKGG

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 1.1e-18; %identity: 33.57)
Query:  AIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGG------------------------DPSSLLGKTLRGRYFKGQDFLS
        A+P Y MSCFR+   L + +      FWW     K K  W++W+K+  SKE  G                         P +LL + LR RYF     + 
Subjt:  AIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGG------------------------DPSSLLGKTLRGRYFKGQDFLS

Query:  APLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWI
          +G+ PS+ WRSII GREL  +G    IG+G    +  D WI
Subjt:  APLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWI

Arabidopsis top hits (e value, %identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 4.8e-09; %identity: 37.18)
Query:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIV
        M   KAPG D   A F+   W +V + T     E    G   K+ N T I LIPK     ++  +RP+S C V+YKI+
Subjt:  MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIV

AT4G20520.1 RNA binding; RNA-directed DNA polymerases (e value: 2.2e-14; %identity: 37.21)
Query:  LANRMKSSLQDIISPGQAVFVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKI
        +  R+K  + ++I P QA F+P R+ +DN++   E +H+   K KG  G + +KLD+ KAYDR+ W ++   +   GFPE W+ +I
Subjt:  LANRMKSSLQDIISPGQAVFVPKRLISDNVILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKI

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 3.5e-36; %identity: 26.95)
Query:  AIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSA
        A+PTYTM+CF +P ++ + I  + A+FWW   +     HW +W+ +SC K +GG                        P SL+ K  + RYF   D L+A
Subjt:  AIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGG-----------------------DPSSLLGKTLRGRYFKGQDFLSA

Query:  PLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWI-STPGN------EVPFLTKPDLSG-KRVDSILDEKGRWKEKEVIEN-FSTTEAEVI--
        PLGS PSF W+SI   +E+ ++G R  +GNG+ +II +  W+ S P +       VP      +S   +V  ++DE GR   K+VIE  F   E ++I  
Subjt:  PLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWI-STPGN------EVPFLTKPDLSG-KRVDSILDEKGRWKEKEVIEN-FSTTEAEVI--

Query:  LNTPNYITDDEIIWSRDKRGMFSVKSVYHMAVSLANS----QEASSSSSLDIV-KMWKS---------FWKL--KAIPKAKIIA--------------SK
        L        D   W     G ++VKS Y +   + N     QE S  S   I  K+WKS          WK    ++P A  +A              S 
Subjt:  LNTPNYITDDEIIWSRDKRGMFSVKSVYHMAVSLANS----QEASSSSSLDIV-KMWKS---------FWKL--KAIPKAKIIA--------------SK

Query:  RESTSHLMWECKWIKETW--------------KNFYPLTNHMYSLNRGG--------------WTPMTNQTNSRLDQRQF-FRFIYRKIENSEIQPIDYL
        +E+ +HL+++C + + TW               + Y     +++L  G               W    N+       R+F  + + R+ E+ +++     
Subjt:  RESTSHLMWECKWIKETW--------------KNFYPLTNHMYSLNRGG--------------WTPMTNQTNSRLDQRQF-FRFIYRKIENSEIQPIDYL

Query:  SKSLRIPGPRSESLPSHVRWTQSPPNCW-KLNVDASWIAKENRRGIGWILRDSLGSSICSGFKRIKRSWNMKMLEMKAIAEGLKSL
        +++         +  S  RW + PP+ W K N DA+W     R GIGW+LR+  G     G + + +  ++   E++A+   + SL
Subjt:  SKSLRIPGPRSESLPSHVRWTQSPPNCW-KLNVDASWIAKENRRGIGWILRDSLGSSICSGFKRIKRSWNMKMLEMKAIAEGLKSL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 7.9e-20; %identity: 33.57)
Query:  AIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGG------------------------DPSSLLGKTLRGRYFKGQDFLS
        A+P Y MSCFR+   L + +      FWW     K K  W++W+K+  SKE  G                         P +LL + LR RYF     + 
Subjt:  AIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGG------------------------DPSSLLGKTLRGRYFKGQDFLS

Query:  APLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWI
          +G+ PS+ WRSII GREL  +G    IG+G    +  D WI
Subjt:  APLGSTPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 9.4e-13; %identity: 42.65)
Query:  LLNGVPQDVFYPGRGLRQGDPISPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDS
        ++NG PQ +  P RGLRQGDP+SPY+F++C E  S L  R +      G++++   P ++ L FADD+
Subjt:  LLNGVPQDVFYPGRGLRQGDPISPYIFLICAEGFSALLEREESLKNFSGLKLNKLCPSLSRLFFADDS


Sequences
CDS sequence
ATGAACTCTACTAAAGCCCCGGGAGCAGATGGTATGCATGCCATGTTCTACCATCACTACTGGGACATAGTAGGAAATGACACTTCGAGAATCTGCATTGAAATTCTAAA
TGCAGGAAAGAGCTCTAAGAAAATAAACAAGACCTTGATTGCCCTTATTCCCAAGACAAAAAATCCGAGCGAAATGGGGAACTACCGACCAATAAGTCTTTGCAACGTGA
TTTATAAAATTGTAGCAAAGGCGCTTGCTAACAGAATGAAATCCTCTCTTCAGGACATCATATCTCCTGGTCAAGCGGTCTTTGTCCCTAAAAGACTTATTTCAGACAAT
GTTATCTTGGGGTTCGAATGCATCCACGCTCGAAACAACAAACGAAAAGGGAAGTCTGGGCATATCGCCATGAAGCTCGACATGAGCAAAGCCTATGACAGGGTCGAATG
GTCTTTTATTCGGAAAATTATGGCTAAGATGGGCTTCCCCGAGACTTGGATCGCCAAGATTATGGACTGTATAGAATCGATAGAATACTCAGTGCTCCTAAATGGAGTTC
CTCAAGATGTGTTTTACCCTGGAAGAGGATTAAGGCAGGGAGATCCGATCTCCCCCTATATTTTCCTAATCTGTGCAGAGGGCTTCTCAGCGCTTCTTGAAAGGGAAGAA
TCTCTTAAAAACTTTAGCGGCTTAAAATTAAACAAGCTTTGTCCCTCACTCTCTCGTTTATTTTTTGCAGATGACAGCTTGATTTTCTGTCGTGCGGAGGAGAAGGACTG
CTTCACTATAAAAAGGATTCTGAAGACGTACGAGGAAGCATCCGGTCAAACCATAAATCTGAATAAATCTTCTTTGATGGCTAGCAAAAACATTAGCCCAGACTCTGTGT
CTAGGTTAAGTGCAATCCTTGGAATAGCTAAAGCTGACGAGCTTGGACAGTATCTTGGGATGCCATCCCAGACGAAAAGAAACAAGCAAAAAGTGTTTAAAAACATCAAA
GATAGAGTTTGGAAGGCTCTTCAGAATTGGAAAGAAAAGCTATTCTCGGCAGGGGGAAAAGAGATCCTGATTAAAGCCATAGTCCAAGCAATTCCAACGTATACCATGAG
CTGTTTTAGAATCCCTTCATCCCTTTTTGAGGATATAAACAGAATGTGCGCTAACTTTTGGTGGGGCGAATCCAGAGGCAAAGGCAAAGCTCACTGGATCAGTTGGGAGA
AGATGAGTTGCAGTAAAGAGAAAGGGGGGGACCCCTCCAGCCTGCTTGGCAAAACACTGAGGGGAAGATACTTCAAAGGCCAAGATTTCCTCTCAGCCCCTCTAGGTAGC
ACCCCGTCATTCACTTGGCGAAGCATAATCTGGGGAAGAGAGCTCTTTAAGAAAGGCTATAGGTGGAAAATTGGGAACGGGAAGAGAGTCATTATAGACCAAGACCCTTG
GATCTCAACTCCGGGAAATGAAGTTCCTTTCCTAACCAAGCCAGATCTCTCTGGAAAAAGAGTCGACTCTATCCTAGACGAAAAGGGAAGGTGGAAAGAAAAAGAGGTCA
TCGAAAATTTCTCCACGACAGAAGCTGAAGTAATCCTCAACACCCCAAATTATATCACTGACGATGAGATTATTTGGAGTCGTGACAAACGAGGAATGTTCTCAGTAAAG
AGTGTCTACCATATGGCCGTTTCTTTAGCGAATTCCCAGGAAGCTTCTTCTTCGAGCTCCTTGGACATAGTCAAAATGTGGAAATCGTTTTGGAAGCTTAAGGCCATCCC
GAAGGCTAAAATCATCGCGTCCAAAAGAGAATCAACAAGCCATCTTATGTGGGAATGTAAGTGGATTAAAGAGACCTGGAAGAATTTCTATCCCCTAACGAATCATATGT
ATTCTTTGAACAGAGGAGGGTGGACACCAATGACAAATCAAACAAACTCCAGACTAGACCAGCGCCAGTTCTTCAGATTCATATATCGAAAGATTGAAAACTCAGAGATC
CAGCCGATTGATTACCTGTCCAAGTCGCTTAGGATTCCAGGACCAAGGTCGGAGAGCCTTCCGAGTCATGTCAGATGGACTCAGTCGCCCCCAAATTGCTGGAAGCTAAA
TGTCGACGCCTCCTGGATCGCGAAGGAGAACCGTCGAGGCATCGGGTGGATCCTCCGTGACTCCCTAGGATCTTCGATCTGTTCAGGATTCAAGCGGATTAAGAGAAGCT
GGAATATGAAGATGCTTGAAATGAAAGCCATTGCTGAGGGGCTGAAAAGCCTACTTGGAAACGAGGCGATCAAAGAAGGATTGAAATCTAGTTGGTTTAAAGGAAAAGAA
AGAAGGGTTTGGATTATGGGGAAAGGGGGGAACATTGGGGTTGAGGGGTCGGAAGGATTAGGCAGGGGAGATGAGCCTTTCCGATGTCTCTCTCTTTCTTTCGTTTCTCT
TCCGCCAGCCACGCGCTCGCCCAGCCGAAGTCGCAGATCTCGTGAATCTCACCTAGATCTGCGAATGTTGCCCAGTCGCAATCACAGATCTCGCGAATCTCGTCCAGCCA
TAGCTCGTTCCCAGTCTCTCACGCAGCTGCAGCTCGCTCCTCGCCACTCGCCTAGCTGCAGCTCACTTGCAGTCGCTCGCAGTCACCCGTTTACAGTCCGTTCGCCGCCT
GTTCGCAGTCGCCTGTTTGTAGTCCGTTCGCCGCTCGTTTGCACGCCTCCGTTTGAATAG
mRNA sequence
ATGAACTCTACTAAAGCCCCGGGAGCAGATGGTATGCATGCCATGTTCTACCATCACTACTGGGACATAGTAGGAAATGACACTTCGAGAATCTGCATTGAAATTCTAAA
TGCAGGAAAGAGCTCTAAGAAAATAAACAAGACCTTGATTGCCCTTATTCCCAAGACAAAAAATCCGAGCGAAATGGGGAACTACCGACCAATAAGTCTTTGCAACGTGA
TTTATAAAATTGTAGCAAAGGCGCTTGCTAACAGAATGAAATCCTCTCTTCAGGACATCATATCTCCTGGTCAAGCGGTCTTTGTCCCTAAAAGACTTATTTCAGACAAT
GTTATCTTGGGGTTCGAATGCATCCACGCTCGAAACAACAAACGAAAAGGGAAGTCTGGGCATATCGCCATGAAGCTCGACATGAGCAAAGCCTATGACAGGGTCGAATG
GTCTTTTATTCGGAAAATTATGGCTAAGATGGGCTTCCCCGAGACTTGGATCGCCAAGATTATGGACTGTATAGAATCGATAGAATACTCAGTGCTCCTAAATGGAGTTC
CTCAAGATGTGTTTTACCCTGGAAGAGGATTAAGGCAGGGAGATCCGATCTCCCCCTATATTTTCCTAATCTGTGCAGAGGGCTTCTCAGCGCTTCTTGAAAGGGAAGAA
TCTCTTAAAAACTTTAGCGGCTTAAAATTAAACAAGCTTTGTCCCTCACTCTCTCGTTTATTTTTTGCAGATGACAGCTTGATTTTCTGTCGTGCGGAGGAGAAGGACTG
CTTCACTATAAAAAGGATTCTGAAGACGTACGAGGAAGCATCCGGTCAAACCATAAATCTGAATAAATCTTCTTTGATGGCTAGCAAAAACATTAGCCCAGACTCTGTGT
CTAGGTTAAGTGCAATCCTTGGAATAGCTAAAGCTGACGAGCTTGGACAGTATCTTGGGATGCCATCCCAGACGAAAAGAAACAAGCAAAAAGTGTTTAAAAACATCAAA
GATAGAGTTTGGAAGGCTCTTCAGAATTGGAAAGAAAAGCTATTCTCGGCAGGGGGAAAAGAGATCCTGATTAAAGCCATAGTCCAAGCAATTCCAACGTATACCATGAG
CTGTTTTAGAATCCCTTCATCCCTTTTTGAGGATATAAACAGAATGTGCGCTAACTTTTGGTGGGGCGAATCCAGAGGCAAAGGCAAAGCTCACTGGATCAGTTGGGAGA
AGATGAGTTGCAGTAAAGAGAAAGGGGGGGACCCCTCCAGCCTGCTTGGCAAAACACTGAGGGGAAGATACTTCAAAGGCCAAGATTTCCTCTCAGCCCCTCTAGGTAGC
ACCCCGTCATTCACTTGGCGAAGCATAATCTGGGGAAGAGAGCTCTTTAAGAAAGGCTATAGGTGGAAAATTGGGAACGGGAAGAGAGTCATTATAGACCAAGACCCTTG
GATCTCAACTCCGGGAAATGAAGTTCCTTTCCTAACCAAGCCAGATCTCTCTGGAAAAAGAGTCGACTCTATCCTAGACGAAAAGGGAAGGTGGAAAGAAAAAGAGGTCA
TCGAAAATTTCTCCACGACAGAAGCTGAAGTAATCCTCAACACCCCAAATTATATCACTGACGATGAGATTATTTGGAGTCGTGACAAACGAGGAATGTTCTCAGTAAAG
AGTGTCTACCATATGGCCGTTTCTTTAGCGAATTCCCAGGAAGCTTCTTCTTCGAGCTCCTTGGACATAGTCAAAATGTGGAAATCGTTTTGGAAGCTTAAGGCCATCCC
GAAGGCTAAAATCATCGCGTCCAAAAGAGAATCAACAAGCCATCTTATGTGGGAATGTAAGTGGATTAAAGAGACCTGGAAGAATTTCTATCCCCTAACGAATCATATGT
ATTCTTTGAACAGAGGAGGGTGGACACCAATGACAAATCAAACAAACTCCAGACTAGACCAGCGCCAGTTCTTCAGATTCATATATCGAAAGATTGAAAACTCAGAGATC
CAGCCGATTGATTACCTGTCCAAGTCGCTTAGGATTCCAGGACCAAGGTCGGAGAGCCTTCCGAGTCATGTCAGATGGACTCAGTCGCCCCCAAATTGCTGGAAGCTAAA
TGTCGACGCCTCCTGGATCGCGAAGGAGAACCGTCGAGGCATCGGGTGGATCCTCCGTGACTCCCTAGGATCTTCGATCTGTTCAGGATTCAAGCGGATTAAGAGAAGCT
GGAATATGAAGATGCTTGAAATGAAAGCCATTGCTGAGGGGCTGAAAAGCCTACTTGGAAACGAGGCGATCAAAGAAGGATTGAAATCTAGTTGGTTTAAAGGAAAAGAA
AGAAGGGTTTGGATTATGGGGAAAGGGGGGAACATTGGGGTTGAGGGGTCGGAAGGATTAGGCAGGGGAGATGAGCCTTTCCGATGTCTCTCTCTTTCTTTCGTTTCTCT
TCCGCCAGCCACGCGCTCGCCCAGCCGAAGTCGCAGATCTCGTGAATCTCACCTAGATCTGCGAATGTTGCCCAGTCGCAATCACAGATCTCGCGAATCTCGTCCAGCCA
TAGCTCGTTCCCAGTCTCTCACGCAGCTGCAGCTCGCTCCTCGCCACTCGCCTAGCTGCAGCTCACTTGCAGTCGCTCGCAGTCACCCGTTTACAGTCCGTTCGCCGCCT
GTTCGCAGTCGCCTGTTTGTAGTCCGTTCGCCGCTCGTTTGCACGCCTCCGTTTGAATAG
Protein sequence
MNSTKAPGADGMHAMFYHHYWDIVGNDTSRICIEILNAGKSSKKINKTLIALIPKTKNPSEMGNYRPISLCNVIYKIVAKALANRMKSSLQDIISPGQAVFVPKRLISDN
VILGFECIHARNNKRKGKSGHIAMKLDMSKAYDRVEWSFIRKIMAKMGFPETWIAKIMDCIESIEYSVLLNGVPQDVFYPGRGLRQGDPISPYIFLICAEGFSALLEREE
SLKNFSGLKLNKLCPSLSRLFFADDSLIFCRAEEKDCFTIKRILKTYEEASGQTINLNKSSLMASKNISPDSVSRLSAILGIAKADELGQYLGMPSQTKRNKQKVFKNIK
DRVWKALQNWKEKLFSAGGKEILIKAIVQAIPTYTMSCFRIPSSLFEDINRMCANFWWGESRGKGKAHWISWEKMSCSKEKGGDPSSLLGKTLRGRYFKGQDFLSAPLGS
TPSFTWRSIIWGRELFKKGYRWKIGNGKRVIIDQDPWISTPGNEVPFLTKPDLSGKRVDSILDEKGRWKEKEVIENFSTTEAEVILNTPNYITDDEIIWSRDKRGMFSVK
SVYHMAVSLANSQEASSSSSLDIVKMWKSFWKLKAIPKAKIIASKRESTSHLMWECKWIKETWKNFYPLTNHMYSLNRGGWTPMTNQTNSRLDQRQFFRFIYRKIENSEI
QPIDYLSKSLRIPGPRSESLPSHVRWTQSPPNCWKLNVDASWIAKENRRGIGWILRDSLGSSICSGFKRIKRSWNMKMLEMKAIAEGLKSLLGNEAIKEGLKSSWFKGKE
RRVWIMGKGGNIGVEGSEGLGRGDEPFRCLSLSFVSLPPATRSPSRSRRSRESHLDLRMLPSRNHRSRESRPAIARSQSLTQLQLAPRHSPSCSSLAVARSHPFTVRSPP
VRSRLFVVRSPLVCTPPFE
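
The CDS and protein sequences listed for this gene can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varies fastest). Assumption: the standard nuclear code applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as the wrapped lines shown on the page. Codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS listed above
print(translate("ATGAACTCTACTAAAGCCCCGGGAGCAGATGGT"))  # MNSTKAPGADG
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.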