; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039384 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039384
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:42759391..42769457
RNA-Seq ExpressionLag0039384
SyntenyLag0039384
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR015410 - Domain of unknown function DUF1985
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]6.0e-12832.68Show/hide
Query:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA
        + RL +    AWVVVGDFN  L  ++K GG    + Q+   +  ++DC L    F+G  FT + R    + + ER+DR V    +   + + +  HL+  
Subjt:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA

Query:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPP
         SDH PIL+    ++ E  + K+S RFHFEE+W   PD  ++I   + W + +   + + N L       K W      ++R  +    + L  L     
Subjt:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPP

Query:  PWDFAEIKRIEDILDKALEDEEIYWKQRLR-------------------------ENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAP
                ++E+ +   LE +EI W+QR R                           Q++  L +++  +T  +N +L+  F + E+E  + Q+ P KAP
Subjt:  PWDFAEIKRIEDILDKALEDEEIYWKQRLR-------------------------ENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAP

Query:  GPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSI
        G D  PALF+QKYW  VG+  +  CL ILN   SVR++N T IALIPKVK PT +S+F PISLC   YK+I+K +ANR+K +L  +++ENQSAFVP R I
Subjt:  GPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSI

Query:  FYNVR-------------------------------------------------------------------YLGD------PYRASNQ-----------
          NV                                                                    + G+      P R   Q           
Subjt:  FYNVR-------------------------------------------------------------------YLGD------PYRASNQ-----------

Query:  --ENFA---------------------PKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDN
          E F+                     P ++HL FADDS+ F +A+NE   AL+++   YE+ SGQ+IN  KSA  +SPN       +I G+L + VV  
Subjt:  --ENFA---------------------PKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDN

Query:  LGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCL
          KYLG+P+   + R+  F+ +K ++W+ + GWK K  S  G+EIL+K++ Q+IPTY MSCF +PK LC +++ +MA FWW   +  + IHW KW ++C 
Subjt:  LGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCL

Query:  PKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTF
         K  GGL FRD+E FN+ALLAKQ WR+   P  LV+++ + RY   V  L A + +N S  WR   W + LL  G+R R+G+G S   + D W+P  + F
Subjt:  PKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTF

Query:  K-----PLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY
        K      LP++         V D  T+S +W+V  LKD+    +++    IP++     D  IWHY
Subjt:  K-----PLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]1.5e-13429.54Show/hide
Query:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA
        + R+ N D S W++ GD NA L   E    +    SQ++  R+ MD C L D+ F G +FT  N +   +Q+ +R+DR +  D +  +FP+AS     W+
Subjt:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA

Query:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPW
         + H                                                 +    + ++   +  + WG+   + + + I   +  + D Y  P P 
Subjt:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPW

Query:  DFAEIKRIEDILDKALEDEEIYWKQRLREN------------QIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWS
        DF  I  +E+ L   LE EEI+WKQR RE+             I+  +  I T++T  +N +L+AP+ K EIE+A+ Q+ P KA GPD FPALFYQ YW 
Subjt:  DFAEIKRIEDILDKALEDEEIYWKQRLREN------------QIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWS

Query:  EVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNV-----------
         VG  T   CL+ LN    ++ WN T+IALIPK+K+P  ISDF PISLCNVSYKIISK + NR+K ++  ++S+ QSAFVP R+I  NV           
Subjt:  EVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNV-----------

Query:  -------------------------RYL----------------------------------------------GDPY-------------RASNQENFA
                                  YL                                              GDP                 N EN +
Subjt:  -------------------------RYL----------------------------------------------GDPY-------------RASNQENFA

Query:  PK------------ISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRR
         +            I+HL FADDSL F R+   +  AL+ +L  Y +ASGQ IN  KSAL  SPNVH E +  +  +L + +V + G YLG+PS FTRRR
Subjt:  PK------------ISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRR

Query:  RDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELF
         +                                                                      +++HW KW  +C PKE GGLNFRD+E F
Subjt:  RDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELF

Query:  NKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVF
        N+AL+AK VWR   +P+LLVSKV+K +Y    SLL A   S  S FW+GF+W R LL  G+R R+G+G +   F DPW+P+ TTFKPL    GA      
Subjt:  NKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVF

Query:  VSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY-------------------CSHDLS-----------------------------
        V+ FITA   WDV  +      +D ++I ++PIS  + +D W+WHY                   C+   +                             
Subjt:  VSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY-------------------CSHDLS-----------------------------

Query:  ------------------PMFPTCKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNNFADRVAW--LVRHLDSESFEKACITFWSLWNDRNSYNNN
                          P    C  + E+I HA   CKRA  I   +F  +   +  ++N +    W  L   L+ +    A IT W +WNDRNS  + 
Subjt:  ------------------PMFPTCKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNNFADRVAW--LVRHLDSESFEKACITFWSLWNDRNSYNNN

Query:  VAIMDWVERCEWVHEYW-TKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAM
          +     +CEW+  +  + ++    + S  T SN R       P  Q   P+  + +           L TDAAC      + +G II ++S  L  A 
Subjt:  VAIMDWVERCEWVHEYW-TKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAM

Query:  EFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALS
                +PL AEI+ ++ G++       T   V SDS+ AI++I  +     D  +W+++IQ L   F F++F + SRQ NR    LAK  ++
Subjt:  EFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALS

XP_023899813.1 uncharacterized protein LOC112011695 [Quercus suber]6.3e-12534.59Show/hide
Query:  TQKQP-----IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFP
        T K+P     ++ L       W+ +GDFN  +   EKEGG     SQ++     ++ CG +D+ + G  +T   ++    QI ER+DR +   E++ LFP
Subjt:  TQKQP-----IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFP

Query:  NASIEHLMWARSDHRPILLNGIMEEREFRTTKQSRRFHFEEVWHPD--CKQII---------------------SDLDCWNIQENDHQRLENCLRRRRAR
         A + HL  + SDH P+ L+ ++++++ + TKQ+  F FE +W  D  C++++                     ++L+CW  +E++  R     R R   
Subjt:  NASIEHLMWARSDHRPILLNGIMEEREFRTTKQSRRFHFEEVWHPD--CKQII---------------------SDLDCWNIQENDHQRLENCLRRRRAR

Query:  FKRWGKGTSFSIRQNILTNQRILQDLYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQI
        F+   + TSF   +            YK            IE + D+ + D         R +     L  I+ KV+  +N +L   F   E+ +A+ Q+
Subjt:  FKRWGKGTSFSIRQNILTNQRILQDLYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQI

Query:  HPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAF
        +P KAPGPD  P +F+Q++WS  G + +   LD LN   S  ++N+THI LIPK+K+P  +SDF PISLCNV+YKI SK + NR+K  L  I+SE QSAF
Subjt:  HPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAF

Query:  VPGRSIFYNV---------------------------RYLGDPYRAS----NQENF-----APKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASG
        V GR I  NV                           + L    +AS      E        PK+SHLFFADDSL FC+AS  +  AL+ +L  YE+ASG
Subjt:  VPGRSIFYNV---------------------------RYLGDPYRAS----NQENF-----APKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASG

Query:  QKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMP
        Q++N  K++LF S N   E++  I G     V+    KYLG+PS   R +R+ F +IK+++ +TL GWK K  S  G+E+L+K++AQ+IPTY MSCF +P
Subjt:  QKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMP

Query:  KTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGF
         +LC ++ +M+  FWWG  +   +I W +W  +C PK  GG+ F++++ FN ALLAKQ WRL      LV +V+K RY  +   + A + +N S  WR  
Subjt:  KTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGF

Query:  VWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITA-SMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIW
        + A+ L++ G+R R+G+G S   ++D W+P  +T+K +   R   Q    V + I   + EW  + +  +      +IIK+IPIS     D+ IW
Subjt:  VWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITA-SMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIW

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]6.0e-12827.64Show/hide
Query:  WVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWARSDHRPILLNG
        W+++GDFN  +    K GG++  ESQ+   R+ +D C L D  F GD FT +  +   + + ER+D   + D +   F   ++ HL +  SDHR I    
Subjt:  WVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWARSDHRPILLNG

Query:  IMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPWDFAEIKRIEDI
          +    +  + S RF FE++W  +     SD+D  N                                  ILT+                A IK  E +
Subjt:  IMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPWDFAEIKRIEDI

Query:  LDKALEDEEIYWKQRLR-----------------------ENQI---------------------------------------DLTLQDIKTKVTFYLNG
        LDK LE EE+YW+QR R                        N+I                                       D TL  I   VT  +N 
Subjt:  LDKALEDEEIYWKQRLR-----------------------ENQI---------------------------------------DLTLQDIKTKVTFYLNG

Query:  KLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLA
         L+ PF K ++++A+  + P K+PG D   A+FYQK W  VGN+ +   L +LN   +    N T I LIPK+KK   + DF PISLCNV  K+I+K+L 
Subjt:  KLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLA

Query:  NRMKWILQEIVSENQSAFVPGRSIFYN---------------------------------------VRYLGDPYRASNQ-----------ENF-------
         R K +L  ++SE QSAF+P R I  N                                       ++ + D    +N+            NF       
Subjt:  NRMKWILQEIVSENQSAFVPGRSIFYN---------------------------------------VRYLGDPYRASNQ-----------ENF-------

Query:  --------------------------------------------------APKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALF
                                                          AP+ISHLFFADDSL FC A+     A+K  L  Y QASGQ +N +KS L 
Subjt:  --------------------------------------------------APKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALF

Query:  VSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMM
         SPN  +  +     +L M + +   KYLG+P+   R ++  F EIK+++W+ ++ W  K FSIGG+EILLK++ QSIPTYLMSCF +P TLC  + +MM
Subjt:  VSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMM

Query:  AWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGI
        + FWWGS E+G +IHW+ W ++C  K  GG+ FR    +N+ALLAKQ WRL  NPS L+S+++K RY    S L AP   + S+ W+G +  R LL+SG+
Subjt:  AWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGI

Query:  RKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHYCS-------------
        R ++G+G+      DPW+P  T F P+     A      V++ IT   +W+   L +     D++ I TIP+S     D  IWHY S             
Subjt:  RKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHYCS-------------

Query:  ---------------------------------------HDLSPMFPT--------------CKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNN
                                               HD  P+  +              C+   E+I HAL GCK A  +        D    +Q N
Subjt:  ---------------------------------------HDLSPMFPT--------------CKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNN

Query:  FADRVAWLVRHLDSESFEKACITFWSLWNDRNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASST
          D + +L         E      WS+W +RN   +       V    +   Y +    +      P  +  +      +P +   PP+ G +  N    
Subjt:  FADRVAWLVRHLDSESFEKACITFWSLWNDRNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASST

Query:  EFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQ
                DAA +   N    GA+I    G +  A       ++     E KA+ H +    +L++    + +D++     + G     S  +  I  + 
Subjt:  EFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQ

Query:  ELRESFEFLAFKYVSRQSNRKVDYLAKHAL
         L   F  ++  +  R +N+   YLA++AL
Subjt:  ELRESFEFLAFKYVSRQSNRKVDYLAKHAL

XP_030505068.1 uncharacterized protein LOC115720043 [Cannabis sativa]7.4e-12628.25Show/hide
Query:  WVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWARSDHRPILLNG
        W+V+GDFN T+ + +K GG +  + Q+   R  +D CGLQ+L F G+ FT  N+  R   + ER+D   V   ++  F +  + HL + + D R +    
Subjt:  WVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWARSDHRPILLNG

Query:  IMEEREFRTTKQSRRFHFEEVWHPD--CKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYK--NPPPWDFAEIKR
        I         K   RF FE++W  +  C  II+        E   Q L+N  +    + + W +     + + I  +Q  +  L    +     F ++K+
Subjt:  IMEEREFRTTKQSRRFHFEEVWHPD--CKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYK--NPPPWDFAEIKR

Query:  IEDILDKALEDEEIYWKQRLREN--------------------------------------------------------------QIDLTLQDIKTKVTF
         E++LD  L  EE YW+QR R +                                                               +   +  I   +T 
Subjt:  IEDILDKALEDEEIYWKQRLREN--------------------------------------------------------------QIDLTLQDIKTKVTF

Query:  YLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIIS
         +N  L  PF   E+  A++ +    +PG D    +FY  YW  VGN+ +   L +LN   S   +N T I LIPKVKKP+ +S   PISLCNV YK++S
Subjt:  YLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIIS

Query:  KVLANRMKWILQEIVSENQSAFVPGRSIFYNV----------RYL-------------------------------------------------------
        K +  R+K  L  ++SE+QSAF+  R I  NV          ++L                                                       
Subjt:  KVLANRMKWILQEIVSENQSAFVPGRSIFYNV----------RYL-------------------------------------------------------

Query:  -----------------GDP-------------YRASNQENF------------APKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEK
                         GDP             +R   QE              AP +SHLFFADDSL  CRA+++   A+K+ L+ Y +ASGQ++N EK
Subjt:  -----------------GDP-------------YRASNQENF------------APKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEK

Query:  SALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADI
        S L  SPN    ++T +  +L+M + +   +YLG+PS   R +   F EIK+++W  L  W+ K FSIGG+E+LLK++AQ+IPTY MSCF + K+L   I
Subjt:  SALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADI

Query:  HSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLL
         +MM  FWWGS  S   I+WK W ++   K +GG+ F+    FN+ALLAKQ WR+F+NPS L+ +V+K R+ +  S L A + +  S+ WRG +W + LL
Subjt:  HSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLL

Query:  ESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHYCSH--------
          G+R ++GDG       +PW+P  TTFKPL + +G     + VS  I  S  W+   LK +    D++ I  IP++L   +D  +WH+ SH        
Subjt:  ESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHYCSH--------

Query:  -----DLSPMFPT------------------CKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNNFADRVAWLVRHLDSESFEKACITFWSLWNDR
              L    P+                  C    E+I HAL  CKRA  +  +    +D  I    +  + +  L     S   E+     WS+WN+R
Subjt:  -----DLSPMFPT------------------CKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNNFADRVAWLVRHLDSESFEKACITFWSLWNDR

Query:  NSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGR
        N   +            +   Y  +   + +D +  +L+ + +N         + PP+  +             L TDAA     + S +GAI+  ++G 
Subjt:  NSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGR

Query:  LSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALS
        +   M       + P   E+ AL+H ++ LQ L +    + +DS   +  +   K+  S+ +  +  I  L  +F      +V R +N     LAK ALS
Subjt:  LSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALS

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein2.5e-12730.87Show/hide
Query:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA
        +  LH      W+V+GDFN     +EK G      +Q+   R+++ DC L+DL + G  FT SNR+E  N +  R+DR V  D ++ LFP A + H++ A
Subjt:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA

Query:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDH------QRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQD
         SDH  ++++     +     ++ +RF FE +W     C++ I     WN            Q+++ C    R +  +W   T   +   I+  ++    
Subjt:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDH------QRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQD

Query:  LYKNPP--PWDFAEIKRIEDILDKALEDEEIYWKQRLR------------------------------------------------------------EN
          +N P   ++ +E+  +   L+   E EE++W+QR R                                                              
Subjt:  LYKNPP--PWDFAEIKRIEDILDKALEDEEIYWKQRLR------------------------------------------------------------EN

Query:  QIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISD
         ID  +Q +   V+  +N  LM P+   EI  A+ QI P KAPGPD   ALF+QKYW  VG   SL  LD LN  R +   N T+IALIPKVK P  +++
Subjt:  QIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISD

Query:  FIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNV-------RYL-------------------------------------------
        F PISLCNV YKI+SKVL NRMK IL  ++S++QSAFVPGR I  NV        YL                                           
Subjt:  FIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNV-------RYL-------------------------------------------

Query:  --------------------------------GDPY-----------------RASNQENF--------APKISHLFFADDSLDFCRASNEQVWALKSIL
                                        GDP                  +A     F         P+ISHLFFADDS+ FCRAS      ++++L
Subjt:  --------------------------------GDPY-----------------RASNQENF--------APKISHLFFADDSLDFCRASNEQVWALKSIL

Query:  SQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTY
        S YE+ASGQK+N +K+ALF S N     +  I  L   S      KYLG+P    R +R  F +IK+RIW+ LQGWK K  S  GRE L+K++ Q+IPTY
Subjt:  SQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTY

Query:  LMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSN
         MSCF  P  LCA+I SM   FWWG     ++IHW +   +  PK++GG+ FRD+ LFNKALLA+Q WRL  +P  LVS+ +K +Y    S L  PI +N
Subjt:  LMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSN

Query:  CSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASME-WDVNKLKDVVDIDDLNIIKTIPISLSHDEDQ
         S  WR    +R +L++G+R R+G G S   +KD W+P  T++K +   R   +E   V   I   M  W+ + L+ V    D+ IIK IP+S     D+
Subjt:  CSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASME-WDVNKLKDVVDIDDLNIIKTIPISLSHDEDQ

Query:  WIW--------------HYCSHDLSPMFPTCKSKLETIDHALCGCKRATTICDVMF-QRVDSEIPIQNNFADRVAWLVRHLDSESFEKACITFWSL-WND
         IW              H   HD + M          ++ +  G K +  + + ++  +V  +I +   F  R    +    ++ F++  ++F+S  W +
Subjt:  WIW--------------HYCSHDLSPMFPTCKSKLETIDHALCGCKRATTICDVMF-QRVDSEIPIQNNFADRVAWLVRHLDSESFEKACITFWSL-WND

Query:  RNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLS
         +    +  +  W  +C++    W+   +      +P+LS
Subjt:  RNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLS

A0A5E4FZN9 PREDICTED: retrotransposon2.9e-12832.68Show/hide
Query:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA
        + RL +    AWVVVGDFN  L  ++K GG    + Q+   +  ++DC L    F+G  FT + R    + + ER+DR V    +   + + +  HL+  
Subjt:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA

Query:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPP
         SDH PIL+    ++ E  + K+S RFHFEE+W   PD  ++I   + W + +   + + N L       K W      ++R  +    + L  L     
Subjt:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPP

Query:  PWDFAEIKRIEDILDKALEDEEIYWKQRLR-------------------------ENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAP
                ++E+ +   LE +EI W+QR R                           Q++  L +++  +T  +N +L+  F + E+E  + Q+ P KAP
Subjt:  PWDFAEIKRIEDILDKALEDEEIYWKQRLR-------------------------ENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAP

Query:  GPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSI
        G D  PALF+QKYW  VG+  +  CL ILN   SVR++N T IALIPKVK PT +S+F PISLC   YK+I+K +ANR+K +L  +++ENQSAFVP R I
Subjt:  GPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSI

Query:  FYNVR-------------------------------------------------------------------YLGD------PYRASNQ-----------
          NV                                                                    + G+      P R   Q           
Subjt:  FYNVR-------------------------------------------------------------------YLGD------PYRASNQ-----------

Query:  --ENFA---------------------PKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDN
          E F+                     P ++HL FADDS+ F +A+NE   AL+++   YE+ SGQ+IN  KSA  +SPN       +I G+L + VV  
Subjt:  --ENFA---------------------PKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDN

Query:  LGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCL
          KYLG+P+   + R+  F+ +K ++W+ + GWK K  S  G+EIL+K++ Q+IPTY MSCF +PK LC +++ +MA FWW   +  + IHW KW ++C 
Subjt:  LGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCL

Query:  PKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTF
         K  GGL FRD+E FN+ALLAKQ WR+   P  LV+++ + RY   V  L A + +N S  WR   W + LL  G+R R+G+G S   + D W+P  + F
Subjt:  PKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTF

Query:  K-----PLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY
        K      LP++         V D  T+S +W+V  LKD+    +++    IP++     D  IWHY
Subjt:  K-----PLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY

A0A6J1DX30 uncharacterized protein LOC1110248747.2e-13529.54Show/hide
Query:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA
        + R+ N D S W++ GD NA L   E    +    SQ++  R+ MD C L D+ F G +FT  N +   +Q+ +R+DR +  D +  +FP+AS     W+
Subjt:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA

Query:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPW
         + H                                                 +    + ++   +  + WG+   + + + I   +  + D Y  P P 
Subjt:  RSDHRPILLNGIMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPW

Query:  DFAEIKRIEDILDKALEDEEIYWKQRLREN------------QIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWS
        DF  I  +E+ L   LE EEI+WKQR RE+             I+  +  I T++T  +N +L+AP+ K EIE+A+ Q+ P KA GPD FPALFYQ YW 
Subjt:  DFAEIKRIEDILDKALEDEEIYWKQRLREN------------QIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWS

Query:  EVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNV-----------
         VG  T   CL+ LN    ++ WN T+IALIPK+K+P  ISDF PISLCNVSYKIISK + NR+K ++  ++S+ QSAFVP R+I  NV           
Subjt:  EVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNV-----------

Query:  -------------------------RYL----------------------------------------------GDPY-------------RASNQENFA
                                  YL                                              GDP                 N EN +
Subjt:  -------------------------RYL----------------------------------------------GDPY-------------RASNQENFA

Query:  PK------------ISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRR
         +            I+HL FADDSL F R+   +  AL+ +L  Y +ASGQ IN  KSAL  SPNVH E +  +  +L + +V + G YLG+PS FTRRR
Subjt:  PK------------ISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRR

Query:  RDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELF
         +                                                                      +++HW KW  +C PKE GGLNFRD+E F
Subjt:  RDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELF

Query:  NKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVF
        N+AL+AK VWR   +P+LLVSKV+K +Y    SLL A   S  S FW+GF+W R LL  G+R R+G+G +   F DPW+P+ TTFKPL    GA      
Subjt:  NKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVF

Query:  VSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY-------------------CSHDLS-----------------------------
        V+ FITA   WDV  +      +D ++I ++PIS  + +D W+WHY                   C+   +                             
Subjt:  VSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHY-------------------CSHDLS-----------------------------

Query:  ------------------PMFPTCKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNNFADRVAW--LVRHLDSESFEKACITFWSLWNDRNSYNNN
                          P    C  + E+I HA   CKRA  I   +F  +   +  ++N +    W  L   L+ +    A IT W +WNDRNS  + 
Subjt:  ------------------PMFPTCKSKLETIDHALCGCKRATTICDVMFQRVDSEIPIQNNFADRVAW--LVRHLDSESFEKACITFWSLWNDRNSYNNN

Query:  VAIMDWVERCEWVHEYW-TKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAM
          +     +CEW+  +  + ++    + S  T SN R       P  Q   P+  + +           L TDAAC      + +G II ++S  L  A 
Subjt:  VAIMDWVERCEWVHEYW-TKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAM

Query:  EFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALS
                +PL AEI+ ++ G++       T   V SDS+ AI++I  +     D  +W+++IQ L   F F++F + SRQ NR    LAK  ++
Subjt:  EFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALS

A0A803PWX1 Uncharacterized protein2.8e-13128.55Show/hide
Query:  QPIERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLM
        Q +  L       W+ VGDFN  +   EK GG       +   ++ +DDC   D   S    T  N +   +QI ER+DR +  +E+L  F  A ++ L 
Subjt:  QPIERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLM

Query:  WARSDHRPILLN--GIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQRLE-NC-LRRRRARFKRWGKGTSFSIRQNILTNQRILQD
        W  SDHR +++N    ++  +    K+  RFHFEE W    +C +II ++  W  ++   + +   C + +     + W K     +   I   ++IL +
Subjt:  WARSDHRPILLN--GIMEEREFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQRLE-NC-LRRRRARFKRWGKGTSFSIRQNILTNQRILQD

Query:  LYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLR-----------------------ENQI-------------------------------------
        L     P  +  I+ +ED L+  LE +E YW+QR R                       +N+I                                     
Subjt:  LYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLR-----------------------ENQI-------------------------------------

Query:  --DLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISD
             L+ ++ KV+  +N +LM  F   E+  AV  ++P KAPG D  PALFYQK+WS++ +     CL++LN    +   NDT +ALIPKV KP  I +
Subjt:  --DLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISD

Query:  FIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIF--------------------------------------------------YNVRYL
        F PISLCNV YKI+SK LANR++  L ++VS++QSAF+ GR I                                                   Y+V ++
Subjt:  FIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIF--------------------------------------------------YNVRYL

Query:  GDPYRASNQENFA---------------------------------------------------------PKISHLFFADDSLDFCRASNEQVWALKSIL
            R      F+                                                           +SHLFFADDSL F  A+ ++    K +L
Subjt:  GDPYRASNQENFA---------------------------------------------------------PKISHLFFADDSLDFCRASNEQVWALKSIL

Query:  SQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTY
         +Y +ASGQ +N  KS +     V + +RT +  ++ + VVDN GKYLG+PS   R ++  F+ IK ++W  L+GWKG +FS  G+E+L+K++ Q+IPTY
Subjt:  SQYEQASGQKINVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTY

Query:  LMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSN
         MSCF +PK     IHSM A FWWGS+E   +IHW KW ++C  KEQGGL FRD+ LFN+ALLAKQVWR    P+ L S+V+K  Y     ++ A   ++
Subjt:  LMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSN

Query:  CSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFK-----PLP-----ITRGAYQEGVFV-SDFITASMEWDVNKLKDVVDIDDLNIIKTI
         S  WR  VW + +++ G R RIG+G S     DPW+P+  TFK     PLP     I     + GVF  S +      +    ++      +    K  
Subjt:  CSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFK-----PLP-----ITRGAYQEGVFV-SDFITASMEWDVNKLKDVVDIDDLNIIKTI

Query:  PISLSHDEDQWIWHYC----------SHDLSPMFPTCK----SKLETIDHALCGCKRATTICD--VMFQRVDSEIPIQNNFADRVAWLVR---HLDSESF
         + +      ++W             +H    + P CK       E + HAL GC+    + +    + R+      +    D +A+L+R       + F
Subjt:  PISLSHDEDQWIWHYC----------SHDLSPMFPTCK----SKLETIDHALCGCKRATTICD--VMFQRVDSEIPIQNNFADRVAWLVR---HLDSESF

Query:  EKACITFWSLWNDRNSYNNN------VAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAA
        E   I  W+LW  RNS N+        AI+DW  +  ++HE        F++ +      QR        + +  PP  G    N            DA 
Subjt:  EKACITFWSLWNDRNSYNNN------VAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNASSTEFGALLFTDAA

Query:  CSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAF
             N +   +++ +  GR+  A   F     +PL  E++A++ GI+   +  +    V SD + A+ ++M +++   DV   I QI+EL +       
Subjt:  CSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFEFLAF

Query:  KYVSRQSNRKVDYLAKHALSHGRSML
         +V R++N+    LA  AL +  S +
Subjt:  KYVSRQSNRKVDYLAKHALSHGRSML

A0A803QPB7 Uncharacterized protein2.3e-12529.45Show/hide
Query:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA
        ++RL      AWV  GDFN  + + EK GG    +  ++  R ++  C L++++  GD FT  N +   N I E++DRI+    +   F   ++  L W 
Subjt:  IERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWA

Query:  RSDHRPILLNGIMEER-EFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQ----RLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDL
         SDHRP+L    ++++ E    K   RFHFE+ W  + +C +II D+  W  +++DHQ     L+  L     +  +W +      R+++   + + + L
Subjt:  RSDHRPILLNGIMEER-EFRTTKQSRRFHFEEVW--HPDCKQIISDLDCWNIQENDHQ----RLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDL

Query:  YKNPPPWDFAEIKRIEDILDK---------ALEDEEIYWKQRLR-------------------------------ENQIDLT---LQDIKTKVTFYLNGK
                 A+  +I D L K           E  EIYWKQR R                                N ID+     +    +++   N  
Subjt:  YKNPPPWDFAEIKRIEDILDK---------ALEDEEIYWKQRLR-------------------------------ENQIDLT---LQDIKTKVTFYLNGK

Query:  LMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLAN
        LM PF   E+++A+ QIHP KAPG D  P LF+QK W  VG   +  CLD+LN        N+T I LIPKVK+PT + +F PISLCNV YK+ISK LAN
Subjt:  LMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIISKVLAN

Query:  RMKWILQEIVSENQSAFVPGRSIFYNVRYLGDPYRASNQENFA---------------PKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKIN
        RMK  L  ++ +NQSAF+ GR I  N     +      +  FA                ++SHL FADDSL F  A+ E   ALK +L  YE+ SGQ IN
Subjt:  RMKWILQEIVSENQSAFVPGRSIFYNVRYLGDPYRASNQENFA---------------PKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKIN

Query:  VEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLC
         EK+ + V   V   + T +   L +++V +  KYLG+P+   + ++  F +I+ +I   LQGWK   FS  G++IL+K++ Q++P Y+MSCF + K + 
Subjt:  VEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLC

Query:  ADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWAR
         +I  ++A F WGST++  ++HW  W  +C  KE GG+ FRD++ FN++LLAKQ W+L  NP  L++KV+K  Y +  S   A      S  WRG +W R
Subjt:  ADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWAR

Query:  GLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDE---------------
         LL  G R  +G+G      +D W+P+   F     ++      V ++  +     W  N+++     DD+  +  I  +++  +               
Subjt:  GLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDE---------------

Query:  -------------------------------------DQWIWHYCSHDLSPMF-------------PTCKSKLETIDHALCGCKRATTICDVM-FQRVDS
                                               +IW   +H +                   C  + E I HAL  C +   I  +  F ++  
Subjt:  -------------------------------------DQWIWHYCSHDLSPMF-------------PTCKSKLETIDHALCGCKRATTICDVM-FQRVDS

Query:  EIPIQ-NNFADRVAWLVRHLDSESFEKACITFWSLWNDRNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGI
         IP+     AD + WL  HL  E F K     W +W  RN++     I+D      W  +    T +    +++ T + +  +     PQ          
Subjt:  EIPIQ-NNFADRVAWLVRHLDSESFEKACITFWSLWNDRNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGI

Query:  MVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDV
                    L+ TDA+      G    A+I    G L  A   F     + L AE  A+  GI+L  R  IT A V SD+ + IK +  D    +D 
Subjt:  MVTNASSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDV

Query:  YHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALSHGRSMLCPSGLTNRACAFV
           +  I+ LR  F+ L F + +R  NR  + LAK +    +S +    L N A AF+
Subjt:  YHWILQIQELRESFEFLAFKYVSRQSNRKVDYLAKHALSHGRSMLCPSGLTNRACAFV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.2e-1523.04Show/hide
Query:  LMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSV-------RDWNDTHIALIPKVKKPTLISD-FIPISLCNVSYK
        L  P    EI   ++ +   K+PGPD F A FYQ+Y  E+          +L L +S+         + +  I LIPK  + T   + F PISL N+  K
Subjt:  LMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSV-------RDWNDTHIALIPKVKKPTLISD-FIPISLCNVSYK

Query:  IISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVR------------------------------------------------YL--------------
        I++K+LANR++  +++++  +Q  F+PG   ++N+R                                                YL              
Subjt:  IISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVR------------------------------------------------YL--------------

Query:  ------------------GDPY-------------RASNQEN-------FAPKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALF
                          G P              RA  QE           ++    FADD + +          L  ++S + + SG KINV+KS  F
Subjt:  ------------------GDPY-------------RASNQEN-------FAPKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQKINVEKSALF

Query:  VSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKE----IKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSC--FHMPKTLCA
        +  N + +  + I G L  ++     KYLG+    TR  +D FKE    + + I +    WK    S  GR  ++K        Y  +     +P T   
Subjt:  VSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKE----IKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSC--FHMPKTLCA

Query:  DIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTN
        ++      F W      KR    K S++    + GG+   D +L+ KA + K  W  + N
Subjt:  DIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTN

P08548 LINE-1 reverse transcriptase homolog1.1e-1227.5Show/hide
Query:  QRILQDLYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKY
        Q+IL + YK      +  +K I+  L       E     RL + ++++                L  P    EI   +  +   K+PGPD F + FYQ +
Subjt:  QRILQDLYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKY

Query:  WSEVGNITSLNCLDILNLRRSV-------RDWNDTHIALIPKV-KKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVR
          E+  I       +LNL +++         + + +I LIPK  K PT   ++ PISL N+  KI++K+L NR++  +++I+  +Q  F+PG   ++N+R
Subjt:  WSEVGNITSLNCLDILNLRRSV-------RDWNDTHIALIPKV-KKPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVR

P0C2F6 Putative ribonuclease H protein At1g657505.5e-2330.13Show/hide
Query:  VPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGG
        +P    R  +D F EI +R+   + GW+ K  S  GR  L K++  S+P + MS   +P+++   +  +   F WGST   K+ H  KWS VC PK++GG
Subjt:  VPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGG

Query:  LNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRY---ANQVSLLSAPIKSNCSVFWRGF-VWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKP
        L  R  +  N+AL++K  WRL    + L + V++ +Y     + S    P K + S  WR   +  R ++  G+    GDG+   F+ D WV  +   + 
Subjt:  LNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRY---ANQVSLLSAPIKSNCSVFWRGF-VWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKP

Query:  LPITRGAYQEGVFVSDFITASMEWDVNKL
            R    + V   D       WD  K+
Subjt:  LPITRGAYQEGVFVSDFITASMEWDVNKL

P11369 LINE-1 retrotransposable element ORF2 protein3.0e-1328.5Show/hide
Query:  QRILQDLYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKY
        Q  ++  YK         +  ++  LD+       Y   +L ++Q+D                 L +P    EIE  ++ +   K+PGPD F A FYQ +
Subjt:  QRILQDLYKNPPPWDFAEIKRIEDILDKALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKY

Query:  WSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKK-PTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVR
          ++  I       I         + +  I LIPK +K PT I +F PISL N+  KI++K+LANR++  ++ I+  +Q  F+PG   ++N+R
Subjt:  WSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKK-PTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVR

P93295 Uncharacterized mitochondrial protein AtMg003101.9e-2840.13Show/hide
Query:  SIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKE-QGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLS
        ++P Y MSCF + K LC  + S M  FWW S E+ ++I W  W  +C  KE  GGL FRD+  FN+ALLAKQ +R+   P  L+S++++ RY    S++ 
Subjt:  SIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKE-QGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLS

Query:  APIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPL
          + +  S  WR  +  R LL  G+ + IGDG  T  + D W+  ET   PL
Subjt:  APIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPL

Arabidopsis top hitse value%identityAlignment
AT1G31150.1 Domain of unknown function (DUF1985)1.6e-1429.73Show/hide
Query:  KINLCCRTEIMDTINNIL-GDRCREAFRNTCFGHLLDFTFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDK
        ++N+  R E + TI N+L G    E  +++ FG L +F   + S S  L+H L+  Q   K+  EL+F  GG  ++F +REF ++TGL CG LP  D+ K
Subjt:  KINLCCRTEIMDTINNIL-GDRCREAFRNTCFGHLLDFTFKKTS-SQLLLH-LIQHQCKPKRTSELYFKIGGKILKFGLREFALITGLNCGPLPQLDKDK

Query:  LQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAF
            S++   +       R  T+  V   ++    +   K+   L  +   ++   ++  +  + V M+ D + F  YPWGR AF
Subjt:  LQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMA-QLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAF

AT1G43760.1 DNAse I-like superfamily protein8.7e-0839.08Show/hide
Query:  EIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIIS
        EI  AV  +   KAPGPD F A F+ + W  V + T     +       ++ +N T I LIPKV     +S F P+S C V YKII+
Subjt:  EIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVKKPTLISDFIPISLCNVSYKIIS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-1227.97Show/hide
Query:  KYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPK
        +YLG+P    +    D+  + ++I   +  W  ++ S  GR  L+ S+  S+  + MS F +P     +I S+ + F W   E   +     WS VC PK
Subjt:  KYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPK

Query:  EQGGLNFRDMELFNKALLAKQVWRLFTNPSL---LVSKVIKGR
        ++GGL  R ++  NK       W +  N +L   +  K++K R
Subjt:  EQGGLNFRDMELFNKALLAKQVWRLFTNPSL---LVSKVIKGR

AT4G29090.1 Ribonuclease H-like superfamily protein6.6e-3234.9Show/hide
Query:  SIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSA
        ++PTY M+CF +PKT+C  I S++A FWW + +  K +HWK W  +   K +GG+ F+D+E FN ALL KQ+WR+ + P  L++KV K RY ++   L+A
Subjt:  SIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSA

Query:  PIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITAS-------MEWDVNKLKDVVDI
        P+ S  S  W+    ++ +L  G R  +G+G+    ++  W+  +     L + R   QE   VS  +  S        EW     KDV+++
Subjt:  PIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITAS-------MEWDVNKLKDVVDI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-2940.13Show/hide
Query:  SIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKE-QGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLS
        ++P Y MSCF + K LC  + S M  FWW S E+ ++I W  W  +C  KE  GGL FRD+  FN+ALLAKQ +R+   P  L+S++++ RY    S++ 
Subjt:  SIPTYLMSCFHMPKTLCADIHSMMAWFWWGSTESGKRIHWKKWSMVCLPKE-QGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLS

Query:  APIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPL
          + +  S  WR  +  R LL  G+ + IGDG  T  + D W+  ET   PL
Subjt:  APIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTFFFKDPWVPKETTFKPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCCAAAAGCAACCAATAGAACGCCTACACAATCATGACGACTCTGCATGGGTTGTGGTTGGAGATTTTAATGCAACTCTTCTATATGAAGAGAAGGAAGGTGG
TAATGTAGTGCGAGAGTCCCAACTTCAGTTGTGTCGTGATACTATGGATGACTGTGGATTACAAGATTTGGAATTTTCAGGAGATATGTTTACATTGTCGAATAGACAAG
AAAGGGATAATCAGATTAATGAACGCATTGATAGAATTGTTGTAAAAGATGAGTATCTTCTATTATTTCCAAACGCTTCCATCGAGCACTTGATGTGGGCTCGTTCTGAT
CATCGTCCTATTCTTTTGAATGGTATTATGGAAGAACGAGAATTTCGTACGACCAAACAGTCACGAAGGTTCCATTTTGAGGAAGTGTGGCATCCAGATTGCAAACAAAT
TATATCAGATTTGGACTGCTGGAACATTCAGGAAAACGATCACCAAAGACTTGAAAATTGTCTTCGGCGGCGTAGAGCTCGATTCAAGCGATGGGGCAAAGGTACATCTT
TCTCTATCAGGCAAAATATTTTAACGAACCAACGTATTCTCCAAGATTTATACAAAAACCCACCACCTTGGGATTTTGCTGAGATAAAACGAATTGAAGATATTCTGGAC
AAAGCACTTGAAGATGAAGAGATTTATTGGAAACAAAGGTTAAGAGAAAATCAAATTGATTTGACTTTACAGGATATTAAGACGAAAGTGACGTTTTACCTGAATGGGAA
ACTGATGGCACCATTTAAAAAATGTGAAATTGAAATAGCAGTCAGTCAGATACACCCATTTAAGGCTCCTGGACCAGATGATTTTCCTGCATTGTTTTATCAGAAATACT
GGAGTGAGGTTGGTAATATTACTTCCTTAAACTGTTTGGATATTTTAAATCTGAGAAGATCAGTTAGAGACTGGAACGACACTCATATTGCTTTGATTCCCAAAGTAAAG
AAACCAACCTTGATCTCTGATTTCATACCGATTAGCTTGTGTAATGTCTCGTATAAAATTATTTCAAAAGTTTTAGCTAATCGCATGAAGTGGATTCTGCAAGAAATTGT
CTCTGAAAATCAATCTGCTTTTGTTCCTGGTCGTTCTATTTTTTATAATGTGCGGTATCTAGGCGATCCATATCGGGCTTCAAACCAGGAAAATTTTGCCCCGAAAATAT
CACACCTTTTCTTTGCAGATGATAGTCTCGATTTCTGTCGAGCTTCCAATGAACAGGTGTGGGCGTTGAAGAGCATTCTGTCTCAATACGAACAGGCGTCGGGTCAGAAG
ATCAATGTTGAGAAGTCAGCATTATTTGTCTCACCAAATGTGCATATTGAGTTAAGAACGGTAATATTTGGTTTGTTGGAAATGTCAGTGGTGGATAATCTTGGGAAATA
TCTAGGTGTCCCGTCAGCATTTACTAGGAGAAGGAGGGATGATTTTAAAGAAATCAAGCAACGTATTTGGCAGACTCTACAAGGATGGAAAGGTAAATATTTTTCCATTG
GAGGAAGGGAAATTTTGCTCAAAAGTATAGCTCAGTCTATTCCCACATACTTGATGAGTTGTTTTCACATGCCAAAAACTCTATGTGCTGATATACACTCAATGATGGCC
TGGTTCTGGTGGGGTTCGACTGAGTCAGGAAAAAGGATACACTGGAAGAAATGGTCTATGGTCTGTTTGCCTAAGGAACAAGGAGGTTTGAATTTTAGGGATATGGAGCT
TTTTAATAAAGCTCTATTAGCCAAACAAGTGTGGCGGCTGTTTACAAACCCATCATTACTAGTTTCCAAGGTTATAAAGGGTCGATATGCCAATCAAGTCTCTCTTTTAT
CGGCTCCAATCAAAAGTAACTGCTCTGTTTTTTGGCGAGGTTTTGTTTGGGCACGAGGGCTTTTGGAAAGTGGGATTCGAAAAAGAATTGGGGACGGTAAATCAACATTT
TTTTTTAAAGACCCTTGGGTCCCAAAAGAGACAACCTTCAAGCCTTTGCCAATAACGAGAGGAGCTTATCAGGAGGGTGTGTTTGTTTCAGATTTTATTACTGCTTCAAT
GGAATGGGACGTGAACAAACTAAAGGATGTAGTGGATATAGATGATCTTAATATAATCAAGACCATTCCAATTAGTTTATCCCATGATGAAGATCAATGGATTTGGCATT
ATTGCTCACATGATTTGTCTCCTATGTTCCCAACTTGTAAATCCAAGTTGGAGACAATTGATCATGCCCTTTGTGGATGTAAGCGAGCTACGACAATTTGTGATGTTATG
TTTCAAAGGGTGGACTCTGAAATCCCGATTCAAAATAATTTTGCAGATCGTGTGGCTTGGCTTGTGCGTCACCTGGATAGTGAGTCTTTTGAGAAAGCATGTATTACCTT
TTGGTCCTTGTGGAATGACAGGAACAGTTATAATAATAATGTAGCCATTATGGACTGGGTAGAACGTTGCGAGTGGGTTCATGAATATTGGACGAAAACAAGAATTTCTT
TCCAAGATCGGTCCAACCCTACACTTTCTAACCAGAGGGAGAATGGACTAACGCTGATGCCACAAACGCAGCTTGCACCTCCAAATGATGGAATTATGGTCACGAATGCA
TCGTCGACCGAATTTGGCGCCTTGTTGTTCACTGATGCTGCCTGTTCACCTTACCCGAATGGGTCTAGGTATGGGGCGATAATCACCGAGGCTAGTGGACGTTTATCTGG
GGCGATGGAGTTTTTTGATTCTACAAGCTATACCCCTCTGGCTGCAGAAATCAAAGCATTAATTCATGGTATTAGACTATTACAACGCTTGCAAATAACTAAGGCGCACG
TGCTTTCGGACTCCGTTAATGCCATCAAGATGATTATGGGTGACAAGCAGATTACATCTGATGTCTATCATTGGATTTTGCAAATCCAAGAGCTGCGTGAATCATTTGAG
TTTTTAGCATTTAAGTATGTGTCAAGGCAAAGTAATAGGAAGGTCGATTACTTAGCGAAACATGCTCTTTCTCATGGTCGATCCATGTTATGCCCATCTGGTTTAACAAA
TCGTGCATGCGCCTTCGTGAAGACCATTTTCCAAAGAAAAACCCCCCTTCGAACAACCTGCTTCCTCTCCGTTGTTCGTGCTCCCTTTCCGACGATCGTTCGACGCTCCC
TCTCCGACGTCGTTCGCAGCCCGCTCTCAGACGCCATCCGAAGCTCCCCCATCCGAAGCTCCCCCTCCGTTCAACCCCCCACCTTCGACCGCTCAGCCTCCCCCGCACAA
ACCGTTCATGCATTTGATCCCGCCGCCGTTCTTGCGCCTGGTTCCACCGCCGAACGTTTGCATTCCACCCATTTGTGCTCGCCGTACAGCCCCCCTCAGAGTGTAGAACA
TTTGCGAGGTATGCCCAAGGATAAGCTCGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCA
CATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCGCTAAGGGATCCGGCGAAAAGACGAAAGGGGTAAAAAGGGACAGAGACGATGGAGGT
CCGAGCAAAAAAGTAACTCCATCAAAGAAAACAAAAGTTCGTGACTGGACCAAGAAGACCAACGATGAGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACAAA
GCGGTCGAAACAGACAAAAAACACAGATAAGGCGAGCCATGTGACACAAGAAGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATCTG
ATAGTGTGACGAATGACAACTCCACGAGTGATGAAGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAAAGGAAGCTCCTAAAAAAAAGAAGGGTGGAAAAAAG
GGAAAAAAGCTGAAGACCATGGTTGAAGAAGGTGACACCGTCCGAGTGGACGATGATTACTTTATGTCACCATCGAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTG
CAGAACAGAAATAATGGACACCATCAACAACATCTTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGCCACCTGCTTGACTTTACGTTCAAAAAGACGT
CTTCCCAGTTACTATTGCACCTGATCCAGCATCAGTGCAAACCCAAACGGACGTCAGAACTTTACTTCAAGATTGGAGGGAAAATCTTAAAGTTTGGCCTACGGGAGTTC
GCATTAATTACGGGACTAAATTGTGGCCCATTGCCACAACTTGACAAAGACAAGCTACAAGATTCTTCCAGGTTCAAGGATGAGTATTTTGCTGATGACGAGGGTGTCAG
AAGAAAGACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGACAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGTTTTTTGTTACCTAGGCAAG
AAAAGGTGCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGTCGCCTTCACACTATTGACAAACTACATGCAG
AAGGCATCCGTTAGTAGGGGCAGCGTTGGTATTGGAATGGGCGGGTTCGTATATGCCATCCTTGCATGGGCATACGAAGTGATACCCGCATTGAGCGCCCCACCGACCAA
CTACACAAGACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTCGAAGCTCAACCCGAATGGAGAGAACTACACGCCAAGATATTCCAATCCCCATCGCTGG
AGGTGGTACCATTGGACCCAACCGACACGGAAATGCAGATGTCGTACTTCCAACCTTTCTTGCAAGATGAATTGGCTTCTCGACGATTGGCAGGAGACAATCAACAAGTA
GAAGGCGATGTTCGAATCCCACCGAACTTCTCAATAGGGGCACCCCCAATGATCAGCCAGATGGATGTGATGGAAAAACACCATCAAGAAATAATTGGTAAGCTCGACAA
AGTTTACTCTGTGCTAGGAGCCTTGGTGGATACTTTGAGGGAGATACACGAGCTTGCCAACCCCCCAAACTCAAAATTCAAGATGCCCGGAGATGTTGGGACTGGTATTG
ACCCTACAACAAAAGACGATGATGTGGAGGGAAAAGAAGAAACTGATGAAAAAGATGAGCAAGATGACCATGGATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGAC
GATGGAGGACCAACAGGTGGGAAACAGCAACAGGGGTTGACCACCCCCGGACCAACAACCCTTGTACAGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGG
GACAAAGAAAACAGGAGGTGGTGAAGGCACAAAGGCCTGTGATGATGTCGACGAGACAATAAACAAGGCTATACTGTCAATAGATGAGGCCAAGGTGATTGAAAAGTTTA
ATAGGGACCGCAAGGGTAAAGCGGTTATGATGGCCAAGGGAATAAAAATTAAGGAACCTACCACTCCGCTCATTCAAAATAGGACACCCCTCCGTGAGGTCAACGGGACC
ATAACTCGGGGGGGAGCCAAGCGCTCAGATTGTGTGGAAGGAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAT
GGAATCCCTACCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACCCAAAAGCAACCAATAGAACGCCTACACAATCATGACGACTCTGCATGGGTTGTGGTTGGAGATTTTAATGCAACTCTTCTATATGAAGAGAAGGAAGGTGG
TAATGTAGTGCGAGAGTCCCAACTTCAGTTGTGTCGTGATACTATGGATGACTGTGGATTACAAGATTTGGAATTTTCAGGAGATATGTTTACATTGTCGAATAGACAAG
AAAGGGATAATCAGATTAATGAACGCATTGATAGAATTGTTGTAAAAGATGAGTATCTTCTATTATTTCCAAACGCTTCCATCGAGCACTTGATGTGGGCTCGTTCTGAT
CATCGTCCTATTCTTTTGAATGGTATTATGGAAGAACGAGAATTTCGTACGACCAAACAGTCACGAAGGTTCCATTTTGAGGAAGTGTGGCATCCAGATTGCAAACAAAT
TATATCAGATTTGGACTGCTGGAACATTCAGGAAAACGATCACCAAAGACTTGAAAATTGTCTTCGGCGGCGTAGAGCTCGATTCAAGCGATGGGGCAAAGGTACATCTT
TCTCTATCAGGCAAAATATTTTAACGAACCAACGTATTCTCCAAGATTTATACAAAAACCCACCACCTTGGGATTTTGCTGAGATAAAACGAATTGAAGATATTCTGGAC
AAAGCACTTGAAGATGAAGAGATTTATTGGAAACAAAGGTTAAGAGAAAATCAAATTGATTTGACTTTACAGGATATTAAGACGAAAGTGACGTTTTACCTGAATGGGAA
ACTGATGGCACCATTTAAAAAATGTGAAATTGAAATAGCAGTCAGTCAGATACACCCATTTAAGGCTCCTGGACCAGATGATTTTCCTGCATTGTTTTATCAGAAATACT
GGAGTGAGGTTGGTAATATTACTTCCTTAAACTGTTTGGATATTTTAAATCTGAGAAGATCAGTTAGAGACTGGAACGACACTCATATTGCTTTGATTCCCAAAGTAAAG
AAACCAACCTTGATCTCTGATTTCATACCGATTAGCTTGTGTAATGTCTCGTATAAAATTATTTCAAAAGTTTTAGCTAATCGCATGAAGTGGATTCTGCAAGAAATTGT
CTCTGAAAATCAATCTGCTTTTGTTCCTGGTCGTTCTATTTTTTATAATGTGCGGTATCTAGGCGATCCATATCGGGCTTCAAACCAGGAAAATTTTGCCCCGAAAATAT
CACACCTTTTCTTTGCAGATGATAGTCTCGATTTCTGTCGAGCTTCCAATGAACAGGTGTGGGCGTTGAAGAGCATTCTGTCTCAATACGAACAGGCGTCGGGTCAGAAG
ATCAATGTTGAGAAGTCAGCATTATTTGTCTCACCAAATGTGCATATTGAGTTAAGAACGGTAATATTTGGTTTGTTGGAAATGTCAGTGGTGGATAATCTTGGGAAATA
TCTAGGTGTCCCGTCAGCATTTACTAGGAGAAGGAGGGATGATTTTAAAGAAATCAAGCAACGTATTTGGCAGACTCTACAAGGATGGAAAGGTAAATATTTTTCCATTG
GAGGAAGGGAAATTTTGCTCAAAAGTATAGCTCAGTCTATTCCCACATACTTGATGAGTTGTTTTCACATGCCAAAAACTCTATGTGCTGATATACACTCAATGATGGCC
TGGTTCTGGTGGGGTTCGACTGAGTCAGGAAAAAGGATACACTGGAAGAAATGGTCTATGGTCTGTTTGCCTAAGGAACAAGGAGGTTTGAATTTTAGGGATATGGAGCT
TTTTAATAAAGCTCTATTAGCCAAACAAGTGTGGCGGCTGTTTACAAACCCATCATTACTAGTTTCCAAGGTTATAAAGGGTCGATATGCCAATCAAGTCTCTCTTTTAT
CGGCTCCAATCAAAAGTAACTGCTCTGTTTTTTGGCGAGGTTTTGTTTGGGCACGAGGGCTTTTGGAAAGTGGGATTCGAAAAAGAATTGGGGACGGTAAATCAACATTT
TTTTTTAAAGACCCTTGGGTCCCAAAAGAGACAACCTTCAAGCCTTTGCCAATAACGAGAGGAGCTTATCAGGAGGGTGTGTTTGTTTCAGATTTTATTACTGCTTCAAT
GGAATGGGACGTGAACAAACTAAAGGATGTAGTGGATATAGATGATCTTAATATAATCAAGACCATTCCAATTAGTTTATCCCATGATGAAGATCAATGGATTTGGCATT
ATTGCTCACATGATTTGTCTCCTATGTTCCCAACTTGTAAATCCAAGTTGGAGACAATTGATCATGCCCTTTGTGGATGTAAGCGAGCTACGACAATTTGTGATGTTATG
TTTCAAAGGGTGGACTCTGAAATCCCGATTCAAAATAATTTTGCAGATCGTGTGGCTTGGCTTGTGCGTCACCTGGATAGTGAGTCTTTTGAGAAAGCATGTATTACCTT
TTGGTCCTTGTGGAATGACAGGAACAGTTATAATAATAATGTAGCCATTATGGACTGGGTAGAACGTTGCGAGTGGGTTCATGAATATTGGACGAAAACAAGAATTTCTT
TCCAAGATCGGTCCAACCCTACACTTTCTAACCAGAGGGAGAATGGACTAACGCTGATGCCACAAACGCAGCTTGCACCTCCAAATGATGGAATTATGGTCACGAATGCA
TCGTCGACCGAATTTGGCGCCTTGTTGTTCACTGATGCTGCCTGTTCACCTTACCCGAATGGGTCTAGGTATGGGGCGATAATCACCGAGGCTAGTGGACGTTTATCTGG
GGCGATGGAGTTTTTTGATTCTACAAGCTATACCCCTCTGGCTGCAGAAATCAAAGCATTAATTCATGGTATTAGACTATTACAACGCTTGCAAATAACTAAGGCGCACG
TGCTTTCGGACTCCGTTAATGCCATCAAGATGATTATGGGTGACAAGCAGATTACATCTGATGTCTATCATTGGATTTTGCAAATCCAAGAGCTGCGTGAATCATTTGAG
TTTTTAGCATTTAAGTATGTGTCAAGGCAAAGTAATAGGAAGGTCGATTACTTAGCGAAACATGCTCTTTCTCATGGTCGATCCATGTTATGCCCATCTGGTTTAACAAA
TCGTGCATGCGCCTTCGTGAAGACCATTTTCCAAAGAAAAACCCCCCTTCGAACAACCTGCTTCCTCTCCGTTGTTCGTGCTCCCTTTCCGACGATCGTTCGACGCTCCC
TCTCCGACGTCGTTCGCAGCCCGCTCTCAGACGCCATCCGAAGCTCCCCCATCCGAAGCTCCCCCTCCGTTCAACCCCCCACCTTCGACCGCTCAGCCTCCCCCGCACAA
ACCGTTCATGCATTTGATCCCGCCGCCGTTCTTGCGCCTGGTTCCACCGCCGAACGTTTGCATTCCACCCATTTGTGCTCGCCGTACAGCCCCCCTCAGAGTGTAGAACA
TTTGCGAGGTATGCCCAAGGATAAGCTCGTGTCAACTAGAGCGAGCGATCGTTTGAAGGCTGCGGGAGTAACGCCAGGAAGAAAACCCCCGGAACAAACGTCCCCAATCA
CATTGGGGAGCGAACAGGACTCTGAAGAAGCCATGAGTACAACAGTTTCAGTCGCTAAGGGATCCGGCGAAAAGACGAAAGGGGTAAAAAGGGACAGAGACGATGGAGGT
CCGAGCAAAAAAGTAACTCCATCAAAGAAAACAAAAGTTCGTGACTGGACCAAGAAGACCAACGATGAGATTGAGAAACCCACTGAGGCACGAAGCAATAAGAAGACAAA
GCGGTCGAAACAGACAAAAAACACAGATAAGGCGAGCCATGTGACACAAGAAGTTGCCCCTGAAACAAGCGAGGACACCGCTAAACATGACACCGAAGACACCGAATCTG
ATAGTGTGACGAATGACAACTCCACGAGTGATGAAGGGGAAGAACAAGGGAAAAAGAAGGCATCACTTGCTAAAAAGGAAGCTCCTAAAAAAAAGAAGGGTGGAAAAAAG
GGAAAAAAGCTGAAGACCATGGTTGAAGAAGGTGACACCGTCCGAGTGGACGATGATTACTTTATGTCACCATCGAAAAGAAGTAAGGCCCTAAAGATTAACCTATGTTG
CAGAACAGAAATAATGGACACCATCAACAACATCTTAGGAGATAGGTGCAGAGAAGCTTTCAGAAACACGTGCTTCGGCCACCTGCTTGACTTTACGTTCAAAAAGACGT
CTTCCCAGTTACTATTGCACCTGATCCAGCATCAGTGCAAACCCAAACGGACGTCAGAACTTTACTTCAAGATTGGAGGGAAAATCTTAAAGTTTGGCCTACGGGAGTTC
GCATTAATTACGGGACTAAATTGTGGCCCATTGCCACAACTTGACAAAGACAAGCTACAAGATTCTTCCAGGTTCAAGGATGAGTATTTTGCTGATGACGAGGGTGTCAG
AAGAAAGACACTTAATATAGTATTCAACGCAATGAAGCATGGTGTCGAGACAGACCTCGTAAAGATGGCGCAGTTGTATTGTTTGGAGAGTTTTTTGTTACCTAGGCAAG
AAAAGGTGCACATTGAAGAGGAACATGTCCTAATGGTTGAAGACCAAGAATTGTTCACCACCTACCCTTGGGGGCGCGTCGCCTTCACACTATTGACAAACTACATGCAG
AAGGCATCCGTTAGTAGGGGCAGCGTTGGTATTGGAATGGGCGGGTTCGTATATGCCATCCTTGCATGGGCATACGAAGTGATACCCGCATTGAGCGCCCCACCGACCAA
CTACACAAGACGGATCAGAAATACAGTCCCCCGCATCATAAATTGGGAGGTCGAAGCTCAACCCGAATGGAGAGAACTACACGCCAAGATATTCCAATCCCCATCGCTGG
AGGTGGTACCATTGGACCCAACCGACACGGAAATGCAGATGTCGTACTTCCAACCTTTCTTGCAAGATGAATTGGCTTCTCGACGATTGGCAGGAGACAATCAACAAGTA
GAAGGCGATGTTCGAATCCCACCGAACTTCTCAATAGGGGCACCCCCAATGATCAGCCAGATGGATGTGATGGAAAAACACCATCAAGAAATAATTGGTAAGCTCGACAA
AGTTTACTCTGTGCTAGGAGCCTTGGTGGATACTTTGAGGGAGATACACGAGCTTGCCAACCCCCCAAACTCAAAATTCAAGATGCCCGGAGATGTTGGGACTGGTATTG
ACCCTACAACAAAAGACGATGATGTGGAGGGAAAAGAAGAAACTGATGAAAAAGATGAGCAAGATGACCATGGATTAGAGAAAAATCCTTCTCATCGAAGGGAAGACGAC
GATGGAGGACCAACAGGTGGGAAACAGCAACAGGGGTTGACCACCCCCGGACCAACAACCCTTGTACAGACTGAAACTCGTGTAGATGGCGAAGGCACGGGAGATGGCGG
GACAAAGAAAACAGGAGGTGGTGAAGGCACAAAGGCCTGTGATGATGTCGACGAGACAATAAACAAGGCTATACTGTCAATAGATGAGGCCAAGGTGATTGAAAAGTTTA
ATAGGGACCGCAAGGGTAAAGCGGTTATGATGGCCAAGGGAATAAAAATTAAGGAACCTACCACTCCGCTCATTCAAAATAGGACACCCCTCCGTGAGGTCAACGGGACC
ATAACTCGGGGGGGAGCCAAGCGCTCAGATTGTGTGGAAGGAGTGGGGAGCCTGCAAGCCACAGGAATTTATGTGGACGCGATGAGGGGCACGTGGACAAAAGAATCGAT
GGAATCCCTACCGCCAGAATTCTTCCAGCCGTCTTTTGATCTTCATCTCAGTCAGGGTTAA
Protein sequenceShow/hide protein sequence
MATQKQPIERLHNHDDSAWVVVGDFNATLLYEEKEGGNVVRESQLQLCRDTMDDCGLQDLEFSGDMFTLSNRQERDNQINERIDRIVVKDEYLLLFPNASIEHLMWARSD
HRPILLNGIMEEREFRTTKQSRRFHFEEVWHPDCKQIISDLDCWNIQENDHQRLENCLRRRRARFKRWGKGTSFSIRQNILTNQRILQDLYKNPPPWDFAEIKRIEDILD
KALEDEEIYWKQRLRENQIDLTLQDIKTKVTFYLNGKLMAPFKKCEIEIAVSQIHPFKAPGPDDFPALFYQKYWSEVGNITSLNCLDILNLRRSVRDWNDTHIALIPKVK
KPTLISDFIPISLCNVSYKIISKVLANRMKWILQEIVSENQSAFVPGRSIFYNVRYLGDPYRASNQENFAPKISHLFFADDSLDFCRASNEQVWALKSILSQYEQASGQK
INVEKSALFVSPNVHIELRTVIFGLLEMSVVDNLGKYLGVPSAFTRRRRDDFKEIKQRIWQTLQGWKGKYFSIGGREILLKSIAQSIPTYLMSCFHMPKTLCADIHSMMA
WFWWGSTESGKRIHWKKWSMVCLPKEQGGLNFRDMELFNKALLAKQVWRLFTNPSLLVSKVIKGRYANQVSLLSAPIKSNCSVFWRGFVWARGLLESGIRKRIGDGKSTF
FFKDPWVPKETTFKPLPITRGAYQEGVFVSDFITASMEWDVNKLKDVVDIDDLNIIKTIPISLSHDEDQWIWHYCSHDLSPMFPTCKSKLETIDHALCGCKRATTICDVM
FQRVDSEIPIQNNFADRVAWLVRHLDSESFEKACITFWSLWNDRNSYNNNVAIMDWVERCEWVHEYWTKTRISFQDRSNPTLSNQRENGLTLMPQTQLAPPNDGIMVTNA
SSTEFGALLFTDAACSPYPNGSRYGAIITEASGRLSGAMEFFDSTSYTPLAAEIKALIHGIRLLQRLQITKAHVLSDSVNAIKMIMGDKQITSDVYHWILQIQELRESFE
FLAFKYVSRQSNRKVDYLAKHALSHGRSMLCPSGLTNRACAFVKTIFQRKTPLRTTCFLSVVRAPFPTIVRRSLSDVVRSPLSDAIRSSPIRSSPSVQPPTFDRSASPAQ
TVHAFDPAAVLAPGSTAERLHSTHLCSPYSPPQSVEHLRGMPKDKLVSTRASDRLKAAGVTPGRKPPEQTSPITLGSEQDSEEAMSTTVSVAKGSGEKTKGVKRDRDDGG
PSKKVTPSKKTKVRDWTKKTNDEIEKPTEARSNKKTKRSKQTKNTDKASHVTQEVAPETSEDTAKHDTEDTESDSVTNDNSTSDEGEEQGKKKASLAKKEAPKKKKGGKK
GKKLKTMVEEGDTVRVDDDYFMSPSKRSKALKINLCCRTEIMDTINNILGDRCREAFRNTCFGHLLDFTFKKTSSQLLLHLIQHQCKPKRTSELYFKIGGKILKFGLREF
ALITGLNCGPLPQLDKDKLQDSSRFKDEYFADDEGVRRKTLNIVFNAMKHGVETDLVKMAQLYCLESFLLPRQEKVHIEEEHVLMVEDQELFTTYPWGRVAFTLLTNYMQ
KASVSRGSVGIGMGGFVYAILAWAYEVIPALSAPPTNYTRRIRNTVPRIINWEVEAQPEWRELHAKIFQSPSLEVVPLDPTDTEMQMSYFQPFLQDELASRRLAGDNQQV
EGDVRIPPNFSIGAPPMISQMDVMEKHHQEIIGKLDKVYSVLGALVDTLREIHELANPPNSKFKMPGDVGTGIDPTTKDDDVEGKEETDEKDEQDDHGLEKNPSHRREDD
DGGPTGGKQQQGLTTPGPTTLVQTETRVDGEGTGDGGTKKTGGGEGTKACDDVDETINKAILSIDEAKVIEKFNRDRKGKAVMMAKGIKIKEPTTPLIQNRTPLREVNGT
ITRGGAKRSDCVEGVGSLQATGIYVDAMRGTWTKESMESLPPEFFQPSFDLHLSQG