CuGenDBv2

Lag0041572 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0041572
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr13:20848682..20852270
RNA-Seq Expression: Lag0041572
Synteny: Lag0041572
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
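
The genome location string can be split into a chromosome name and coordinates with a short script. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of CuGenDB, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into chromosome, start and end."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr13:20848682..20852270")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3589 bp on chr13, which is longer than the 1941-nt CDS listed under Sequences.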


Homology
GenBank top hits | e value | %identity | Alignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | 8.1e-69 | 25.17
Query:  MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLL----------------------------------------S
        MD+GF+G  FTW NRR G   + ER+DR L +  W   F       L    SDH P++                                         S
Subjt:  MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLL----------------------------------------S

Query:  LSGGA--RAVNSFGCKIQRCLSNLSRWGRPRMECR-------------------------------GWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFH
          G +    V  F    +R L++L  W +   E R                                 + ++LV+EEVYWKQ S+  WLK GD+NT++FH
Subjt:  LSGGA--RAVNSFGCKIQRCLSNLSRWGRPRMECR-------------------------------GWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFH

Query:  SRVSQRRRFNEIRGLEDDNGQ-------------------------------------------------------EEVLLALKHIHPNKAPGPDGLSGA
        S+ S RRR N+I G+EDD G                                                        E++  AL  + P KAPGPDGL  A
Subjt:  SRVSQRRRFNEIRGLEDDNGQ-------------------------------------------------------EEVLLALKHIHPNKAPGPDGLSGA

Query:  FYRHSWCIT---------------------------------------EYQPISLCNVAYKLVSKVLVNRMKE---------------------------
        F++  W I                                        E++PISLCNV Y++V+K + NR+K                            
Subjt:  FYRHSWCIT---------------------------------------EYQPISLCNVAYKLVSKVLVNRMKE---------------------------

Query:  ------RNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHY-----------------
              R  KG + G    +LKLD+SKAYDRVEW +LE+ M  +GF  +W+ LI  C+++  FS  +     G + P RG+                   
Subjt:  ------RNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHY-----------------

Query:  -------------------------------------AHICSSFALRGFLECY-----MVLRCQESEYGFEYAGSGE-----------------------
                                             A +     L+G  +CY      +   ++S   F    S E                       
Subjt:  -------------------------------------AHICSSFALRGFLECY-----MVLRCQESEYGFEYAGSGE-----------------------

Query:  PDFAG---------------------------------LGYSVSQPVSGFA----------------------------SHG-------SILVERCYEGM
        P   G                                 L  +V+Q V  +A                             HG       S+   +   G+
Subjt:  PDFAG---------------------------------LGYSVSQPVSGFA----------------------------SHG-------SILVERCYEGM

Query:  GFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVT
        GFRDL  FNQAL+AKQ WR+++  +S +ARV+K RY+  + F  A +GS PSFIWRS+LWG ++++KG+RWRIG+G  V VY    +P  +  +  S  T
Subjt:  GFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVT

Query:  LAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSGKAGMGS
        L H+T VA+LI +  +W  + + QHF  +++  I+ I L  G   D ++W ++K G+  + S
Subjt:  LAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSGKAGMGS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | 1.7e-71 | 27.67
Query:  LESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNG------------------------------------------------
        +  LL EEE++W+Q S+D W K GDRNT+WFH++ S RRR NEI+GL D  G                                                
Subjt:  LESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNG------------------------------------------------

Query:  -------QEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCITEYQPISLC-------NVAYKLVSKVLVNRMKERNRKGEQGG------------TGWASL
               +EE+++AL  IHP+KA GPD  S AFY++ W I   Q +S C       ++  K +S++++ R  +      Q               G+ SL
Subjt:  -------QEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCITEYQPISLC-------NVAYKLVSKVLVNRMKERNRKGEQGG------------TGWASL

Query:  KLDMSKAYDRVEWIYLEKIMLKMGFVPEW---------------------------------VELISLCLSSVRFSFNVIDV---RCGDVIPSRGIHYAH
        KLDMSKAYDRVEW +LE +MLKMGF   W                                 + ++S  L  V    +V ++   +C      +   Y  
Subjt:  KLDMSKAYDRVEWIYLEKIMLKMGFVPEW---------------------------------VELISLCLSSVRFSFNVIDV---RCGDVIPSRGIHYAH

Query:  ICSSFALRGFLECYMVLRCQESEYG------------FEY------------------AGSGEPDFAGL-------------------------------
          SS  ++  L   MV  CQ    G            F Y                   G  E     +                               
Subjt:  ICSSFALRGFLECYMVLRCQESEYG------------FEY------------------AGSGEPDFAGL-------------------------------

Query:  GYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIG
        G S       + +  S+ + +C  GMGFRDLE+FN+ALLAKQCWRI+   +S L+RVLKGRYF    F+ A +   PS+IWRS+LWGR+LL+KG+RWRIG
Subjt:  GYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIG

Query:  NGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLIT-TSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG-----------------
        NG +V +YG N VP+   LK+ SS  L   ++V++L+    G W  +++R  F+P E   I++I +  G   DR+IW YEK+G                 
Subjt:  NGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLIT-TSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG-----------------

Query:  -----------------------------------------------KAGM-------------------------------------------------
                                                       K G+                                                 
Subjt:  -----------------------------------------------KAGM-------------------------------------------------

Query:  ----GSFEVLVVILWVVWNCRNYWKFRG---VVLPAG--LIDWALSYISVFREATQVRGMGMVGGVAQNVITWVPPVNGWYKANVDSAYCECQFQAGLGV
              FE L V++W +WN RN   F      V   G  L++WA  Y   FREA      G V   A+  I W PP  G YK N D+++      AGLG+
Subjt:  ----GSFEVLVVILWVVWNCRNYWKFRG---VVLPAG--LIDWALSYISVFREATQVRGMGMVGGVAQNVITWVPPVNGWYKANVDSAYCECQFQAGLGV

Query:  VFRSSAGEVMLSA------VMARDHKSALNTTKQLHQYSRIG
        +  +  G+VM +A      + + D   A+   + L   S IG
Subjt:  VFRSSAGEVMLSA------VMARDHKSALNTTKQLHQYSRIG

XP_024035599.1 uncharacterized protein LOC112096407 [Citrus clementina] | 1.5e-67 | 29.15
Query:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARAVNSFGCKIQRCLSNLSRWGR----PRMECRGWLE
        DLG RG +FTW NR+ G++ + E++DR L N  W+  F +   T+L    SDH P+++ +    R++  F  K    L     W       ++    WL+
Subjt:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARAVNSFGCKIQRCLSNLSRWGR----PRMECRGWLE

Query:  SLLVEEE-----------------VYWKQ---SSKDHWLK--LGD-RNTRWFHSRVSQRRRFNEIR----------------------------------
          +  EE                 + W +     +D  LK  LG  +  +  H +++  R    +                                   
Subjt:  SLLVEEE-----------------VYWKQ---SSKDHWLK--LGD-RNTRWFHSRVSQRRRFNEIR----------------------------------

Query:  -------GLEDDNGQEEVLLALKHIHPNKAPGPDGLSGAFYRHSW----------C-----------------------------ITEYQPISLCNVAYK
                LE     EE+  AL  + P KAPGPDGL   F++  W          C                             +TEY+PISLCNV Y 
Subjt:  -------GLEDDNGQEEVLLALKHIHPNKAPGPDGLSGAFYRHSW----------C-----------------------------ITEYQPISLCNVAYK

Query:  LVSKVLVNRMKE---------------------------------RNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSV
        LV+K + NR+K                                  R+ KG++  +   +LKLD+ KAYDRVEW +L+ ++ ++GF  +W+ LI  C+++ 
Subjt:  LVSKVLVNRMKE---------------------------------RNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSV

Query:  RFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESEYGFEYAGSGEPDFAGLGYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQA
         FS  +     G + P RG+     C        L  Y+ L C             E D  G+ +   + +S     G         G+GFRD+  FNQA
Subjt:  RFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESEYGFEYAGSGEPDFAGLGYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQA

Query:  LLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLI
        LLAKQ WR++Q   S +A+V+K RY+  TDFL A +GS PSFIWRS+LWGR++L+KG RWRIGNG  + +  +N +P  +  K+    +L    +V+ LI
Subjt:  LLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLI

Query:  TTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG
          + QWNE +I Q F+  +  +I +I L      D IIW Y++ G
Subjt:  TTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina] | 8.1e-69 | 23.95
Query:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLL---------------------------------------SLS
        D+ ++G  +TW N R G   V ER+DR + N AW+ +F +   T++D   SDH P+++                                       SL 
Subjt:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLL---------------------------------------SLS

Query:  GGARAVNSFGC-------------------------KIQRCLSNLSRW---------GRPRMECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHS
        G   AVN                             K+++ ++ L            G    E    ++ +L ++E+YWKQ S+  WLK GD+NT++FH 
Subjt:  GGARAVNSFGC-------------------------KIQRCLSNLSRW---------GRPRMECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHS

Query:  RVSQRRRFNEIRGLEDDNGQ-------------------------------------------------------EEVLLALKHIHPNKAPGPDGLSGAF
        + S R++ N I G+E+  G                                                        EEV+ AL  + P KAPGPDGL   F
Subjt:  RVSQRRRFNEIRGLEDDNGQ-------------------------------------------------------EEVLLALKHIHPNKAPGPDGLSGAF

Query:  YRHSW----------C-----------------------------ITEYQPISLCNVAYKLVSKVLVNRMKE----------------------------
        ++  W          C                             +T+++PISLCNV Y++V+K + NR+K                             
Subjt:  YRHSW----------C-----------------------------ITEYQPISLCNVAYKLVSKVLVNRMKE----------------------------

Query:  -----RNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELI-SLCLSSVRFSFN--VIDVRCGDVIPSRGIHYAHICSSFA-------
             R+ KG + G    +LKLD+SKAYD++EW++LE+ M  +GF   WV LI SL LSS +      +  +  G+ +    + +A     F        
Subjt:  -----RNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELI-SLCLSSVRFSFN--VIDVRCGDVIPSRGIHYAHICSSFA-------

Query:  --LRGFLECY-------------------------------------------------MVLRCQES----------------EYGFEYAGSGE------
          L+   +CY                                                 M+ R + S                ++ F  AG  E      
Subjt:  --LRGFLECY-------------------------------------------------MVLRCQES----------------EYGFEYAGSGE------

Query:  ----PDFA--------GLGYSVSQPVSGF---------ASHGS----ILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFL
            P +A        G+   + + ++ F         A H S    +   +C  GMGFRD   FNQAL+AKQ WRI+Q   S +A++L+ RYF   DF+
Subjt:  ----PDFA--------GLGYSVSQPVSGF---------ASHGS----ILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFL

Query:  GAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITITLRDGL
         A LGS PSFIWRS+LWGR+++ KG +WRIGNG ++ ++  N +P     K  S  +L  D  VA LI     WNE LI +HF+  +   I+ I L    
Subjt:  GAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITITLRDGL

Query:  SGDRIIWRYEKSGKAGMGS---------------------------------------------------------------------------------
          D +IW Y+K G+  + S                                                                                 
Subjt:  SGDRIIWRYEKSGKAGMGS---------------------------------------------------------------------------------

Query:  ----------------------------------------FEVLVVILWVVWNCRNYWKFRGV-VLPAGLIDWALSYISVFREATQVRGMGMVGGVAQNV
                                                 E+LV ILW++WN RN W F+GV  +P   +  A + +  FR         +    +  +
Subjt:  ----------------------------------------FEVLVVILWVVWNCRNYWKFRGV-VLPAGLIDWALSYISVFREATQVRGMGMVGGVAQNV

Query:  ITWVPPVNGWYKANVDSAYCECQFQAGLGVVFRSSAGEVMLSAV
          W PP  G++K NVD+A    +  AGLG V R  AG V+ +AV
Subjt:  ITWVPPVNGWYKANVDSAYCECQFQAGLGVVFRSSAGEVMLSAV

XP_030925054.1 uncharacterized protein LOC115952115 [Quercus lobata] | 5.6e-70 | 28.45
Query:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSL--------------------------------SGGARAVN
        DLGF G  FTWCNRR G + VW R+DR +  + W   FP   + HLD   SDH+P+LLS                                 S G   V 
Subjt:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSL--------------------------------SGGARAVN

Query:  S----FGCKIQRCLSNLSRWGRPR-------------------------------MECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRR
        +    F  KI  C +NL  W +                                  E R  ++ L   EE  WKQ S++ WLK GD+NTR+FH R +QR 
Subjt:  S----FGCKIQRCLSNLSRWGRPR-------------------------------MECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRR

Query:  RFNEIRGLEDDNG-----------------------------------------------------QEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCI-
        R N I GLEDDNG                                                       EV  AL  + P  APG DG+S  FY+  W I 
Subjt:  RFNEIRGLEDDNG-----------------------------------------------------QEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCI-

Query:  --------------------------------------TEYQPISLCNVAYKLVSKVLVNRMKE-------------------------------RNRKG
                                              ++++PISLCNV YKL++KV+ NR+K+                                 ++ 
Subjt:  --------------------------------------TEYQPISLCNVAYKLVSKVLVNRMKE-------------------------------RNRKG

Query:  EQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIH-------YAHICSSFALRGFLECYMVLRC
         +G  G+ +LKLDMSKAYDRVEW ++E IM  +G       +I  C+ SV +S  +     G++ PSRG+        Y  +  +  L+G L+       
Subjt:  EQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIH-------YAHICSSFALRGFLECYMVLRC

Query:  QESEYGFEYAGSGEPDFA--------GL-------------GYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLK
        +  E   +      P +         GL             GY+       + S   +   +   GMGF+++E FN ALLAKQ WR++Q S S   RV K
Subjt:  QESEYGFEYAGSGEPDFA--------GL-------------GYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLK

Query:  GRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLA-HDTQVANLITTS-GQWNEELIRQHFSPQEV
         R+FP    L A   +  S+ W+S+L  R+++ KG+ WRIGNG +V +     +P  S   + S + L   +T+V++LI    G W  E + + F P E 
Subjt:  GRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLA-HDTQVANLITTS-GQWNEELIRQHFSPQEV

Query:  SLIITITLRDGLSGDRIIWRYEKSGKAGMGS
        SL++ I L      DR+ W    SG+    S
Subjt:  SLIITITLRDGLSGDRIIWRYEKSGKAGMGS

TrEMBL top hits | e value | %identity | Alignment
A0A2N9F775 Reverse transcriptase domain-containing protein | 2.2e-67 | 31.06
Query:  MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSG---GARA---VNSF--------GC-------------
        +DLG+RG ++TW N R  A  +  R+DR L   AW   FP + V+H   S SDH  L++       G R    +  F         C             
Subjt:  MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSG---GARA---VNSF--------GC-------------

Query:  ----------KIQRCLSNLSRWGRP-----------RMEC--------------------RGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQ
                  KI RC   L  W R            +ME                     +  +  LL ++E++W+Q S++ WL+ GDRNT++FH +  Q
Subjt:  ----------KIQRCLSNLSRWGRP-----------RMEC--------------------RGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQ

Query:  RRRFNEIRGLEDDNGQ-----------------------------------------------------EEVLLALKHIHPNKAPGPDGLSGAFYRHS--
        RR  N IRG+ D NG+                                                      E+  A   +HP+K+PGPD        HS  
Subjt:  RRRFNEIRGLEDDNGQ-----------------------------------------------------EEVLLALKHIHPNKAPGPDGLSGAFYRHS--

Query:  ---------WCITEYQPISLCNVAYKLVSKVLVNRMKERNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVI
                   I++Y+PISL NV   LV+  L++ ++ R R    G     ++KLDMSKAYDRVEW +LE++M +MGF   W+ L+  C+ +  +S  + 
Subjt:  ---------WCITEYQPISLCNVAYKLVSKVLVNRMKERNRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVI

Query:  DVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESEYGFEYAGSGEPDFAG---LGYSVSQPVSGFASHGSILVERC---------YE--------
            G + PSRGI      S +    FL C   L     + G E   +G     G   + + +    S F    SI  E C         YE        
Subjt:  DVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESEYGFEYAGSGEPDFAG---LGYSVSQPVSGFASHGSILVERC---------YE--------

Query:  GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSS
        GMGFRDL  FN ALLAKQ WR++ Q  S  ARV K +YFP   FL A LGS PSFIWRS+L  R+LL +GIRW +GNG  V ++  +    D  L+ R  
Subjt:  GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSS

Query:  VTLAHDTQ-VANLI-TTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG
           A + Q VA LI TT G W+  ++++ F P     I  + L +    D ++W+   +G
Subjt:  VTLAHDTQ-VANLI-TTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG

A0A2N9FGD9 Reverse transcriptase domain-containing protein | 6.3e-67 | 31.88
Query:  MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARAVN--------------SFGCK--IQRCLSNLSR
        ++LGF G  FTW N + G   V ER+DRCL        FP + V+HL    SDHRPL + L    RA+                 GC+  I+    +   
Subjt:  MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARAVN--------------SFGCK--IQRCLSNLSR

Query:  WGRPRM------------------ECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNG-----------------QE
          + R+                  +    L +L  +EE  WKQ S+  WL+ GDRNT++FH + + R+R N I G+ D  G                  +
Subjt:  WGRPRM------------------ECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNG-----------------QE

Query:  EVLLALKHIHPNKAPGPDGLSGAFYRHSW---------------------------------------CITEYQPISLCNVAYKLVSKVLVNRMKE---R
        EV L LK + P KA GPDG+S  FY+  W                                        + +++PISLCNV YKLV+KVL NR+K+    
Subjt:  EVLLALKHIHPNKAPGPDGLSGAFYRHSW---------------------------------------CITEYQPISLCNVAYKLVSKVLVNRMKE---R

Query:  NRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQES
         +  +    G+ +LKLDM KAYDRVEW++LE IMLKMGF   WV +I  CL +V +S  +     G   PSRG+      S +    FL C   L+   +
Subjt:  NRKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQES

Query:  EYGFEYAGSGEPDFAGLGYSVSQPVSGFASHGSILVER-CYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWR
        +     +  G      L  +V Q +  +  +   L ++ C E         +  A   ++ WR+I   SS L +VL  +YFP    + A   SR SF W+
Subjt:  EYGFEYAGSGEPDFAGLGYSVSQPVSGFASHGSILVER-CYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWR

Query:  SLLWGRELLEKGIRWRIGNGVNVPVYGAN-LVPHDSCLKVRSSVTLAHDTQVANLI-TTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIW
        S+L  R++++ G RWRIG G +  ++    ++P  S L +     L  + +V++LI  +SG WN  LI Q F P +  LI +I L   LS D ++W
Subjt:  SLLWGRELLEKGIRWRIGNGVNVPVYGAN-LVPHDSCLKVRSSVTLAHDTQVANLI-TTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIW

A0A2N9GTH1 CCHC-type domain-containing protein | 3.7e-75 | 31.36
Query:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLL-----SLSGGARAVNSFG----CKIQRCLSNLSRWGRPRM--
        DLGF G AFTW NRRL +E V  R+DRC+ N  W  LFP   V H+  + SDH  LL+      +    R    F      KI+R   +L  W + ++  
Subjt:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLL-----SLSGGARAVNSFG----CKIQRCLSNLSRWGRPRM--

Query:  -----------------------------ECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDN---------GQEEVL
                                     + R  +  L+ ++E++W+Q S+  WL  GDRNT+++H+  SQR+R N+I GL D+N          +EE+ 
Subjt:  -----------------------------ECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDN---------GQEEVL

Query:  LALKHIHPNKAPGPDGLSGAFYRHSW---------------------------------------CITEYQPISLCNVAYKLVSKVLVNRMKE---RNRK
         AL  + P+KAPG DG++   ++  W                                        +T+++ ISLCNV YK+VSK+LVNRMK    R   
Subjt:  LALKHIHPNKAPGPDGLSGAFYRHSW---------------------------------------CITEYQPISLCNVAYKLVSKVLVNRMKE---RNRK

Query:  GEQ----------------------------GGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIH
          Q                            G     + KLDMSKAYDRVEW YL+ I+LK+GF   WV+LI  C+SS  +S  V     G + PSRG+ 
Subjt:  GEQ----------------------------GGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIH

Query:  YAHICSSFALRGFLEC----YMVLRCQESEYGFEYAGSGEPDFAGLGYSV----------SQPVSGFASHGSILVERC----------------------
             S +    FL C      +LR +E E      G  E   +  G  +          S  +S F    S+  E C                      
Subjt:  YAHICSSFALRGFLEC----YMVLRCQESEYGFEYAGSGEPDFAGLGYSV----------SQPVSGFASHGSILVERC----------------------

Query:  --------YEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVP
                  GMGFR+L++FN+ALLAKQ WR+IQ   +  +R LK +YFP T FL A L S  S+IWRS+   R +L  G+RWR+ +G  + V+    +P
Subjt:  --------YEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVP

Query:  HDSCLKVRSSVTLAH-DTQVANLITTS-GQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSGKAGMGS
          S  KV + V + + D  V +LI  +  +WN   +++ F P++V +I  I L      D++IW    +GK  + S
Subjt:  HDSCLKVRSSVTLAH-DTQVANLITTS-GQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSGKAGMGS

A0A6J1DAR4 uncharacterized protein LOC111018954 | 8.4e-72 | 27.67
Query:  LESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNG------------------------------------------------
        +  LL EEE++W+Q S+D W K GDRNT+WFH++ S RRR NEI+GL D  G                                                
Subjt:  LESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNG------------------------------------------------

Query:  -------QEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCITEYQPISLC-------NVAYKLVSKVLVNRMKERNRKGEQGG------------TGWASL
               +EE+++AL  IHP+KA GPD  S AFY++ W I   Q +S C       ++  K +S++++ R  +      Q               G+ SL
Subjt:  -------QEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCITEYQPISLC-------NVAYKLVSKVLVNRMKERNRKGEQGG------------TGWASL

Query:  KLDMSKAYDRVEWIYLEKIMLKMGFVPEW---------------------------------VELISLCLSSVRFSFNVIDV---RCGDVIPSRGIHYAH
        KLDMSKAYDRVEW +LE +MLKMGF   W                                 + ++S  L  V    +V ++   +C      +   Y  
Subjt:  KLDMSKAYDRVEWIYLEKIMLKMGFVPEW---------------------------------VELISLCLSSVRFSFNVIDV---RCGDVIPSRGIHYAH

Query:  ICSSFALRGFLECYMVLRCQESEYG------------FEY------------------AGSGEPDFAGL-------------------------------
          SS  ++  L   MV  CQ    G            F Y                   G  E     +                               
Subjt:  ICSSFALRGFLECYMVLRCQESEYG------------FEY------------------AGSGEPDFAGL-------------------------------

Query:  GYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIG
        G S       + +  S+ + +C  GMGFRDLE+FN+ALLAKQCWRI+   +S L+RVLKGRYF    F+ A +   PS+IWRS+LWGR+LL+KG+RWRIG
Subjt:  GYSVSQPVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIG

Query:  NGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLIT-TSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG-----------------
        NG +V +YG N VP+   LK+ SS  L   ++V++L+    G W  +++R  F+P E   I++I +  G   DR+IW YEK+G                 
Subjt:  NGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLIT-TSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG-----------------

Query:  -----------------------------------------------KAGM-------------------------------------------------
                                                       K G+                                                 
Subjt:  -----------------------------------------------KAGM-------------------------------------------------

Query:  ----GSFEVLVVILWVVWNCRNYWKFRG---VVLPAG--LIDWALSYISVFREATQVRGMGMVGGVAQNVITWVPPVNGWYKANVDSAYCECQFQAGLGV
              FE L V++W +WN RN   F      V   G  L++WA  Y   FREA      G V   A+  I W PP  G YK N D+++      AGLG+
Subjt:  ----GSFEVLVVILWVVWNCRNYWKFRG---VVLPAG--LIDWALSYISVFREATQVRGMGMVGGVAQNVITWVPPVNGWYKANVDSAYCECQFQAGLGV

Query:  VFRSSAGEVMLSA------VMARDHKSALNTTKQLHQYSRIG
        +  +  G+VM +A      + + D   A+   + L   S IG
Subjt:  VFRSSAGEVMLSA------VMARDHKSALNTTKQLHQYSRIG

A0A803NU77 Uncharacterized protein | 7.2e-71 | 27.29
Query:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARA-VNSFGCK--------------------------
        +L F  D FTW N+   +  V ER+D    N  W   F   ++ HL F  SDHR +L+++S    A V  F  +                          
Subjt:  DLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARA-VNSFGCK--------------------------

Query:  ---------IQRCLSNLSRWGRPRM--------------------------------ECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQR
                 I  C + L +W R R                                      L+ LL +EE YWKQ S+  WL+ GD NTR+FH +V  R
Subjt:  ---------IQRCLSNLSRWGRPRM--------------------------------ECRGWLESLLVEEEVYWKQSSKDHWLKLGDRNTRWFHSRVSQR

Query:  RRFNEIRGLEDDNGQEEVLL-------------------------------------------------------ALKHIHPNKAPGPDGLSGAFYRHSW
        R  N IR L DDNG E   L                                                       AL  +  + +PG DG+S  FY + W
Subjt:  RRFNEIRGLEDDNGQEEVLL-------------------------------------------------------ALKHIHPNKAPGPDGLSGAFYRHSW

Query:  CI---------------------------------------TEYQPISLCNVAYKLVSKVLVNRMK-------------------------------ERN
         I                                       T+ +PISLCNV YKLVSK +V R++                                  
Subjt:  CI---------------------------------------TEYQPISLCNVAYKLVSKVLVNRMK-------------------------------ERN

Query:  RKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESE
        +  ++G  G+A++KLDMSKA+DRVEW +++ +ML +GF    V LI+ C+SSV FSF + D   G V+PSRGI      S          Y+ + C E  
Subjt:  RKGEQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESE

Query:  YGFEYAGSGEPDFAGLGYSVSQPVSG--FASHGSILVERCYE------------------GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFP
                   +  GL  + S P     F    S+L+ R                     GMGF+    FNQA+LAKQ WR+    +S L+R+LK RY+ 
Subjt:  YGFEYAGSGEPDFAGLGYSVSQPVSG--FASHGSILVERCYE------------------GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFP

Query:  FTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITIT
         + FL +  GS PS  W+ ++WG+ELL KG+RW++G+G ++       +P  +  K   S     + QV++LIT   QWN EL+   F   +V  I+ I 
Subjt:  FTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHDSCLKVRSSVTLAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITIT

Query:  LRDGLSGDRIIWRYEKSG----KAGMGSFEVLVVILWVV--WNCRNYW-KFRGVVLPAGL-------IDWALSYISVFR------------------EAT
        L    +GD++IW YE +G    K+G      L   L  V     +N+W KF  + LP+ +       I+  L   +                     +A 
Subjt:  LRDGLSGDRIIWRYEKSG----KAGMGSFEVLVVILWVV--WNCRNYW-KFRGVVLPAGL-------IDWALSYISVFR------------------EAT

Query:  QVRGMGMVGGVA------QNVITWVPPVNGWYKANVDSAYCECQFQAGLGVVFRSSAGEVM
        Q         V+      Q   +W+ P  G  K N D+A  +     G G + R+S GEV+
Subjt:  QVRGMGMVGGVA------QNVITWVPPVNGWYKANVDSAYCECQFQAGLGVVFRSSAGEVM

SwissProt top hits | e value | %identity | Alignment
P93295 Uncharacterized mitochondrial protein AtMg00310 | 8.0e-19 | 51.19
Query:  GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVY
        G+GFRDL  FNQALLAKQ +RII Q  + L+R+L+ RYFP +  +   +G+RPS+ WRS++ GRELL +G+   IG+G++  V+
Subjt:  GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVY

Arabidopsis top hits | e value | %identity | Alignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 2.3e-05 | 45.45
Query:  EQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELIS
        ++G  GW  LKLD+ KAYDR+ W YLE  ++  GF   W+  I+
Subjt:  EQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELIS
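
The %identity reported for this hit can be reproduced from the alignment above. The sketch below assumes the common BLAST convention that identity is the number of identical positions (letters on the middle line, excluding `+` and blanks) divided by the alignment length; the page does not state how CuGenDB computes the value, and `percent_identity` is a hypothetical helper.

```python
def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters on the midline) over the alignment length.

    Assumption: the alignment length equals the length of the query row,
    gaps included, and '+' marks a conservative substitution, not an identity.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)


# Query row and middle row of the AT4G20520.1 alignment, copied from above.
query = "EQGGTGWASLKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELIS"
midline = "++G  GW  LKLD+ KAYDR+ W YLE  ++  GF   W+  I+"

identity = percent_identity(query, midline)
print(f"{identity:.2f}")
```

For this hit the middle line carries 20 identical residues over 44 aligned positions, which agrees with the listed value of 45.45.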

AT4G29090.1 Ribonuclease H-like superfamily protein | 1.2e-22 | 34.68
Query:  CYE---GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLV---PH
        CY+   G+GF+D+E FN ALL KQ WR++ +  S +A+V K RYF  +D L A LGSRPSF+W+S+   +E+L +G R  +GNG ++ ++    +   P 
Subjt:  CYE---GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLV---PH

Query:  DSCLKV-----RSSVTLAHDTQVANLITTSG-QWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG
         + L++     +   +++   +V++LI  SG +W +++I   F   E  LI  +        D   W Y  SG
Subjt:  DSCLKV-----RSSVTLAHDTQVANLITTSG-QWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 5.7e-20 | 51.19
Query:  GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVY
        G+GFRDL  FNQALLAKQ +RII Q  + L+R+L+ RYFP +  +   +G+RPS+ WRS++ GRELL +G+   IG+G++  V+
Subjt:  GMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVY


Sequences
CDS sequence
ATGGACTTAGGGTTCAGAGGGGATGCTTTTACATGGTGTAATAGAAGGCTAGGTGCAGAAACGGTATGGGAGAGGATTGACAGGTGTCTTGGCAACTTGGCATGGCAGAA
GTTGTTTCCGGAGCATGTGGTAACCCACCTTGACTTTAGTCGCTCAGACCACAGGCCCCTTTTGCTTTCATTATCTGGTGGGGCTCGAGCAGTCAACAGTTTTGGGTGCA
AAATTCAAAGATGTCTGTCTAATCTGAGTAGGTGGGGGAGGCCGAGAATGGAATGCAGAGGATGGCTTGAATCACTTCTAGTGGAGGAGGAGGTGTATTGGAAGCAGTCG
TCCAAGGATCATTGGCTTAAATTGGGTGACAGGAATACCCGCTGGTTCCATTCTCGAGTCTCTCAGAGGAGGAGATTTAACGAGATAAGGGGTTTGGAGGACGATAATGG
GCAAGAGGAGGTTTTGTTGGCATTGAAGCACATTCATCCTAATAAGGCTCCGGGACCAGATGGCTTATCTGGGGCTTTCTATCGGCATTCATGGTGTATTACTGAATACC
AACCTATATCACTCTGCAACGTGGCTTACAAACTGGTGTCGAAAGTGCTAGTGAATCGTATGAAAGAGCGAAACAGAAAGGGCGAACAGGGTGGGACAGGGTGGGCCTCG
CTTAAATTAGATATGAGCAAGGCGTATGATAGGGTGGAGTGGATTTATTTGGAGAAGATTATGTTGAAGATGGGCTTTGTGCCAGAATGGGTCGAGTTGATATCTCTGTG
CCTTTCATCAGTCCGGTTCTCTTTTAATGTGATTGACGTCAGATGTGGGGATGTTATCCCGAGCAGGGGGATCCATTATGCCCATATTTGTTCCTCCTTTGCGTTGAGGG
GCTTTCTTGAATGTTACATGGTGCTGAGGTGTCAAGAGTCCGAGTACGGCTTCGAATATGCGGGCTCTGGTGAGCCAGATTTTGCAGGTTTAGGTTACAGCGTGTCACAA
CCAGTATCTGGGTTTGCCAGCCATGGCTCGATTTTGGTGGAGCGGTGCTATGAAGGTATGGGATTTCGTGATCTGGAGGTTTTTAACCAAGCCTTGTTGGCTAAGCAGTG
TTGGAGGATTATCCAACAGTCGTCCTCGTTTCTTGCTCGTGTGCTAAAGGGTCGGTATTTTCCTTTTACGGATTTTTTAGGGGCGGGCCTGGGGTCGAGGCCCTCCTTTA
TTTGGAGGAGTTTACTGTGGGGGAGAGAGCTCTTAGAGAAGGGCATTCGATGGCGAATTGGGAATGGTGTAAACGTCCCCGTCTATGGAGCTAATTTGGTTCCTCATGAC
TCTTGCCTCAAGGTGCGTTCCTCGGTTACATTAGCACATGATACTCAGGTGGCCAATCTTATTACGACATCTGGTCAGTGGAACGAGGAGCTGATTCGACAGCATTTTAG
CCCCCAAGAGGTAAGTCTTATTATTACTATTACTTTGCGAGATGGATTGTCTGGTGATAGAATTATTTGGCGCTATGAGAAATCTGGAAAGGCTGGAATGGGCTCGTTTG
AGGTACTGGTGGTGATTTTGTGGGTTGTCTGGAATTGTCGTAACTATTGGAAGTTTAGGGGCGTTGTCTTGCCTGCAGGACTTATCGATTGGGCATTGAGTTATATCTCA
GTGTTTCGAGAGGCTACTCAAGTTAGAGGTATGGGCATGGTAGGCGGAGTGGCTCAGAATGTAATAACATGGGTCCCGCCTGTGAATGGGTGGTATAAGGCAAACGTTGA
TTCTGCTTACTGTGAGTGTCAATTTCAAGCGGGTTTGGGTGTGGTTTTTCGGAGTTCTGCAGGTGAGGTTATGCTTTCTGCAGTAATGGCACGTGATCATAAATCAGCCT
TGAACACCACCAAGCAGTTGCACCAGTATTCTCGAATAGGATTTTGCGTTGGGAGCTTGTGGGAGCAATAA
mRNA sequence
ATGGACTTAGGGTTCAGAGGGGATGCTTTTACATGGTGTAATAGAAGGCTAGGTGCAGAAACGGTATGGGAGAGGATTGACAGGTGTCTTGGCAACTTGGCATGGCAGAA
GTTGTTTCCGGAGCATGTGGTAACCCACCTTGACTTTAGTCGCTCAGACCACAGGCCCCTTTTGCTTTCATTATCTGGTGGGGCTCGAGCAGTCAACAGTTTTGGGTGCA
AAATTCAAAGATGTCTGTCTAATCTGAGTAGGTGGGGGAGGCCGAGAATGGAATGCAGAGGATGGCTTGAATCACTTCTAGTGGAGGAGGAGGTGTATTGGAAGCAGTCG
TCCAAGGATCATTGGCTTAAATTGGGTGACAGGAATACCCGCTGGTTCCATTCTCGAGTCTCTCAGAGGAGGAGATTTAACGAGATAAGGGGTTTGGAGGACGATAATGG
GCAAGAGGAGGTTTTGTTGGCATTGAAGCACATTCATCCTAATAAGGCTCCGGGACCAGATGGCTTATCTGGGGCTTTCTATCGGCATTCATGGTGTATTACTGAATACC
AACCTATATCACTCTGCAACGTGGCTTACAAACTGGTGTCGAAAGTGCTAGTGAATCGTATGAAAGAGCGAAACAGAAAGGGCGAACAGGGTGGGACAGGGTGGGCCTCG
CTTAAATTAGATATGAGCAAGGCGTATGATAGGGTGGAGTGGATTTATTTGGAGAAGATTATGTTGAAGATGGGCTTTGTGCCAGAATGGGTCGAGTTGATATCTCTGTG
CCTTTCATCAGTCCGGTTCTCTTTTAATGTGATTGACGTCAGATGTGGGGATGTTATCCCGAGCAGGGGGATCCATTATGCCCATATTTGTTCCTCCTTTGCGTTGAGGG
GCTTTCTTGAATGTTACATGGTGCTGAGGTGTCAAGAGTCCGAGTACGGCTTCGAATATGCGGGCTCTGGTGAGCCAGATTTTGCAGGTTTAGGTTACAGCGTGTCACAA
CCAGTATCTGGGTTTGCCAGCCATGGCTCGATTTTGGTGGAGCGGTGCTATGAAGGTATGGGATTTCGTGATCTGGAGGTTTTTAACCAAGCCTTGTTGGCTAAGCAGTG
TTGGAGGATTATCCAACAGTCGTCCTCGTTTCTTGCTCGTGTGCTAAAGGGTCGGTATTTTCCTTTTACGGATTTTTTAGGGGCGGGCCTGGGGTCGAGGCCCTCCTTTA
TTTGGAGGAGTTTACTGTGGGGGAGAGAGCTCTTAGAGAAGGGCATTCGATGGCGAATTGGGAATGGTGTAAACGTCCCCGTCTATGGAGCTAATTTGGTTCCTCATGAC
TCTTGCCTCAAGGTGCGTTCCTCGGTTACATTAGCACATGATACTCAGGTGGCCAATCTTATTACGACATCTGGTCAGTGGAACGAGGAGCTGATTCGACAGCATTTTAG
CCCCCAAGAGGTAAGTCTTATTATTACTATTACTTTGCGAGATGGATTGTCTGGTGATAGAATTATTTGGCGCTATGAGAAATCTGGAAAGGCTGGAATGGGCTCGTTTG
AGGTACTGGTGGTGATTTTGTGGGTTGTCTGGAATTGTCGTAACTATTGGAAGTTTAGGGGCGTTGTCTTGCCTGCAGGACTTATCGATTGGGCATTGAGTTATATCTCA
GTGTTTCGAGAGGCTACTCAAGTTAGAGGTATGGGCATGGTAGGCGGAGTGGCTCAGAATGTAATAACATGGGTCCCGCCTGTGAATGGGTGGTATAAGGCAAACGTTGA
TTCTGCTTACTGTGAGTGTCAATTTCAAGCGGGTTTGGGTGTGGTTTTTCGGAGTTCTGCAGGTGAGGTTATGCTTTCTGCAGTAATGGCACGTGATCATAAATCAGCCT
TGAACACCACCAAGCAGTTGCACCAGTATTCTCGAATAGGATTTTGCGTTGGGAGCTTGTGGGAGCAATAA
Protein sequence
MDLGFRGDAFTWCNRRLGAETVWERIDRCLGNLAWQKLFPEHVVTHLDFSRSDHRPLLLSLSGGARAVNSFGCKIQRCLSNLSRWGRPRMECRGWLESLLVEEEVYWKQS
SKDHWLKLGDRNTRWFHSRVSQRRRFNEIRGLEDDNGQEEVLLALKHIHPNKAPGPDGLSGAFYRHSWCITEYQPISLCNVAYKLVSKVLVNRMKERNRKGEQGGTGWAS
LKLDMSKAYDRVEWIYLEKIMLKMGFVPEWVELISLCLSSVRFSFNVIDVRCGDVIPSRGIHYAHICSSFALRGFLECYMVLRCQESEYGFEYAGSGEPDFAGLGYSVSQ
PVSGFASHGSILVERCYEGMGFRDLEVFNQALLAKQCWRIIQQSSSFLARVLKGRYFPFTDFLGAGLGSRPSFIWRSLLWGRELLEKGIRWRIGNGVNVPVYGANLVPHD
SCLKVRSSVTLAHDTQVANLITTSGQWNEELIRQHFSPQEVSLIITITLRDGLSGDRIIWRYEKSGKAGMGSFEVLVVILWVVWNCRNYWKFRGVVLPAGLIDWALSYIS
VFREATQVRGMGMVGGVAQNVITWVPPVNGWYKANVDSAYCECQFQAGLGVVFRSSAGEVMLSAVMARDHKSALNTTKQLHQYSRIGFCVGSLWEQ
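
The relationship between the CDS and protein sequences can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1) and an unambiguous A/C/G/T sequence; `translate` is a hypothetical helper written for illustration, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS sequence listed above.
cds_start = "ATGGACTTAGGGTTCAGAGGGGATGCTTTT"
cds_end = "TGGGAGCAATAA"
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to MDLGFRGDAF and WEQ, matching the first ten and last three residues of the protein sequence; passing the full CDS text to `translate` would allow the same comparison over the whole record.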