; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0041075 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0041075
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:11687276..11695585
RNA-Seq ExpressionLag0041075
SyntenyLag0041075
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO59710.1 reverse transcriptase [Corchorus capsularis]4.9e-11428.64Show/hide
Query:  QQLSYSPNHIDGWVDRGVSL--WRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKG
        Q +S+S +HID WVD    L  WR +  YG    G + ++W LL+    +   +W   GDF  +L   E +GGR + EA++ +F  A+D+C L D+G++G
Subjt:  QQLSYSPNHIDGWVDRGVSL--WRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKG

Query:  ATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSAR----------------WRTEAPI--------LFARGVGI-
          FTW  G    + + ER+DR +        FP   +TH+  S SDH P+LL+ E    R                W  EA           F  G+G+ 
Subjt:  ATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSAR----------------WRTEAPI--------LFARGVGI-

Query:  --------------------------------------------------------------------------------SSF-----------------
                                                                                        +SF                 
Subjt:  --------------------------------------------------------------------------------SSF-----------------

Query:  ------------------------------------------------------QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCC
                                                                F  EE+  AL  IHP KAP PDG+   F++  W +VG DV + C
Subjt:  ------------------------------------------------------QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCC

Query:  LNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG--
        L+  +  +  +  N+T IVLIPK   PK IT ++PISLCNV YKI+SKVLVNR+K IL V IS +QSAF+PG  + DN ++ +E +H+LK ++ G+ G  
Subjt:  LNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG--

Query:  -------KAYDRVK-----------VFDKE-----IHFPRTCSYCVL--------------------------------------KGSL-----------
               KAYDRV+            FD+      +   R+ S+ V+                                      KG             
Subjt:  -------KAYDRVK-----------VFDKE-----IHFPRTCSYCVL--------------------------------------KGSL-----------

Query:  --------------ALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSS----------------------------TIMG-------------
                       LF +A   E+  V   L+ YE   GQ INF+KS++ FS +                            T +G             
Subjt:  --------------ALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSS----------------------------TIMG-------------

Query:  VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCW
        +  +I  W  +  S GGREV++KS++QAIP Y MN F  P+ L +DI+  ++RFWW    + R I+W+ W+ +CK K  GG GFRD+E FNQA LAKQ W
Subjt:  VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCW

Query:  RIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRN
        +++ +P S  +RVLK RYFP   FL    G  PS+ WRS++ GR+LL  G+RWR GN + VR+              W  +          G + D S  
Subjt:  RIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRN

Query:  AVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSS--WWKGCWKMGVPNQIKI
         VA L +    SW+ +L+R  F   E   I+ I L + L  D ++WHF+   +YS+KSGYR+  +  L   +  +  E L    ++   W   VP ++++
Subjt:  AVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSS--WWKGCWKMGVPNQIKI

Query:  FLWRLCLEQLSTVDNLSVRGIDV
        F WRL  E L  +D+L  R +DV
Subjt:  FLWRLCLEQLSTVDNLSVRGIDV

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis]4.9e-12233.26Show/hide
Query:  SYSPNHIDGWVDR--GVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF
        SYS +HID  V+   G  +WRF+ IYG P+   + +TW+L+K+L G     WL+GGDF  ILS  E EGG  + E ++  F  A++   L DL F G  F
Subjt:  SYSPNHIDGWVDR--GVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF

Query:  TWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSM---------------------------EAVSARWRTE------APILFA
        TW   R+GE+ +  R+DR + N +  ++FP   + H++ + SDH P+L+ +                           + V A W T         +L  
Subjt:  TWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSM---------------------------EAVSARWRTE------APILFA

Query:  RGVGISSF--------------QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITD
         G G  SF                   EEV +AL  +HP+KAP PDG S  FYQ  W VVG DVV      LN+      +N T + LIPK K+P+ +  
Subjt:  RGVGISSF--------------QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITD

Query:  YKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------KAYDRVK------VFDK--------
         +PISLCNV YKI SKVL NR+K +L+ +IS  QSAF+PG  + DN++L +E  H LK ++ G  G         KAYDRV+      V DK        
Subjt:  YKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------KAYDRVK------VFDK--------

Query:  --EIHFPRTCSYCVLKG-------------------SLALFFRAKEGEARVV---------HG--------------------------------ILQYY
           +   RT SY  L                     S  LF    E  +R++         HG                                I Q Y
Subjt:  --EIHFPRTCSYCVLKG-------------------SLALFFRAKEGEARVV---------HG--------------------------------ILQYY

Query:  ERAFGQTINFDKSIISFSSST----------IMG-------------------------------VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMN
        E+  GQ IN++KS ISFS +           ++G                               V+ + QGW+ K  S  G+EVL+K++VQ+IP Y M+
Subjt:  ERAFGQTINFDKSIISFSSST----------IMG-------------------------------VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMN

Query:  CFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSY
        CF LPK L  ++ R MARFWW     +R+IHWV+W  +C PK  GG GFRD+ +FN A LAKQ WR++  P S  ++V K RYFP  DF++  +    S+
Subjt:  CFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSY

Query:  IWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISL
         WRS++ GR+LL KG+R++ GN + + + +   +    R      +  S   +G   +        V  L +     W E LL   F+P EV  I+ I L
Subjt:  IWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISL

Query:  RHMLSKDRVIWHFEKCEVYSIKSGYRLSQM--ALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVR
              DR++ HF+K   YS+K+GY ++++   L   TS SS     +  W   WK+ VP ++++  WRL    L T   L+ R
Subjt:  RHMLSKDRVIWHFEKCEVYSIKSGYRLSQM--ALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVR

XP_030922958.1 uncharacterized protein LOC115949824 [Quercus lobata]7.1e-11331.05Show/hide
Query:  SYSPNHIDGWVDRGVS--LWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF
        SYS +HID  +    S  +WRF+  YG PQ+ ++  +W+LL+ LR      W   GDF  ILS  E  GG  ++++++  F   +++C   DLG+ G  F
Subjt:  SYSPNHIDGWVDRGVS--LWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF

Query:  TWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSARWRTEAPILFARGVGISSFQPFHREEVLRALNHIHPNKAPRPDG
        TWCN +SG   ++ R+DR    +   + F   ++ H+  S SDH  L                                         ++   KAP+   
Subjt:  TWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSARWRTEAPILFARGVGISSFQPFHREEVLRALNHIHPNKAPRPDG

Query:  LSGVFYQHSWSVVGP--DVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVD
             ++  W+      +V N  LN+LN+ +S A +N+T I L+PK K+P  + D++PISLCNV YK++SK L NR+K IL  +IS NQSAF+    + D
Subjt:  LSGVFYQHSWSVVGP--DVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVD

Query:  NTILGYECIHTLKAKQVGR---------TGKAYDRVK--VFDKEIH---------------FPRTCSYCVLKGSL-------------------------
        N ++ YE +H +K K+ G            KA+DRV+    DK +                        ++ GS+                         
Subjt:  NTILGYECIHTLKAKQVGR---------TGKAYDRVK--VFDKEIH---------------FPRTCSYCVLKGSL-------------------------

Query:  -------------------------------------ALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSST----------IMG--------
                                              LF +A   E   +  IL+ YE A GQ IN +KS I FS +T          I+G        
Subjt:  -------------------------------------ALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSST----------IMG--------

Query:  -----------------------VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPK
                               V  ++  WKGK+ S+GG+E+L+K++ QAIP YTM+CF LP  L  D+ R +  FWW   +++ ++ WV WK MCKPK
Subjt:  -----------------------VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPK

Query:  SLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVAD
        SLGG GFR+L  FN A LAKQ WRI   P+S  +R+LK +YFP  D LNVG+G+ PS++WRS+    E++  G RWR GN   + +              
Subjt:  SLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVAD

Query:  WCEEDLSTTARGQ-LGMVSDF-SRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRL-SQMALLAQTSSS
        W ++ L T +  + +  + DF     V+ L +     W   ++R  F P E   IL I L   L +D +IW   K  V+++KS Y + +++        S
Subjt:  WCEEDLSTTARGQ-LGMVSDF-SRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRL-SQMALLAQTSSS

Query:  SSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGIDV
        SS +S S  WK  WKM +P +IKIF WR C   L     ++ RGI +
Subjt:  SSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGIDV

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]2.9e-11429.35Show/hide
Query:  SYSPNHIDGWVDRGV-SLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFT
        S+S NHID  VD GV   WRF+  YG P    +  +WSLL+ L       WL  GDF  I   +E  G   + E ++  F  A+D+C L DLGF G  FT
Subjt:  SYSPNHIDGWVDRGV-SLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFT

Query:  WCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEA-------------VSARW------------------------------
        WCN R+G   VW R+DR +  +     FP   + H+D   SDH+P+LL  ++               A W                              
Subjt:  WCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEA-------------VSARW------------------------------

Query:  ------------------------------------------------RTEAPILFAR------------------------------------------
                                                        RTE   L ++                                          
Subjt:  ------------------------------------------------RTEAPILFAR------------------------------------------

Query:  ----------GVG----------------------ISSFQP-------------FHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLN
                   +G                      ++S QP             +  EEVL AL  + P  AP PDG+S +FY+  W +VG DV++  L+
Subjt:  ----------GVG----------------------ISSFQP-------------FHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLN

Query:  ILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG----
         LN+ +    LN T I LIPK K+PK + +++PISLCNV YK+++KV+VNR+K IL  +I  +QSAF+ G  + DN ++ +E +H LK K  GR G    
Subjt:  ILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG----

Query:  -----KAYDRVK-VF----DKEIHFP-----------RTCSYCVL-------------KGSL---------------------ALFFRAKEGEARVVHGI
             KAYD+V+ VF     + + FP            T SY VL              G +                      LF RA+  E + +  +
Subjt:  -----KAYDRVK-VF----DKEIHFP-----------RTCSYCVL-------------KGSL---------------------ALFFRAKEGEARVVHGI

Query:  LQYYERAFGQTINFDKSIISFSSST----------IMGVQA-------------------------------QIQGWKGKVFSVGGREVLLKSIVQAIPC
        L  YE+  GQ IN +K+ I FS++T          ++GV A                               ++QGWK K+ S  GREVL+KS++QAIP 
Subjt:  LQYYERAFGQTINFDKSIISFSSST----------IMGVQA-------------------------------QIQGWKGKVFSVGGREVLLKSIVQAIPC

Query:  YTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGS
        YTM+CF LPK LI ++   + +FWW    E+R++HWV W+ +C+ K  GG GF+D+E FN + LAKQ WR++  P S   RV K R+FP    L     S
Subjt:  YTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGS

Query:  RPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHFIL
          SY W+S+I  R+++ +G+ WR GN   VR+R             W     + +    L  V+  ++  V+ L +S    WN  L+   F P +   I 
Subjt:  RPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHFIL

Query:  TISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGI
         + L   L  DR+IW      +++ KS Y+L         + +SS  +   +W+  W++ VPN++K F WR C + L T+ NL  R I
Subjt:  TISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGI

XP_039834390.1 uncharacterized protein LOC120695147 [Panicum virgatum]1.4e-11332.02Show/hide
Query:  LSYSPNHIDGWVDRG-VSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF
        LS+S  HID  +  G  S W+ + IYG  +   K  TW LL+ L+ +    WL  GDF  IL + E EGG P+AE     F  A++EC+L DLGF G  F
Subjt:  LSYSPNHIDGWVDRG-VSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF

Query:  TWCNGRS-GEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLS---------------MEAVSARWRTEAPI--------------------
        TW N        V ER+DR + N A Q  FP+  + + D   SDHRP+++                M+   ARW  E                       
Subjt:  TWCNGRS-GEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLS---------------MEAVSARWRTEAPI--------------------

Query:  -----LFARGVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISL
                R +  +   PF  +EV  AL  +   KAP  DG+  VFY+  WS+VG  V    L +LN  V     N+T+IVL+PK KSP+ + D +PISL
Subjt:  -----LFARGVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISL

Query:  CNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------KAYDRV-------------------------
        CNV YK++SKVL NR+K +L  +IS +QSAF+PG  + DN +L YE  H L  ++ G+ G         KAYDRV                         
Subjt:  CNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------KAYDRV-------------------------

Query:  ----------------KVFDK--------------------------------EIHFPRTC------SYCVLKGSLALFFRAKEGEARVVHGILQYYERA
                        K++ +                                 I   R C      ++        +  RA +G+A+ +  ILQ YE  
Subjt:  ----------------KVFDK--------------------------------EIHFPRTC------SYCVLKGSLALFFRAKEGEARVVHGILQYYERA

Query:  FGQTINFDKSIISFSSST----------------------IMG-------------------VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFC
         GQ IN DKS + FS +T                       +G                   V  +IQGW+ K+ S  G+E+L+K++ QAIP Y M+CF 
Subjt:  FGQTINFDKSIISFSSST----------------------IMG-------------------VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFC

Query:  LPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWR
        L K L  ++S  + R+WW+ + ED +IHW++W  +  PK+ GG GFRDL +FNQA LA+Q WR++    +   +VLK +YFP G  L        SY WR
Subjt:  LPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWR

Query:  SLIWGRELLGKGIRWRTGNKKKVRV-RSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRH
        S++ G +L+ +G+ WR GN + V +   P      TR           T RG + +        V  L +    SW+E L++  FS  +   IL + + +
Subjt:  SLIWGRELLGKGIRWRTGNKKKVRV-RSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRH

Query:  MLSKDRVIWHFEKCEVYSIKSGYRL------SQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGIDV
        +  +DR  WHF+K  ++S+KS Y+L      ++M   A +S S    +    W   W M VPN++K+F+WRL    L+   NL  RG+ V
Subjt:  MLSKDRVIWHFEKCEVYSIKSGYRL------SQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGIDV

TrEMBL top hitse value%identityAlignment
A0A1R3GNW3 Reverse transcriptase2.4e-11428.64Show/hide
Query:  QQLSYSPNHIDGWVDRGVSL--WRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKG
        Q +S+S +HID WVD    L  WR +  YG    G + ++W LL+    +   +W   GDF  +L   E +GGR + EA++ +F  A+D+C L D+G++G
Subjt:  QQLSYSPNHIDGWVDRGVSL--WRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKG

Query:  ATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSAR----------------WRTEAPI--------LFARGVGI-
          FTW  G    + + ER+DR +        FP   +TH+  S SDH P+LL+ E    R                W  EA           F  G+G+ 
Subjt:  ATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSAR----------------WRTEAPI--------LFARGVGI-

Query:  --------------------------------------------------------------------------------SSF-----------------
                                                                                        +SF                 
Subjt:  --------------------------------------------------------------------------------SSF-----------------

Query:  ------------------------------------------------------QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCC
                                                                F  EE+  AL  IHP KAP PDG+   F++  W +VG DV + C
Subjt:  ------------------------------------------------------QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCC

Query:  LNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG--
        L+  +  +  +  N+T IVLIPK   PK IT ++PISLCNV YKI+SKVLVNR+K IL V IS +QSAF+PG  + DN ++ +E +H+LK ++ G+ G  
Subjt:  LNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG--

Query:  -------KAYDRVK-----------VFDKE-----IHFPRTCSYCVL--------------------------------------KGSL-----------
               KAYDRV+            FD+      +   R+ S+ V+                                      KG             
Subjt:  -------KAYDRVK-----------VFDKE-----IHFPRTCSYCVL--------------------------------------KGSL-----------

Query:  --------------ALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSS----------------------------TIMG-------------
                       LF +A   E+  V   L+ YE   GQ INF+KS++ FS +                            T +G             
Subjt:  --------------ALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSS----------------------------TIMG-------------

Query:  VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCW
        +  +I  W  +  S GGREV++KS++QAIP Y MN F  P+ L +DI+  ++RFWW    + R I+W+ W+ +CK K  GG GFRD+E FNQA LAKQ W
Subjt:  VQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCW

Query:  RIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRN
        +++ +P S  +RVLK RYFP   FL    G  PS+ WRS++ GR+LL  G+RWR GN + VR+              W  +          G + D S  
Subjt:  RIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRN

Query:  AVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSS--WWKGCWKMGVPNQIKI
         VA L +    SW+ +L+R  F   E   I+ I L + L  D ++WHF+   +YS+KSGYR+  +  L   +  +  E L    ++   W   VP ++++
Subjt:  AVAFLENSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSS--WWKGCWKMGVPNQIKI

Query:  FLWRLCLEQLSTVDNLSVRGIDV
        F WRL  E L  +D+L  R +DV
Subjt:  FLWRLCLEQLSTVDNLSVRGIDV

A0A2N9G7B6 Uncharacterized protein3.1e-11426.63Show/hide
Query:  PWAFDRSLVVIKEPRDVTSELETEFNDCLFWVQIHNVPMKWPTKDMAQHLGGKIGVVEEVGSSNENDWIGSILRIRFVFGWTDHFEGGSICMMMMGRILG
        PW +D+ LV+++   +  +  E  F +  FWVQ+H +P++    ++A  LG  +G +  V    +    G  +RIR     T     G   M+  GR + 
Subjt:  PWAFDRSLVVIKEPRDVTSELETEFNDCLFWVQIHNVPMKWPTKDMAQHLGGKIGVVEEVGSSNENDWIGSILRIRFVFGWTDHFEGGSICMMMMGRILG

Query:  VQFCHSR--RKCEGVGASSS---------------EVGEDQYGEWLRVG--------VFLGEGIRR--------EPEGDVHENEASVSESTVGEGLSVPS
        + F + R    C   G  S                ++ + Q+G WLR              EGI++        +P         + ++S     L   +
Subjt:  VQFCHSR--RKCEGVGASSS---------------EVGEDQYGEWLRVG--------VFLGEGIRR--------EPEGDVHENEASVSESTVGEGLSVPS

Query:  IDTQPESCTVEGGKKARWGHNVVVES-----------------------------GGSFQSAID-----------NCPKDYVAG-----------RLERA
        I T P   T    +      + ++ S                             G + Q +I            NC      G           RL RA
Subjt:  IDTQPESCTVEGGKKARWGHNVVVES-----------------------------GGSFQSAID-----------NCPKDYVAG-----------RLERA

Query:  NS----VLQQNWEQ-------NCHITAHCVYLHRNSRVFYTIFLL---TLGTQQLSYSPNHIDGWVDRG-VSLWRFSRIYGFPQAGLKVMTWSLLKHLRG
               L + W+         C +     ++ ++      + LL    +  +  S+S +HID  V+      WRF+  YG P+   +  +W+LL+ L  
Subjt:  NS----VLQQNWEQ-------NCHITAHCVYLHRNSRVFYTIFLL---TLGTQQLSYSPNHIDGWVDRG-VSLWRFSRIYGFPQAGLKVMTWSLLKHLRG

Query:  NPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRP
             W   GDF  ++  +E +G   ++E ++  F   +DEC  +DLGF G  FTW N R  + T WER+DR +        FP  ++ H++   SDH+P
Subjt:  NPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRP

Query:  LLLSME------------------------AVSARWRTEA------------------------------------------------------------
        + ++ E                         V   W+ EA                                                            
Subjt:  LLLSME------------------------AVSARWRTEA------------------------------------------------------------

Query:  -------PILFARGVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYK
                ++    +     + F  EEV  AL  + P KAP PDGL  +FYQH W ++G DV    L  LN+      +N T I LIPK ++P+ + +++
Subjt:  -------PILFARGVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYK

Query:  PISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------KAYDRVK------VFDKE---------
        PISLCNV YKI+SKVL NR+K +L  ++S +QSAFIPG  + DN ++ +E +H ++ ++ G+ G         KAYDRV+      V +K          
Subjt:  PISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------KAYDRVK------VFDKE---------

Query:  -IHFPRTCSYCVLKGSLALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSSTIMGVQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFC
         +    T SY +L          KE  A  + G+           + F    + F  +T   V  +IQ    +     GRE+L+KS+ QAIP Y M+CF 
Subjt:  -IHFPRTCSYCVLKGSLALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSSTIMGVQAQIQGWKGKVFSVGGREVLLKSIVQAIPCYTMNCFC

Query:  LPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWR
        LP +LI +I   + RFWW  + E  ++HW+ W T+CKPK  GG G RDL  FN+A LAKQ WR++  P+S   +V K +YF       V   ++ SY WR
Subjt:  LPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWR

Query:  SLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDF--SRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISLR
        S++  R+L+ KG  WR G  K + +              W ++ L  +   ++        S + V  L +S   SW  EL++  F P E   IL I L 
Subjt:  SLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDF--SRNAVAFLENSSFSSWNEELLRHHFSPREVHFILTISLR

Query:  HMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGI
             D +IW   K  +Y+++SGY L           +S +  +S  WK  W + +P++I+ FLWR C   L T  NL  R +
Subjt:  HMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGI

A0A2N9HNV6 Uncharacterized protein4.3e-11633.67Show/hide
Query:  SYSPNHIDGWV--DRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF
        SYSP HID  +  + GV  WRF+  YG P+   +  +W+L++ L       WLV GDF  I+ + E  G R +A+  +++F  A+  CEL DLGF+G  F
Subjt:  SYSPNHIDGWV--DRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATF

Query:  TWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDH-----RPLLLSMEAVSARW---RTEAPILF----------ARGVGISS--------
        TW N +S E  ++ ++DR ++      LFP   +  + F  SDH     +  +L +      W   R E   L           +  +GI          
Subjt:  TWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDH-----RPLLLSMEAVSARW---RTEAPILF----------ARGVGISS--------

Query:  ---------FQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYK
                  + F   EV +AL  +H  KAP PDG+S +F+Q  W +VG D  +   + LN+    + +N T IV+IPK KSP+ I  ++PISLCNV YK
Subjt:  ---------FQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYK

Query:  IMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRT---------GKAYDRVKVFDKEIHFPR----------------TCSY
         +SKVL NR+K IL  +IS +QSAF+    + DN ++ +E +H LK K+ G+T          KAYDRV+    E    R                T SY
Subjt:  IMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRT---------GKAYDRVKVFDKEIHFPR----------------TCSY

Query:  CVLKGSLA----------------------------------LFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSSTIMGVQAQIQGWKGKVFS
         VL   L+                                  LFF A   E R+V  IL  YE A G      +S I         V  ++QGWK K+ S
Subjt:  CVLKGSLA----------------------------------LFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSSTIMGVQAQIQGWKGKVFS

Query:  VGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVL
          GRE+L+K++ QAIP Y M+CFC+P  +   I+     FWW      R++HW SW  MC  K+ GG GFR+L+ FN A LAKQ WRI+ +P S   R+ 
Subjt:  VGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVL

Query:  KGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWN
        K +YFP   F+   + +  S+IWRS+   REL+ KG RW  G+   +R+             +W              ++   +R  V  L + SF SWN
Subjt:  KGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWN

Query:  EELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSW---WKGCWKMGVPNQIKIFLWRLC
        EEL+   F P E   I  I L  +   DR IW      ++S++S Y +  +   +   S  +S S S W   WK  W + VP +IK+F+W+ C
Subjt:  EELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSW---WKGCWKMGVPNQIKIFLWRLC

A0A2N9HNV6 Uncharacterized protein3.4e-0433.33Show/hide
Query:  PWAFDRSLVVIKEPRDVTSELETEFNDCLFWVQIHNVPMKWPTKDMAQHLGGKIGVVEEVGSSNEN
        PW    + +++KE     +  E +FN   FW+Q+H+VP  W +K+  + +G ++G+V+EV  + EN
Subjt:  PWAFDRSLVVIKEPRDVTSELETEFNDCLFWVQIHNVPMKWPTKDMAQHLGGKIGVVEEVGSSNEN

A0A2N9HNV6 Uncharacterized protein7.4e-11630.78Show/hide
Query:  SYSPNHIDGWV-DRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFT
        S S  HI   V   G S W  +  YG P+A  +  +W LL+ + G    +WL  GDF  I+SH ++  G  +  A + +F   +D C+L+D     + FT
Subjt:  SYSPNHIDGWV-DRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFT

Query:  WCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSM-------------------------------EAVSARWRTE---------
        WCNG    + V ER+DR L NL   + F   ++  +D+  SDHR L++ +                                 +  +WR E         
Subjt:  WCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSM-------------------------------EAVSARWRTE---------

Query:  ---------------------------APILFA------RGVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNR
                                     +L A       G+  +  QPF  EEV++A+  +HP KAP  DGL  +FYQ+ WS V  DVV   LNIL+  
Subjt:  ---------------------------APILFA------RGVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNR

Query:  VSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------
             LNET+I LIPK + P  + +Y+PISLCNV YKI+SK LVNR++  L+ +IS +QSAF+ G  + DN I+GYE +H ++ K   R G         
Subjt:  VSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTG---------

Query:  -KAYDRVK--------------------------------VFDKEIH---------------FPRTCSYCVLKGSLAL----------------------
         KAYDRV+                                + + EI                FP    +C    S  L                      
Subjt:  -KAYDRVK--------------------------------VFDKEIH---------------FPRTCSYCVLKGSLAL----------------------

Query:  ----------FFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSST----------IMGVQA-------------------------------QIQ
                  FF A     +  + +L  Y  A GQ +NF KS + F  +            +GV+                                 ++
Subjt:  ----------FFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSST----------IMGVQA-------------------------------QIQ

Query:  GWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYP
        GWKG +FS+GG EVL+K+IVQAIP YTM+ + L K  I  I R  ARFWW    + ++IHW  WK +C+PK  GG GFRDLE+FNQA LAKQ WR ++ P
Subjt:  GWKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYP

Query:  TSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLE
         S  ++VLK  YFP    L    G+  S++WRSL+WG+E++ KG RWR GN ++VRV     L   T    + +         QL +V     + V    
Subjt:  TSFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLE

Query:  NSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLE
              W+E  +R +F+  +   IL +   H   +D+V+WH+ +   Y+++SGYR++  A + ++ ++S  + +  WW   WK+ +  ++K F+W+L   
Subjt:  NSSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLE

Query:  QLSTVDNLSVRGI
         L T  NL +R +
Subjt:  QLSTVDNLSVRGI

A0A803NHG3 Uncharacterized protein4.1e-11430.48Show/hide
Query:  SYSPNHIDGWV-DRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFT
        S S  HI   V   G S W  +  YG P+A  +  +W LL+ + G    SWL  GDF  I+S  E  GGR +  A + +F   +D C+L+D     + FT
Subjt:  SYSPNHIDGWV-DRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGRPKAEAELSSFHGAIDECELLDLGFKGATFT

Query:  WCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVS--------ARWRTEAP------------------------------
        WCNG    + V ER+DR L NL   + F   ++  +D+  SDHR L++ +             RWR E                                
Subjt:  WCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVS--------ARWRTEAP------------------------------

Query:  ---------------------------------ILFAR--------------------------GVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGV
                                         +L  +                          G+  +  QPF  EEV++A+  +HP KAP  DGL  +
Subjt:  ---------------------------------ILFAR--------------------------GVGISSFQPFHREEVLRALNHIHPNKAPRPDGLSGV

Query:  FYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGY
        FYQ  WS V  DVV  CLNIL+       LNET+  LIPK + P  + +Y+PISLCNV YKI+SK L NR++  L+ +IS +QSAF+ G  + DN I+GY
Subjt:  FYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFIPGHCVVDNTILGY

Query:  ECIHTLKAKQVGRTG----------KAYDRVK-VFDKEI--------HFPRTCSYCVLKGSLALFFRAK-EGEARVVHGILQ------------------
        E +H ++ K   R G          KAYDRV+  F +E+         +      CV   + +     + +G+     GI Q                  
Subjt:  ECIHTLKAKQVGRTG----------KAYDRVK-VFDKEI--------HFPRTCSYCVLKGSLALFFRAK-EGEARVVHGILQ------------------

Query:  --YYERAFGQTINF---------------DKSIISFSS-------------------STIMGVQA-------------------------------QIQG
            E A  Q +                 D S+I F                     +  +GV+                                +++G
Subjt:  --YYERAFGQTINF---------------DKSIISFSS-------------------STIMGVQA-------------------------------QIQG

Query:  WKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPT
        WKG +FS+G +EVL+K+IVQAIP YTM+C+ L K  I  I R  ARFWW    + ++IHW  W+ +C+PK  GG GFRDLE+FNQA LAKQ WR ++ P 
Subjt:  WKGKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPT

Query:  SFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLEN
        S  S+VLK  YFP    L    G+  S++WRSL+WG+E++ KG RWR GN ++VRV     L   T    + +         QL +V     + V     
Subjt:  SFPSRVLKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLEN

Query:  SSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQ
             W+E  +R +F+  +   IL +   H   +D+V+WH+ +   Y+++SGYR++  A + ++ ++   + +  WW+  WK+ +P ++K F W+L    
Subjt:  SSFSSWNEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQ

Query:  LSTVDNLSVRGI
        L T  NL  R +
Subjt:  LSTVDNLSVRGI

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.2e-1130.57Show/hide
Query:  QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKK-KSPKCITDYKPISLCNVWYKIMSKVLVNR
        +P    E++  +N +   K+P PDG +  FYQ     + P ++    +I    +      E  I+LIPK  +      +++PISL N+  KI++K+L NR
Subjt:  QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKK-KSPKCITDYKPISLCNVWYKIMSKVLVNR

Query:  MKGILNVLISHNQSAFIPGHCVVDNTILGYECI-HTLKAKQVGRTGKAYDRVKVFDK
        ++  +  LI H+Q  FIPG     N       I H  +AK       + D  K FDK
Subjt:  MKGILNVLISHNQSAFIPGHCVVDNTILGYECI-HTLKAKQVGRTGKAYDRVKVFDK

P08548 LINE-1 reverse transcriptase homolog3.3e-1231.87Show/hide
Query:  QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKK-KSPKCITDYKPISLCNVWYKIMSKVLVNR
        +P    E+   + ++   K+P PDG +  FYQ     + P ++N   NI    +      E  I LIPK  K P    +Y+PISL N+  KI++K+L NR
Subjt:  QPFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKK-KSPKCITDYKPISLCNVWYKIMSKVLVNR

Query:  MKGILNVLISHNQSAFIPGHCVVDNTILGYECI-HTLKAKQVGRTGKAYDRVKVFDKEIH
        ++  +  +I H+Q  FIPG     N       I H  K K       + D  K FD   H
Subjt:  MKGILNVLISHNQSAFIPGHCVVDNTILGYECI-HTLKAKQVGRTGKAYDRVKVFDKEIH

P11369 LINE-1 retrotransposable element ORF2 protein2.4e-1032.08Show/hide
Query:  PFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPK-KKSPKCITDYKPISLCNVWYKIMSKVLVNRM
        P   +E+   +N +   K+P PDG S  FYQ     + P +      I           E  I LIPK +K P  I +++PISL N+  KI++K+L NR+
Subjt:  PFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPK-KKSPKCITDYKPISLCNVWYKIMSKVLVNRM

Query:  KGILNVLISHNQSAFIPGHCVVDNTILGYECIHTL-KAKQVGRTGKAYDRVKVFDKEIH
        +  +  +I  +Q  FIPG     N       IH + K K       + D  K FDK  H
Subjt:  KGILNVLISHNQSAFIPGHCVVDNTILGYECIHTL-KAKQVGRTGKAYDRVKVFDKEIH

P14381 Transposon TX1 uncharacterized 149 kDa protein2.1e-1428.66Show/hide
Query:  PFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMK
        P   +E+ +AL  +  NK+P  DGL+  F+Q  W  +GPD                     ++ L+PKK   + I +++P+SL +  YKI++K +  R+K
Subjt:  PFHREEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMK

Query:  GILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTGKAYDRVKVFDKEIH
         +L  +I  +QS  +PG  + DN  L  + +H  +   +     + D+ K FD+  H
Subjt:  GILNVLISHNQSAFIPGHCVVDNTILGYECIHTLKAKQVGRTGKAYDRVKVFDKEIH

P93295 Uncharacterized mitochondrial protein AtMg003106.4e-3250.4Show/hide
Query:  AIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPK-SLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLN
        A+P Y M+CF L K L   ++ AM  FWW+  +  R+I WV+W+ +CK K   GG GFRDL  FNQA LAKQ +RI+  P +  SR+L+ RYFP    + 
Subjt:  AIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPK-SLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLN

Query:  VGVGSRPSYIWRSLIWGRELLGKGI
          VG+RPSY WRS+I GRELL +G+
Subjt:  VGVGSRPSYIWRSLIWGRELLGKGI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.4e-0530.68Show/hide
Query:  EEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMS
        +E+  A+  +  NKAP PD  +  F+  SW VV    +                N T I LIPK      ++ ++P+S C V YKI++
Subjt:  EEVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMS

AT3G09510.1 Ribonuclease H-like superfamily protein7.0e-1023.56Show/hide
Query:  LKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSW
        +K RYF     L+  V  + SY W SL+ G  LL KG R   G+ + +R+        D  V       L+T    +   +++       F    S+  W
Subjt:  LKGRYFPGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSW

Query:  NEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDN
        ++  +       +  FI  I L      D++IW++     Y+++SGY L          + +            W + +  ++K FLWR   + L+T + 
Subjt:  NEELLRHHFSPREVHFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDN

Query:  LSVRGIDV
        L+ RG+ +
Subjt:  LSVRGIDV

AT4G29090.1 Ribonuclease H-like superfamily protein6.3e-3530.93Show/hide
Query:  AIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNV
        A+P YTM CF LPK +   I   +A FWW   +E + +HW +W  +   K+ GG GF+D+E FN A L KQ WR++  P S  ++V K RYF   D LN 
Subjt:  AIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLNV

Query:  GVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREV
         +GSRPS++W+S+   +E+L +G R   GN + + +     L  D++ A      +      +   VS   +  V+ L + S   W ++++   F   E 
Subjt:  GVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREV

Query:  HFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSE-SLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVR
          I  +        D   W +     Y++KSGY +    +  ++S    SE SL+  ++  WK     +I+ FLW+     L     L+ R
Subjt:  HFILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSE-SLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.5e-3350.4Show/hide
Query:  AIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPK-SLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLN
        A+P Y M+CF L K L   ++ AM  FWW+  +  R+I WV+W+ +CK K   GG GFRDL  FNQA LAKQ +RI+  P +  SR+L+ RYFP    + 
Subjt:  AIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPK-SLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYFPGGDFLN

Query:  VGVGSRPSYIWRSLIWGRELLGKGI
          VG+RPSY WRS+I GRELL +G+
Subjt:  VGVGSRPSYIWRSLIWGRELLGKGI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTACTGTTCCTTGGGCTTTTGATCGGTCTCTTGTTGTGATTAAGGAACCGAGAGATGTTACGTCGGAGTTGGAAACAGAGTTTAATGATTGTCTTTTTTGGGTGCA
GATCCATAATGTCCCTATGAAGTGGCCGACGAAGGACATGGCGCAGCACCTGGGAGGCAAGATTGGGGTGGTTGAGGAAGTGGGCAGTTCAAATGAGAATGATTGGATAG
GCTCTATCTTGCGGATTAGGTTTGTCTTCGGTTGGACTGACCACTTCGAAGGGGGTTCCATATGCATGATGATGATGGGAAGGATTCTTGGTGTCCAGTTTTGTCATTCG
AGAAGGAAATGTGAGGGTGTTGGGGCGAGTTCTTCGGAAGTGGGGGAGGATCAATACGGTGAGTGGCTGAGAGTAGGTGTTTTTCTTGGGGAAGGTATCCGACGGGAACC
AGAGGGTGATGTGCATGAGAATGAGGCAAGTGTATCTGAAAGCACTGTTGGGGAGGGACTGTCAGTACCTTCTATTGATACTCAACCTGAGAGTTGCACTGTGGAGGGAG
GGAAGAAAGCTCGGTGGGGGCATAACGTTGTTGTGGAATCCGGAGGTTCGTTTCAGTCTGCTATTGATAATTGCCCAAAAGATTATGTTGCTGGGCGACTGGAGAGAGCA
AATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACATCACAGCTCACTGTGTATACTTGCACAGGAATTCTAGAGTCTTTTATACCATATTTTTACTCACTCTAGG
AACCCAACAGCTATCATATTCTCCCAATCACATAGATGGGTGGGTGGACAGAGGGGTGAGTCTGTGGCGGTTCTCTAGGATTTATGGCTTTCCCCAAGCGGGGTTGAAGG
TGATGACATGGTCGCTGCTGAAACACTTGCGTGGGAATCCAGGGACGTCGTGGTTGGTGGGTGGAGATTTCATCGCTATTTTGTCTCACCAGGAGAATGAGGGTGGTAGG
CCTAAGGCAGAGGCTGAACTGTCGAGTTTTCACGGTGCAATTGATGAGTGTGAGTTGCTTGATTTGGGTTTCAAGGGGGCTACTTTCACTTGGTGTAATGGGAGGTCGGG
GGAGAAAACAGTATGGGAGAGGATTGATAGATGTTTGACTAATCTGGCATTGCAGGAATTGTTCCCAATGCATGAGTTGACCCATATGGACTTTAGCCGGTCGGATCATA
GGCCCCTGTTGCTCTCGATGGAGGCAGTTTCAGCGAGATGGAGGACTGAGGCTCCCATTTTGTTCGCCCGTGGAGTTGGCATTAGTAGCTTCCAGCCTTTTCATCGTGAA
GAGGTTCTTCGAGCGTTGAATCATATCCATCCCAATAAAGCTCCTAGGCCAGATGGTCTGTCTGGAGTCTTCTACCAGCATTCTTGGTCGGTAGTTGGGCCTGATGTGGT
TAATTGTTGTTTGAACATTCTAAATAACAGAGTCTCTTCAGCACCTTTGAATGAAACGATGATTGTGTTGATCCCTAAGAAGAAGAGCCCCAAGTGTATCACAGATTACA
AGCCAATCTCTTTGTGCAATGTTTGGTACAAAATTATGTCTAAGGTTTTGGTGAATCGCATGAAGGGTATACTCAATGTGTTGATTTCCCATAATCAGAGTGCTTTTATT
CCAGGGCACTGCGTTGTCGATAACACCATCCTAGGGTATGAATGCATTCATACTCTGAAAGCGAAGCAGGTTGGGAGAACAGGCAAGGCTTATGACAGAGTAAAGGTCTT
CGATAAGGAGATCCATTTTCCTCGTACCTGTTCTTATTGTGTGCTGAAGGGCTCTCTCGCTCTTTTTTTTCGGGCTAAAGAGGGTGAAGCTCGGGTTGTGCATGGCATCC
TCCAGTACTATGAGCGAGCATTCGGGCAAACTATAAACTTTGATAAGTCTATCATCTCCTTTAGTTCGAGCACAATAATGGGTGTTCAAGCTCAGATTCAAGGTTGGAAG
GGCAAGGTTTTCTCTGTAGGGGGCAGAGAGGTTTTGCTAAAATCCATTGTGCAGGCTATCCCATGTTACACTATGAATTGTTTCTGCTTGCCTAAAAAACTGATACATGA
CATTAGTAGGGCCATGGCGCGATTTTGGTGGAATGGGGATAAGGAGGATAGAAGGATCCATTGGGTGAGTTGGAAGACGATGTGCAAGCCAAAAAGTTTGGGTGGAAGGG
GTTTCAGAGATTTGGAAATTTTCAACCAAGCATTTCTGGCTAAACAGTGTTGGAGGATTGTCCAGTATCCTACTTCGTTTCCTTCTCGTGTGTTGAAGGGGCGATATTTT
CCTGGTGGTGATTTCTTGAATGTAGGGGTGGGCTCCCGTCCCTCGTATATTTGGAGGAGTCTAATTTGGGGGAGGGAGCTGTTGGGAAAGGGTATTCGCTGGAGGACTGG
GAATAAGAAGAAAGTTAGGGTACGGTCCCCGGTTACCTTGGAACCTGATACTCGAGTGGCTGATTGGTGTGAAGAAGACTTGTCAACAACGGCCCGCGGACAACTCGGAA
TGGTCTCTGATTTTTCCAGAAATGCGGTCGCATTTCTGGAGAATTCCAGCTTTTCCTCTTGGAATGAAGAGCTTCTTCGGCATCACTTTAGCCCTCGCGAGGTACATTTT
ATCCTTACTATTTCCTTGCGGCATATGTTGTCTAAGGATAGGGTCATATGGCACTTTGAGAAGTGTGAGGTCTATTCGATTAAGAGTGGGTATCGGCTTAGCCAAATGGC
TTTGCTTGCTCAAACCTCATCTTCTTCTTCGAGTGAGTCACTGTCCAGTTGGTGGAAGGGGTGTTGGAAAATGGGGGTCCCAAACCAAATTAAGATTTTCTTGTGGAGGC
TTTGCTTGGAGCAATTATCCACTGTTGATAATTTGAGTGTCAGAGGCATTGATGTTTTGAATAAGTTTAGAGGGCGGGTTCCTTTAGTGGGGCTTGTGGATTGGGCTGTG
AATTATCTTGCTGTTTTTCGAGGGGCATGTCGGATCTACTCAGAGGGGATTAGAGGAGTGCCCTGGAGTGCAGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTT
AATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGAAGGCTAGCGTCGAGACGCTAGCTCTTGAGC
GTCTCGACGCGCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTACGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCG
TATAAGAGGCAGCGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTACTGTTCCTTGGGCTTTTGATCGGTCTCTTGTTGTGATTAAGGAACCGAGAGATGTTACGTCGGAGTTGGAAACAGAGTTTAATGATTGTCTTTTTTGGGTGCA
GATCCATAATGTCCCTATGAAGTGGCCGACGAAGGACATGGCGCAGCACCTGGGAGGCAAGATTGGGGTGGTTGAGGAAGTGGGCAGTTCAAATGAGAATGATTGGATAG
GCTCTATCTTGCGGATTAGGTTTGTCTTCGGTTGGACTGACCACTTCGAAGGGGGTTCCATATGCATGATGATGATGGGAAGGATTCTTGGTGTCCAGTTTTGTCATTCG
AGAAGGAAATGTGAGGGTGTTGGGGCGAGTTCTTCGGAAGTGGGGGAGGATCAATACGGTGAGTGGCTGAGAGTAGGTGTTTTTCTTGGGGAAGGTATCCGACGGGAACC
AGAGGGTGATGTGCATGAGAATGAGGCAAGTGTATCTGAAAGCACTGTTGGGGAGGGACTGTCAGTACCTTCTATTGATACTCAACCTGAGAGTTGCACTGTGGAGGGAG
GGAAGAAAGCTCGGTGGGGGCATAACGTTGTTGTGGAATCCGGAGGTTCGTTTCAGTCTGCTATTGATAATTGCCCAAAAGATTATGTTGCTGGGCGACTGGAGAGAGCA
AATTCTGTGCTGCAGCAAAACTGGGAGCAGAACTGCCACATCACAGCTCACTGTGTATACTTGCACAGGAATTCTAGAGTCTTTTATACCATATTTTTACTCACTCTAGG
AACCCAACAGCTATCATATTCTCCCAATCACATAGATGGGTGGGTGGACAGAGGGGTGAGTCTGTGGCGGTTCTCTAGGATTTATGGCTTTCCCCAAGCGGGGTTGAAGG
TGATGACATGGTCGCTGCTGAAACACTTGCGTGGGAATCCAGGGACGTCGTGGTTGGTGGGTGGAGATTTCATCGCTATTTTGTCTCACCAGGAGAATGAGGGTGGTAGG
CCTAAGGCAGAGGCTGAACTGTCGAGTTTTCACGGTGCAATTGATGAGTGTGAGTTGCTTGATTTGGGTTTCAAGGGGGCTACTTTCACTTGGTGTAATGGGAGGTCGGG
GGAGAAAACAGTATGGGAGAGGATTGATAGATGTTTGACTAATCTGGCATTGCAGGAATTGTTCCCAATGCATGAGTTGACCCATATGGACTTTAGCCGGTCGGATCATA
GGCCCCTGTTGCTCTCGATGGAGGCAGTTTCAGCGAGATGGAGGACTGAGGCTCCCATTTTGTTCGCCCGTGGAGTTGGCATTAGTAGCTTCCAGCCTTTTCATCGTGAA
GAGGTTCTTCGAGCGTTGAATCATATCCATCCCAATAAAGCTCCTAGGCCAGATGGTCTGTCTGGAGTCTTCTACCAGCATTCTTGGTCGGTAGTTGGGCCTGATGTGGT
TAATTGTTGTTTGAACATTCTAAATAACAGAGTCTCTTCAGCACCTTTGAATGAAACGATGATTGTGTTGATCCCTAAGAAGAAGAGCCCCAAGTGTATCACAGATTACA
AGCCAATCTCTTTGTGCAATGTTTGGTACAAAATTATGTCTAAGGTTTTGGTGAATCGCATGAAGGGTATACTCAATGTGTTGATTTCCCATAATCAGAGTGCTTTTATT
CCAGGGCACTGCGTTGTCGATAACACCATCCTAGGGTATGAATGCATTCATACTCTGAAAGCGAAGCAGGTTGGGAGAACAGGCAAGGCTTATGACAGAGTAAAGGTCTT
CGATAAGGAGATCCATTTTCCTCGTACCTGTTCTTATTGTGTGCTGAAGGGCTCTCTCGCTCTTTTTTTTCGGGCTAAAGAGGGTGAAGCTCGGGTTGTGCATGGCATCC
TCCAGTACTATGAGCGAGCATTCGGGCAAACTATAAACTTTGATAAGTCTATCATCTCCTTTAGTTCGAGCACAATAATGGGTGTTCAAGCTCAGATTCAAGGTTGGAAG
GGCAAGGTTTTCTCTGTAGGGGGCAGAGAGGTTTTGCTAAAATCCATTGTGCAGGCTATCCCATGTTACACTATGAATTGTTTCTGCTTGCCTAAAAAACTGATACATGA
CATTAGTAGGGCCATGGCGCGATTTTGGTGGAATGGGGATAAGGAGGATAGAAGGATCCATTGGGTGAGTTGGAAGACGATGTGCAAGCCAAAAAGTTTGGGTGGAAGGG
GTTTCAGAGATTTGGAAATTTTCAACCAAGCATTTCTGGCTAAACAGTGTTGGAGGATTGTCCAGTATCCTACTTCGTTTCCTTCTCGTGTGTTGAAGGGGCGATATTTT
CCTGGTGGTGATTTCTTGAATGTAGGGGTGGGCTCCCGTCCCTCGTATATTTGGAGGAGTCTAATTTGGGGGAGGGAGCTGTTGGGAAAGGGTATTCGCTGGAGGACTGG
GAATAAGAAGAAAGTTAGGGTACGGTCCCCGGTTACCTTGGAACCTGATACTCGAGTGGCTGATTGGTGTGAAGAAGACTTGTCAACAACGGCCCGCGGACAACTCGGAA
TGGTCTCTGATTTTTCCAGAAATGCGGTCGCATTTCTGGAGAATTCCAGCTTTTCCTCTTGGAATGAAGAGCTTCTTCGGCATCACTTTAGCCCTCGCGAGGTACATTTT
ATCCTTACTATTTCCTTGCGGCATATGTTGTCTAAGGATAGGGTCATATGGCACTTTGAGAAGTGTGAGGTCTATTCGATTAAGAGTGGGTATCGGCTTAGCCAAATGGC
TTTGCTTGCTCAAACCTCATCTTCTTCTTCGAGTGAGTCACTGTCCAGTTGGTGGAAGGGGTGTTGGAAAATGGGGGTCCCAAACCAAATTAAGATTTTCTTGTGGAGGC
TTTGCTTGGAGCAATTATCCACTGTTGATAATTTGAGTGTCAGAGGCATTGATGTTTTGAATAAGTTTAGAGGGCGGGTTCCTTTAGTGGGGCTTGTGGATTGGGCTGTG
AATTATCTTGCTGTTTTTCGAGGGGCATGTCGGATCTACTCAGAGGGGATTAGAGGAGTGCCCTGGAGTGCAGAAATGGTTGATTGGAGCTATAATGCCATATTAGAGTT
AATCGGGTGCTCGGGGCGTGAAAAGATGCAAAGGAATGAAAAGAGTAAAAGTGGAGAAAAGTCAAATCTCGGTCAACAGAAGGCTAGCGTCGAGACGCTAGCTCTTGAGC
GTCTCGACGCGCACATTCCATATCAGATTAGGCGCGTAAAGCTTACAGCGTCGAGACGCTACGATAGGAAGCGTCCCGACGCTACCGTTTTTCCTTATTCAGAACGCGCG
TATAAGAGGCAGCGTCGCGACGCTGTCTTGACAGCGTCTCGACGCTAA
Protein sequenceShow/hide protein sequence
MGTVPWAFDRSLVVIKEPRDVTSELETEFNDCLFWVQIHNVPMKWPTKDMAQHLGGKIGVVEEVGSSNENDWIGSILRIRFVFGWTDHFEGGSICMMMMGRILGVQFCHS
RRKCEGVGASSSEVGEDQYGEWLRVGVFLGEGIRREPEGDVHENEASVSESTVGEGLSVPSIDTQPESCTVEGGKKARWGHNVVVESGGSFQSAIDNCPKDYVAGRLERA
NSVLQQNWEQNCHITAHCVYLHRNSRVFYTIFLLTLGTQQLSYSPNHIDGWVDRGVSLWRFSRIYGFPQAGLKVMTWSLLKHLRGNPGTSWLVGGDFIAILSHQENEGGR
PKAEAELSSFHGAIDECELLDLGFKGATFTWCNGRSGEKTVWERIDRCLTNLALQELFPMHELTHMDFSRSDHRPLLLSMEAVSARWRTEAPILFARGVGISSFQPFHRE
EVLRALNHIHPNKAPRPDGLSGVFYQHSWSVVGPDVVNCCLNILNNRVSSAPLNETMIVLIPKKKSPKCITDYKPISLCNVWYKIMSKVLVNRMKGILNVLISHNQSAFI
PGHCVVDNTILGYECIHTLKAKQVGRTGKAYDRVKVFDKEIHFPRTCSYCVLKGSLALFFRAKEGEARVVHGILQYYERAFGQTINFDKSIISFSSSTIMGVQAQIQGWK
GKVFSVGGREVLLKSIVQAIPCYTMNCFCLPKKLIHDISRAMARFWWNGDKEDRRIHWVSWKTMCKPKSLGGRGFRDLEIFNQAFLAKQCWRIVQYPTSFPSRVLKGRYF
PGGDFLNVGVGSRPSYIWRSLIWGRELLGKGIRWRTGNKKKVRVRSPVTLEPDTRVADWCEEDLSTTARGQLGMVSDFSRNAVAFLENSSFSSWNEELLRHHFSPREVHF
ILTISLRHMLSKDRVIWHFEKCEVYSIKSGYRLSQMALLAQTSSSSSSESLSSWWKGCWKMGVPNQIKIFLWRLCLEQLSTVDNLSVRGIDVLNKFRGRVPLVGLVDWAV
NYLAVFRGACRIYSEGIRGVPWSAEMVDWSYNAILELIGCSGREKMQRNEKSKSGEKSNLGQQKASVETLALERLDAHIPYQIRRVKLTASRRYDRKRPDATVFPYSERA
YKRQRRDAVLTASRR