CuGenDBv2

Lag0022434 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0022434
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr7:28614328..28623516
RNA-Seq Expression: Lag0022434
Synteny: Lag0022434
Gene Ontology terms: NA
InterPro domains:
    IPR005162 - Retrotransposon gag domain
    IPR021109 - Aspartic peptidase domain superfamily
    IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
    IPR043502 - DNA/RNA polymerase superfamily
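
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python snippet below parses it and reports the length of the interval. The names `GenomicInterval` and `parse_location` are hypothetical helpers introduced only for this illustration, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """A chromosome name plus start and end coordinates (hypothetical helper)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomicInterval:
    """Parse a 'chrom:start..end' string into a GenomicInterval."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return GenomicInterval(chrom, int(start), int(end))


loc = parse_location("chr7:28614328..28623516")
print(loc.chrom, loc.start, loc.end, loc.span)  # chr7 28614328 28623516 9189
```

Under the inclusive-coordinate assumption, the locus of Lag0022434 covers 9189 positions on chr7. With 0-based, half-open coordinates the span would instead be `end - start`.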


Homology
GenBank top hits (e value, %identity, alignment)
GEZ31191.1 DNA-directed DNA polymerase [Tanacetum cinerariifolium]; e value: 2.5e-80; %identity: 31.06
Query:  VTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHF
        V  VP  T+KL  FPFSL+G+A  WLD  PP+SI +W DL  KFI +FFP +     R EII+F Q  NE+   +WERF+ L+R+CPHHG      L+ F
Subjt:  VTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHF

Query:  YSGLDQASKALVNA---STNGSFLMKSANEAHAILDTIATNNQHWGETELTILK------NPIKAVETKANSTMQTHIKAIHSKMFGLTMGNQANIAPAN
        Y+     SK   NA   S++ S        A ++ D +     H+ E  L  +K       PIKA +  A   M+          F   + +     P  
Subjt:  YSGLDQASKALVNA---STNGSFLMKSANEAHAILDTIATNNQHWGETELTILK------NPIKAVETKANSTMQTHIKAIHSKMFGLTMGNQANIAPAN

Query:  AISSPCCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIE
               D   E     L E   V+V  K+P  L D G F IPC     +   AL DLGAS N M  S++K+L +       + L+L +R+++ P G  E
Subjt:  AISSPCCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIE

Query:  NVLVKV----------------------------------------------------------------------------------------------
        NV VKV                                                                                              
Subjt:  NVLVKV----------------------------------------------------------------------------------------------

Query:  ----------------------------------------LSSHKKAICWTTAYIQGISPSFCMHRINLEED------SGWVSPVHCVPKKGGMTVVENK
                                                L S K+AI W    I+GI P FC H+I LEED        WVSPVH V KKGGMTVV N 
Subjt:  ----------------------------------------LSSHKKAICWTTAYIQGISPSFCMHRINLEED------SGWVSPVHCVPKKGGMTVVENK

Query:  KNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY-
        +N+L+PTR +TGWR                        ML+RL    YYCF DG+S Y QI I P+DQEKTTFTCPYGTFA++RMPF LCNA  TFQR  
Subjt:  KNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY-

Query:  -----------------------CKQAFETLKSALSSSLVMIEPDWTQPFELM--------NLILKSR--------------------------------
                               C QAF TLK  L+ +L++I P+W QPF+LM          +L  R                                
Subjt:  -----------------------CKQAFETLKSALSSSLVMIEPDWTQPFELM--------NLILKSR--------------------------------

Query:  ---------------RKGTKNQMEDHLSRL-GVETQVDKRLDIQESFADETI--LAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKD
                        KG KN   D LSRL      V    +I E F  ETI  LA      PWFAD+ NY          +TQQ  KF KD
Subjt:  ---------------RKGTKNQMEDHLSRL-GVETQVDKRLDIQESFADETI--LAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKD

XP_023521407.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111785222 [Cucurbita pepo subsp. pepo]; e value: 4.7e-79; %identity: 32.5
Query:  NESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETELTILKNPIKAVETKANSTMQTH
        +E+L  AWERF+ ++RKCPHHGLP  I +E FY+GL+ A+K +V+AS NG+ L K+ NEA+ IL+ IA+NN  W +      +     +E  A S++   
Subjt:  NESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETELTILKNPIKAVETKANSTMQTH

Query:  IKAIHSKMFGLTMGNQANI-------APANAISSPCCDIC-------------------------------REYETVALTECSNVLVKSKVPRNLKDQGS
        + ++ + +  L +G  + I       A  N  ++  C  C                                E++ V L E  + ++K+K+P   KD GS
Subjt:  IKAIHSKMFGLTMGNQANI-------APANAISSPCCDIC-------------------------------REYETVALTECSNVLVKSKVPRNLKDQGS

Query:  FTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVKV---------------------------------
        FTIP SIGG+ +GRALCDLG+S N M  S++K+LGIGEARP TVTLQL +RS T+P GKIE++L++V                                 
Subjt:  FTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVKV---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------LSSHKKAICWTTAYIQGISPSFCMHRINLEE--------------------------
                                                   L  HK AI WT A I+GISPS CMH+I LEE                          
Subjt:  -------------------------------------------LSSHKKAICWTTAYIQGISPSFCMHRINLEE--------------------------

Query:  -----------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPE
                   +S  VSP+ CVPKKGG+TV+ N+ NELIPTR + GWR                        MLDRLA K++YCF DGYS YNQITI+PE
Subjt:  -----------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPE

Query:  DQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFET
        DQEKTTFTCPYG FAFRRMPFGLCNAPATFQR C  A  T
Subjt:  DQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFET

XP_027090491.1 uncharacterized protein LOC113711532 [Coffea arabica]; e value: 8.9e-78; %identity: 27.7
Query:  FSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAW
        F GL  E+ +K  ++F  +C+S K  G+ E+ +K++ FPFSL+  A+ WL   PP SIT+W+ L +KF++K+FP++  A  R EI   +Q  +ESL   W
Subjt:  FSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAW

Query:  ERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETE-------------------------LTILK
        ERF++L  KCP H +   +++++FY  L    +++++A+  G+ + K+   A  +++ +A N+Q +G  E                         +  L+
Subjt:  ERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETE-------------------------LTILK

Query:  N-------PIKAVETKANSTMQTHIKAIHSKMFGLTM-------------------------------GNQANIAPAN----------------------
        N        I  +E++    + +  +A    +  +T+                               GN+    P+N                      
Subjt:  N-------PIKAVETKANSTMQTHIKAIHSKMFGLTM-------------------------------GNQANIAPAN----------------------

Query:  -------------AISSPCCDICREY--------------------ETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSY
                      I+ P  D  ++                     E V + E  + +++ K+P    D G FTIPC IG  ++G  + DLG S N M  
Subjt:  -------------AISSPCCDICREY--------------------ETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSY

Query:  SVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENV-----------------------------------------------------------------
        S++  L +G  +   + +QL +R+  +P G IE+V                                                                 
Subjt:  SVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENV-----------------------------------------------------------------

Query:  ---------------------------------------------LVKVLSSHKKAICWTTAYIQGISPSFCMHRINLEE--------------------
                                                     L +VL  HK+AI WT A I+GISP+ CMHRI LEE                    
Subjt:  ---------------------------------------------LVKVLSSHKKAICWTTAYIQGISPSFCMHRINLEE--------------------

Query:  -----------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQ
                         DS WVSPV  VPKK G+TV  N++ EL+P R  TGWR                        M++RLA + YYCF DG+S Y Q
Subjt:  -----------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQ

Query:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETLKSALSSSLVMIEPDWTQPFELM
        I IAP+DQEKTTFTCP+GTFA+RRMPFGLCNAPATFQR C  AF  LK  L++S ++  PDW  PFE+M
Subjt:  ITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETLKSALSSSLVMIEPDWTQPFELM

XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa]; e value: 4.4e-85; %identity: 31.13
Query:  VTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHF
        + GV +D ++L+ FPFSL+  A++W  S P +SI +W +LA KF+ KFFP    AK R +I +F Q  +ESL  AWERF+ L+RKCP+HG+  ++ + +F
Subjt:  VTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHF

Query:  YSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGE-----------------TELTILKNPIKA------------------------VET
        Y+GL   ++ L++A+  G+F+ KSANEA  +L+ +A  NQ W                   T+LT  +N  ++                         ET
Subjt:  YSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGE-----------------TELTILKNPIKA------------------------VET

Query:  KAN-STMQTHIKAIHSKMFGLTMGNQANIAPAN------AI-----------------------------------------SSPCCDI-----------
        +++   +QT +  + +++     GN  +    N      AI                                         +SP   I           
Subjt:  KAN-STMQTHIKAIHSKMFGLTMGNQANIAPAN------AI-----------------------------------------SSPCCDI-----------

Query:  -------------------------------------------------CREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGA
                                                           ++ETVALTE  + +++ K+P  LKD GSFTIPC+IG      ALCDLGA
Subjt:  -------------------------------------------------CREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGA

Query:  STNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVK-------------------------------------------------------
        S N M  SVFK+L +GEA+P TVTLQL +RSL HP G IE+VLVK                                                       
Subjt:  STNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVK-------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------VLSSHKKAICWT
                                                                                                VL  HKKAI WT
Subjt:  ----------------------------------------------------------------------------------------VLSSHKKAICWT

Query:  TAYIQGISPSFCMHRINLEE-------------------------------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------
         A I+GISPS  MHRI +EE                                     DS WVSPV  VPKKGGMTVV+N+KNELIPTRT+TGWR      
Subjt:  TAYIQGISPSFCMHRINLEE-------------------------------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------

Query:  ------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETL
                          MLD+LA + YYCF DGYS Y+QI IAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR     F  L
Subjt:  ------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETL

XP_034899370.1 LOW QUALITY PROTEIN: uncharacterized protein LOC118037487 [Populus alba]; e value: 1.4e-78; %identity: 27.29
Query:  EFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTA
        +F GL ++D +     FL IC +FK  GV +D ++L+ FPFSL+  A+ WL+S P +S+ SW DLA+KF+ KFFP    AK R+EI +F Q  +E L   
Subjt:  EFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTA

Query:  WERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGE-------------------------------
        WER++ L+R+CPHHGLP ++ +++FY+GL+ +++ L++A++ G+F+ KS ++A+ +L+ +A NN  W                                 
Subjt:  WERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGE-------------------------------

Query:  --TEL-----------------------------------------------------------------TILKNP------------------------
          T+L                                                                 T++KNP                        
Subjt:  --TEL-----------------------------------------------------------------TILKNP------------------------

Query:  -IKAVETKANSTMQTHIKAIHS----------------------------------------KMFGLTMGNQA--------------NIAPANAISSPCC
              T+ N+ +Q    +I +                                        K    T GN++              N+  ++ +  P  
Subjt:  -IKAVETKANSTMQTHIKAIHS----------------------------------------KMFGLTMGNQA--------------NIAPANAISSPCC

Query:  DICR-------------------------------------------EYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNR
        +I +                                           EYETVALTE  + +++ K+P  LKD GSFTIPCSIG     +ALCDLGAS N 
Subjt:  DICR-------------------------------------------EYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNR

Query:  MSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVK-----------------------------------------------------------
        M  S+FK+LG+GEARP TVTLQL +RSL HP G IE+VLVK                                                           
Subjt:  MSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVK-----------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------VLSSHKKAICWTTAYIQGISPSFCMHRINLEEDSGWVSPV---------HCVPKKGGMTVVENKKNELIPTRTITGW
                               VL  HK A+ W  A I+GISPS CMH+I LE++   V P               KGGMTVV++  N LIPTR +TGW
Subjt:  -----------------------VLSSHKKAICWTTAYIQGISPSFCMHRINLEEDSGWVSPV---------HCVPKKGGMTVVENKKNELIPTRTITGW

Query:  R------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETL
        R                        MLDRLA   YYCF DGYS YNQI IAPEDQEKTTFTCPYGTF FRRMPFGLCNAPATFQR     F  +
Subjt:  R------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETL

TrEMBL top hits (e value, %identity, alignment)
A0A2G9HH15 Reverse transcriptase; e value: 2.1e-77; %identity: 27.72
Query:  EFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTA
        +F GL +E+ ++    FL+IC + +  GV +D L+L+ F FSL GDA  W +S P +SIT+W  L E+FI KFF     A  R EI++FRQ  +E++  A
Subjt:  EFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTA

Query:  WERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNN------------------------------------
        W RF++++R CP+H +P  I +  FY GL    K  ++     SFL  +  E H +L+ +  N+                                    
Subjt:  WERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNN------------------------------------

Query:  --------QHWGET------------------ELTILKNPIK--------------------------------------------AVETKANSTMQTHI
                QH   T                   +  + N  K                                             ++ K  S  +T I
Subjt:  --------QHWGET------------------ELTILKNPIK--------------------------------------------AVETKANSTMQTHI

Query:  K-----AIHSKMFGLTMGNQANI--------APANAISSPCCDI---CR---------------------------------------------------
        +     A + KM    +G  AN          P+N   +P  D    C+                                                   
Subjt:  K-----AIHSKMFGLTMGNQANI--------APANAISSPCCDI---CR---------------------------------------------------

Query:  -------------EYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGK
                     +YETVALTE  + ++++K+P  LKD GSFTIPC+IG    GRALCDLGAS N M YS+++ LG+GEA+P ++TLQL +RSLT+P G 
Subjt:  -------------EYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGK

Query:  IENVLVK---------------------------------------------------------------------------------------------
        IE++LVK                                                                                             
Subjt:  IENVLVK---------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------VLSSHKK
                                                                                                     VL +HK 
Subjt:  ---------------------------------------------------------------------------------------------VLSSHKK

Query:  AICWTTAYIQGISPSFCMHRINLEE-------------------------------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR-
        AI WT A I+GISPSFCMH+I LE+                                     DS WVSPV CVPKKGG+TVV N  NELIPTRT+TGWR 
Subjt:  AICWTTAYIQGISPSFCMHRINLEE-------------------------------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR-

Query:  -----------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFET
                               MLDRLA K +YCF DGYS YNQI I PEDQEKTTFTCPYGTF FR+MPFGLCNAPATFQR C  A  T
Subjt:  -----------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFET

A0A6A3BRM8 Reverse transcriptase; e value: 2.1e-77; %identity: 25.04
Query:  LDVLDGEEFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSY
        LD L+   F G+  ED  +  + FL +C SF+  GV ED LKLK FP+SL+  A AWL   P  S+ SW DL + F+ ++ P N N + R EI SFRQ  
Subjt:  LDVLDGEEFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSY

Query:  NESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETELTILKNPIKAVETKANSTMQTH
        +ES+   W+R++ L+RKC +HG   +  +  FY+G++  ++ L++AS NG+ L KS  EA AILD IA N+  +  + L   +    A E +A  ++ T 
Subjt:  NESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETELTILKNPIKAVETKANSTMQTH

Query:  IKAIHSKMFGLTMGNQANIAP-ANAISS-------------------------------------PC---------------------------------
        + AI + +  L      N  P +N  ++                                     PC                                 
Subjt:  IKAIHSKMFGLTMGNQANIAP-ANAISS-------------------------------------PC---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------CDIC---REYETVA-LTE-CSNVLVKSKVPRNLKDQGSFTIPCSIG
                                                               DIC   R+ ETVA  TE CS++   SK+P    D GSF IPCSIG
Subjt:  ------------------------------------------------------CDIC---REYETVA-LTE-CSNVLVKSKVPRNLKDQGSFTIPCSIG

Query:  GRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVK------------------------------------------
            G+ALCDLG+S N M  S+F +LGIG+ARP +V LQL ++S   P G++E+V+V+                                          
Subjt:  GRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVK------------------------------------------

Query:  ------------------------------------------------------------------------------------------VLSSHKKAIC
                                                                                                  VL  HKKAI 
Subjt:  ------------------------------------------------------------------------------------------VLSSHKKAIC

Query:  WTTAYIQGISPSFCMHRINLEE-------------------------------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR----
        WT    +GISP+ CMH+I LE+                                     +S WVSPV C+PKK G TVV N+ NEL+PTRT+TGWR    
Subjt:  WTTAYIQGISPSFCMHRINLEE-------------------------------------DSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR----

Query:  --------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQ----------------
                            MLDRLA K +YCF DGYS YNQI IAPEDQE TTFTCPYGTFAFRRMPFGLCNAPATFQR C Q                
Subjt:  --------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQ----------------

Query:  ----------------------------------------------------------------------------AFETLKSALSSSLVMIEPDWTQPF
                                                                                    AF  LK  L S+ +++ PDWT  F
Subjt:  ----------------------------------------------------------------------------AFETLKSALSSSLVMIEPDWTQPF

Query:  ELM--------NLILKSR----------------------------------------------------------------------------------
        ELM         ++L  R                                                                                  
Subjt:  ELM--------NLILKSR----------------------------------------------------------------------------------

Query:  ----RKGTKNQMEDHLSRLGVETQVDKRLDIQESFADETILAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKD
            RKGT+NQ+ DHLSRL   ++    +DIQE F DE IL      IPW+AD VN+LVSG+ P +  +Q   KF  D
Subjt:  ----RKGTKNQMEDHLSRLGVETQVDKRLDIQESFADETILAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKD

A0A6L2JHA4 Reverse transcriptase domain-containing protein; e value: 8.4e-74; %identity: 27.94
Query:  VPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSG
        VP D +KLK FP+SL+G+A  W D  PPNSI +W DL  KF+ +FFP +     + EI  F Q + E+   AWERF+ ++R CPHHG    + ++ F +G
Subjt:  VPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSG

Query:  LDQASKALVNASTNGSFLMKSANEAHAIL-------------------------------------DTIAT-----------------------------
        L+   +  +NA+  G+ L K+  EA  I+                                     D I+T                             
Subjt:  LDQASKALVNASTNGSFLMKSANEAHAIL-------------------------------------DTIAT-----------------------------

Query:  ---------------------------NNQHWG-----------------ETELTILKNPIKAVETKANSTMQTHIK-----------AIHSKMFGLTMG
                                   NN +WG                 +  +  L+N   +++ +  +  Q  +K           A  + +  +  G
Subjt:  ---------------------------NNQHWG-----------------ETELTILKNPIKAVETKANSTMQTHIK-----------AIHSKMFGLTMG

Query:  ---NQANIA---PANAISSP-----------------------------CCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALC
           NQ++     P+N I +P                               D   E   + L E  +V++  K+P  L D G F IPC   G +V  AL 
Subjt:  ---NQANIA---PANAISSP-----------------------------CCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALC

Query:  DLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENV------------------------------------------------------
        DL AS NRM  S++K+L + E  P  +TL+L +RS+THP G  E+V                                                      
Subjt:  DLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENV------------------------------------------------------

Query:  ---------------------------LVKVLSSHKKAICWTTAYIQGISPSFCMHRINLEEDSGWV---SPVHCVPKKGGMTVVENKKNELIPTRTITG
                                   L+KVL SHK+AI W    I+GI P FC H+I +EED   V   SP+HCVPKKGG+TVVEN+ NELIPTR +T 
Subjt:  ---------------------------LVKVLSSHKKAICWTTAYIQGISPSFCMHRINLEEDSGWV---SPVHCVPKKGGMTVVENKKNELIPTRTITG

Query:  WR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETLKSALS
        WR                        ML+RLA   +YCF DG+S Y QI I P  QEKTTFTCP GTFA+RRMPFG+ NAP TFQR              
Subjt:  WR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETLKSALS

Query:  SSLVMIEPDWTQPFELMNLILKSRRKGTKNQMEDHLSRLGVETQVDKRLDIQESFADETILAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKD
                                    +N  +D          V +  DI E+F  ET+  +      WFA++ N+           +QQ KKF KD
Subjt:  SSLVMIEPDWTQPFELMNLILKSRRKGTKNQMEDHLSRLGVETQVDKRLDIQESFADETILAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKD

A0A6L2JJI0 Reverse transcriptase domain-containing protein (Fragment); e value: 2.9e-74; %identity: 30.57
Query:  LDVLDGEEFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSY
        ++++  ++F G   ED H   + F +I S+ +V  VP  ++KL  FPFSL+G A  WL+  PP SI +W+DL  KFI +FFP +     R +I  F+Q +
Subjt:  LDVLDGEEFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSY

Query:  NESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGS-------------------FLMKSANEAHA-------------------
        +ES    WERF  L+R CPHHG      L+ FY+ L+   +  +N++T  S                    L+   N++ A                   
Subjt:  NESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGS-------------------FLMKSANEAHA-------------------

Query:  ----ILDTIATN-----------NQHWGETELTILKN------------PIKAV-------------------ETKANSTMQTHIKAIHSKMFGLTMGNQ
            +++T   N           +Q+    + +++++            P+ A+                     +AN  ++ + +      F ++  + 
Subjt:  ----ILDTIATN-----------NQHWGETELTILKN------------PIKAV-------------------ETKANSTMQTHIKAIHSKMFGLTMGNQ

Query:  ANIAPANAISSPCC----DICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVE
          + P  A +        +   E     + E  + ++ +K+PR L D   F IPC   G +   AL DLGAS N M +S+++ L + E  P  +TL+L +
Subjt:  ANIAPANAISSPCC----DICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVE

Query:  RSLTHPIGKIENVLVKV-----------------------------------LSSHK---------KAICWTTAYIQGISPSFCMHRINLEED-------
        RS++ PIG  ++V  KV                                   +  HK         +AI +      GI+P FC H+I +EED       
Subjt:  RSLTHPIGKIENVLVKV-----------------------------------LSSHK---------KAICWTTAYIQGISPSFCMHRINLEED-------

Query:  --------SGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEK
                   ++PVHCVPKKGG T+VEN++NELI TR +TGWR                        ML+RLA   YYCF DG+  Y QI I P DQEK
Subjt:  --------SGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEK

Query:  TTFTCPYGTFAFRRMPFGLCNAPATFQR
        TTFTCPYGTFA+RRMPFGLCNAP TFQR
Subjt:  TTFTCPYGTFAFRRMPFGLCNAPATFQR

A0A6L2P089 Reverse transcriptase domain-containing protein; e value: 2.6e-75; %identity: 33.87
Query:  ICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAF
        I S+ K   VP D +KL  FP+SL+G A  W D  PPNSI +W DL  KF+ + FP +     + EI  F Q + E+   AWERF+ ++R CPHHG    
Subjt:  ICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSITSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAF

Query:  IILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETELTILKNPIKAVETKAN-STMQTHIKAIHSKMFGLTMGNQANIAPANAI
          +  FY GL+   +  +N +  G+ L K+  EA  I++                 K+ ++    K N S M T  +   SKM              + +
Subjt:  IILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGETELTILKNPIKAVETKAN-STMQTHIKAIHSKMFGLTMGNQANIAPANAI

Query:  SSPCCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENV
             D     + + L +     V    P    ++   T  C+    N     CD   +TNR   SV    G+ +   + V   L +          +  
Subjt:  SSPCCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTNRMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENV

Query:  LVKVLSSHKKAICWTTAYIQGISPSFCMH----------------RINLE---------------------EDSGWVSPVHCVPKKGGMTVVENKKNELI
        L+KVL SHK+AI W    I+GI P FC H                R+NL+                      DS WVSP+HCVPKKGG+TVVEN+ NELI
Subjt:  LVKVLSSHKKAICWTTAYIQGISPSFCMH----------------RINLE---------------------EDSGWVSPVHCVPKKGGMTVVENKKNELI

Query:  PTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY------
        PTR +TGWR                        ML+RL    + CF DG+S Y QI I P DQEKTTFTCPYGTFA+R+MPFG CN+P TFQR       
Subjt:  PTRTITGWR------------------------MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRY------

Query:  ----------------------------CKQAFETLKSALSSSLVMIEPDWTQPFELM
                                    C  AFETLK  L+ +L+++ PDW  PFELM
Subjt:  ----------------------------CKQAFETLKSALSSSLVMIEPDWTQPFELM

SwissProt top hits (e value, %identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6; e value: 2.9e-07; %identity: 35.71
Query:  DSGWVSPVHCVPKKGGMT-------VVENKK-NEL-------IPTRTITGWRMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMP
        +S + SP+  VPKK   +       V++ +K NE+       IP        +L +L   NY+   D    ++QI + PE   KT F+  +G + + RMP
Subjt:  DSGWVSPVHCVPKKGGMT-------VVENKK-NEL-------IPTRTITGWRMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMP

Query:  FGLCNAPATFQR
        FGL NAPATFQR
Subjt:  FGLCNAPATFQR

P20825 Retrovirus-related Pol polyprotein from transposon 297; e value: 5.5e-06; %identity: 37.27
Query:  EEDSGWVSPVHCVPKKGGMT-------VVENKK-NEL-IPTR-TITGW-RMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFG
        E +S + SP   VPKK   +       V++ +K NE+ IP R  I     +L +L    Y+   D    ++QI +  E   KT F+   G + + RMPFG
Subjt:  EEDSGWVSPVHCVPKKGGMT-------VVENKK-NEL-IPTR-TITGW-RMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFG

Query:  LCNAPATFQR
        L NAPATFQR
Subjt:  LCNAPATFQR

P31843 RNA-directed DNA polymerase homolog; e value: 5.5e-06; %identity: 50
Query:  MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATF
        + DRLA   ++   D  S Y Q+ IA  D+ KTT    YG+F FR MPFGL NA ATF
Subjt:  MLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein; e value: 4.1e-09; %identity: 35.45
Query:  SPVHCVPKKGG--------MTVVENKKNELIPTRTITGWRMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR
        SPV  VPKK G         T+ +   ++  P   I    +L R+     +   D +S Y+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R
Subjt:  SPVHCVPKKGG--------MTVVENKKNELIPTRTITGWRMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR

Query:  YCKQAFETLK
        Y    F  L+
Subjt:  YCKQAFETLK

Q99315 Transposon Ty3-G Gag-Pol polyprotein; e value: 4.1e-09; %identity: 35.45
Query:  SPVHCVPKKGG--------MTVVENKKNELIPTRTITGWRMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR
        SPV  VPKK G         T+ +   ++  P   I    +L R+     +   D +S Y+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R
Subjt:  SPVHCVPKKGG--------MTVVENKKNELIPTRTITGWRMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQR

Query:  YCKQAFETLK
        Y    F  L+
Subjt:  YCKQAFETLK

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCACGTACATGGATGCTTTGGATCATTGCATCTGTATCGAATACAAGGTTAATTGAGCTTAGGTCGTGTCGGGAGTCGATCACGCTAATTGGTGTTAATTCAAGCCA
ATTACAGAATTTTTGGATCCATCTTGATGTCTTGGATGGAGAGGAGTTTTCAGGACTTTTGAATGAAGATCTGCATAAGGATACCAAGCAATTTTTAAGAATTTGTAGTT
CGTTTAAAGTAACAGGAGTACCAGAAGACACATTAAAACTAAAATGTTTTCCTTTTTCATTACAAGGAGATGCAGAAGCTTGGTTAGATTCATTTCCACCAAACTCCATC
ACATCTTGGAATGACTTGGCAGAAAAGTTTATAGAGAAGTTTTTTCCGTCCAACAATAATGCTAAGTATAGAGTAGAGATCATTTCCTTCAGACAATCATACAATGAGTC
TTTAGACACAGCATGGGAACGATTCCAAAGGCTGGTTCGAAAGTGTCCTCATCATGGGTTGCCTGCTTTCATCATTCTTGAGCATTTCTATAGTGGGCTTGATCAAGCTT
CAAAAGCCTTAGTCAATGCGTCTACAAATGGATCTTTCCTGATGAAATCCGCAAACGAAGCACATGCCATACTGGATACAATAGCTACGAATAACCAACATTGGGGGGAG
ACTGAACTTACAATTTTGAAGAATCCAATAAAAGCTGTAGAGACGAAAGCAAATTCAACCATGCAAACTCATATTAAAGCTATTCATAGCAAGATGTTTGGCTTGACTAT
GGGAAATCAAGCAAACATCGCTCCCGCAAATGCTATCTCTTCTCCTTGTTGTGATATTTGTAGGGAATACGAGACAGTTGCATTAACAGAATGCTCCAATGTGTTGGTTA
AAAGCAAGGTTCCTCGTAATTTAAAAGATCAAGGAAGTTTTACTATCCCTTGCTCCATAGGAGGACGGAACGTTGGCAGAGCACTATGCGATCTTGGAGCCAGTACCAAT
CGGATGTCGTATTCGGTCTTTAAGCAGTTAGGAATAGGAGAAGCCAGGCCGATGACTGTCACCTTGCAACTAGTAGAGAGATCATTAACACATCCCATTGGCAAAATAGA
GAATGTACTAGTCAAGGTCCTATCTAGCCACAAAAAGGCGATATGTTGGACAACTGCATATATTCAGGGAATCAGTCCATCGTTTTGCATGCATAGGATCAATCTAGAAG
AAGACAGTGGCTGGGTCAGCCCAGTGCATTGCGTCCCTAAGAAAGGTGGAATGACTGTAGTTGAAAATAAGAAGAATGAATTGATTCCAACGAGAACGATCACGGGATGG
CGAATGTTGGATAGATTGGCATGGAAAAATTACTACTGCTTCCATGATGGCTACTCAACATACAACCAAATTACTATAGCCCCAGAAGATCAAGAAAAGACCACATTCAC
ATGTCCATACGGGACATTCGCTTTTCGAAGGATGCCATTTGGACTCTGTAACGCTCCAGCCACCTTTCAAAGGTATTGCAAGCAAGCTTTTGAGACTCTAAAGAGTGCAC
TTAGTTCATCTCTAGTCATGATTGAACCAGACTGGACTCAGCCTTTCGAACTAATGAATTTGATATTGAAATCCAGGAGGAAAGGGACCAAAAATCAAATGGAAGATCAT
CTGTCAAGGCTTGGGGTAGAGACACAGGTGGATAAGAGGTTGGACATTCAAGAGTCGTTTGCAGATGAAACGATATTGGCAGTAAAGGTAATTGAGATTCCATGGTTTGC
AGACTACGTGAACTACTTAGTCAGTGGACTAAAGCCTCCAGAAGCCACAACACAACAACTGAAAAAGTTTCTGAAAGATGACAATGCCCCCATTCTCCCCCAAACTTACA
CTGGAGCGCAAGGAATTATGGAGATCAAGGCTGAGTTGCGCGAGGTGGAGGCATGGTTCCGCGCATTTGTGATGGGCCTGCGCCTTGACGTCTTTGTTGTCCAACTGGCT
GAGCGAACAGTTCGAACTGTTATTGGCGCAGCAACTCCTGCAGTTGGCAAATTCGCTCTTGCTGCAATTGCAAGGACTCTACGACCTCTTGAATTATCTGCCGCATCTGA
GAAAAGATCAGAGTTTCGTCTACATCAATGGTTAGATCAGCCGCAGGCTGACTTGTGCTCGCAGATGCTGCGTGTTGTGTTGGATTGGCGCAGGTCTTCTGTGGGCTGCG
GAAGAGGTGGGTTCAGGTCAATGGGCGGTGATGGTGGTGGGGCAAGTTTTCTTGCGATGGTGGCGAGGAACCCTGAGTTGGCGCAGTCAGTGCTTTCCTGCGCTGATGGA
GTTGAATACGGGAAATTGTTGATTGGAGCTAAAATTTAA
mRNA sequence
ATGTCACGTACATGGATGCTTTGGATCATTGCATCTGTATCGAATACAAGGTTAATTGAGCTTAGGTCGTGTCGGGAGTCGATCACGCTAATTGGTGTTAATTCAAGCCA
ATTACAGAATTTTTGGATCCATCTTGATGTCTTGGATGGAGAGGAGTTTTCAGGACTTTTGAATGAAGATCTGCATAAGGATACCAAGCAATTTTTAAGAATTTGTAGTT
CGTTTAAAGTAACAGGAGTACCAGAAGACACATTAAAACTAAAATGTTTTCCTTTTTCATTACAAGGAGATGCAGAAGCTTGGTTAGATTCATTTCCACCAAACTCCATC
ACATCTTGGAATGACTTGGCAGAAAAGTTTATAGAGAAGTTTTTTCCGTCCAACAATAATGCTAAGTATAGAGTAGAGATCATTTCCTTCAGACAATCATACAATGAGTC
TTTAGACACAGCATGGGAACGATTCCAAAGGCTGGTTCGAAAGTGTCCTCATCATGGGTTGCCTGCTTTCATCATTCTTGAGCATTTCTATAGTGGGCTTGATCAAGCTT
CAAAAGCCTTAGTCAATGCGTCTACAAATGGATCTTTCCTGATGAAATCCGCAAACGAAGCACATGCCATACTGGATACAATAGCTACGAATAACCAACATTGGGGGGAG
ACTGAACTTACAATTTTGAAGAATCCAATAAAAGCTGTAGAGACGAAAGCAAATTCAACCATGCAAACTCATATTAAAGCTATTCATAGCAAGATGTTTGGCTTGACTAT
GGGAAATCAAGCAAACATCGCTCCCGCAAATGCTATCTCTTCTCCTTGTTGTGATATTTGTAGGGAATACGAGACAGTTGCATTAACAGAATGCTCCAATGTGTTGGTTA
AAAGCAAGGTTCCTCGTAATTTAAAAGATCAAGGAAGTTTTACTATCCCTTGCTCCATAGGAGGACGGAACGTTGGCAGAGCACTATGCGATCTTGGAGCCAGTACCAAT
CGGATGTCGTATTCGGTCTTTAAGCAGTTAGGAATAGGAGAAGCCAGGCCGATGACTGTCACCTTGCAACTAGTAGAGAGATCATTAACACATCCCATTGGCAAAATAGA
GAATGTACTAGTCAAGGTCCTATCTAGCCACAAAAAGGCGATATGTTGGACAACTGCATATATTCAGGGAATCAGTCCATCGTTTTGCATGCATAGGATCAATCTAGAAG
AAGACAGTGGCTGGGTCAGCCCAGTGCATTGCGTCCCTAAGAAAGGTGGAATGACTGTAGTTGAAAATAAGAAGAATGAATTGATTCCAACGAGAACGATCACGGGATGG
CGAATGTTGGATAGATTGGCATGGAAAAATTACTACTGCTTCCATGATGGCTACTCAACATACAACCAAATTACTATAGCCCCAGAAGATCAAGAAAAGACCACATTCAC
ATGTCCATACGGGACATTCGCTTTTCGAAGGATGCCATTTGGACTCTGTAACGCTCCAGCCACCTTTCAAAGGTATTGCAAGCAAGCTTTTGAGACTCTAAAGAGTGCAC
TTAGTTCATCTCTAGTCATGATTGAACCAGACTGGACTCAGCCTTTCGAACTAATGAATTTGATATTGAAATCCAGGAGGAAAGGGACCAAAAATCAAATGGAAGATCAT
CTGTCAAGGCTTGGGGTAGAGACACAGGTGGATAAGAGGTTGGACATTCAAGAGTCGTTTGCAGATGAAACGATATTGGCAGTAAAGGTAATTGAGATTCCATGGTTTGC
AGACTACGTGAACTACTTAGTCAGTGGACTAAAGCCTCCAGAAGCCACAACACAACAACTGAAAAAGTTTCTGAAAGATGACAATGCCCCCATTCTCCCCCAAACTTACA
CTGGAGCGCAAGGAATTATGGAGATCAAGGCTGAGTTGCGCGAGGTGGAGGCATGGTTCCGCGCATTTGTGATGGGCCTGCGCCTTGACGTCTTTGTTGTCCAACTGGCT
GAGCGAACAGTTCGAACTGTTATTGGCGCAGCAACTCCTGCAGTTGGCAAATTCGCTCTTGCTGCAATTGCAAGGACTCTACGACCTCTTGAATTATCTGCCGCATCTGA
GAAAAGATCAGAGTTTCGTCTACATCAATGGTTAGATCAGCCGCAGGCTGACTTGTGCTCGCAGATGCTGCGTGTTGTGTTGGATTGGCGCAGGTCTTCTGTGGGCTGCG
GAAGAGGTGGGTTCAGGTCAATGGGCGGTGATGGTGGTGGGGCAAGTTTTCTTGCGATGGTGGCGAGGAACCCTGAGTTGGCGCAGTCAGTGCTTTCCTGCGCTGATGGA
GTTGAATACGGGAAATTGTTGATTGGAGCTAAAATTTAA
Protein sequence
MSRTWMLWIIASVSNTRLIELRSCRESITLIGVNSSQLQNFWIHLDVLDGEEFSGLLNEDLHKDTKQFLRICSSFKVTGVPEDTLKLKCFPFSLQGDAEAWLDSFPPNSI
TSWNDLAEKFIEKFFPSNNNAKYRVEIISFRQSYNESLDTAWERFQRLVRKCPHHGLPAFIILEHFYSGLDQASKALVNASTNGSFLMKSANEAHAILDTIATNNQHWGE
TELTILKNPIKAVETKANSTMQTHIKAIHSKMFGLTMGNQANIAPANAISSPCCDICREYETVALTECSNVLVKSKVPRNLKDQGSFTIPCSIGGRNVGRALCDLGASTN
RMSYSVFKQLGIGEARPMTVTLQLVERSLTHPIGKIENVLVKVLSSHKKAICWTTAYIQGISPSFCMHRINLEEDSGWVSPVHCVPKKGGMTVVENKKNELIPTRTITGW
RMLDRLAWKNYYCFHDGYSTYNQITIAPEDQEKTTFTCPYGTFAFRRMPFGLCNAPATFQRYCKQAFETLKSALSSSLVMIEPDWTQPFELMNLILKSRRKGTKNQMEDH
LSRLGVETQVDKRLDIQESFADETILAVKVIEIPWFADYVNYLVSGLKPPEATTQQLKKFLKDDNAPILPQTYTGAQGIMEIKAELREVEAWFRAFVMGLRLDVFVVQLA
ERTVRTVIGAATPAVGKFALAAIARTLRPLELSAASEKRSEFRLHQWLDQPQADLCSQMLRVVLDWRRSSVGCGRGGFRSMGGDGGGASFLAMVARNPELAQSVLSCADG
VEYGKLLIGAKI