; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014390 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014390
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr12:265608..268781
RNA-Seq ExpressionLag0014390
SyntenyLag0014390
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN67073.1 hypothetical protein VITISV_011746 [Vitis vinifera]9.1e-3826.36Show/hide
Query:  VLTVKLNENNYLLWREMVLAILRGQKALSGWVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENL
        ++++KL++ NYL+W   ++ +L                   A+     +++EVW  L   +++  R RV+ L+  LQ+  +GSMK  ++L   K  S+ L
Subjt:  VLTVKLNENNYLLWREMVLAILRGQKALSGWVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENL

Query:  QLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVG
           G P+  DDL+ Y+     ++   +V     +                              +  W ADSGA  HI A++ ++ ++  YTG E++ VG
Subjt:  QLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVG

Query:  NGTKLDIY----------QDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPN-LFLVAAS---------------GLNKSSIVSSKFVSRCLG-MFQL
        NG  L I           + K + K +LH        Q    L+  +Q  ++ N LF++  +               G ++  +   +  S  +     L
Subjt:  NGTKLDIY----------QDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPN-LFLVAAS---------------GLNKSSIVSSKFVSRCLG-MFQL

Query:  ENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVV
             I    S+WH RLGHAS  +V+ ++   +L    +  K  FC +CQLG     KD                                    S +  
Subjt:  ENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVV

Query:  LHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTY-
        L  LP P S  +S+ P   P P                 AP L     SSS + S                    HHPM T + +G  +P+ +  F  Y 
Subjt:  LHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTY-

Query:  ------VGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSP
                S+      P S ++A   P W VAM  E  A+  N+TW   P P   N++G+KWV KVK  S G VDR KA +VA GF Q  GIDF ETFSP
Subjt:  ------VGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSP

Query:  VVKTPTI
        V+K  T+
Subjt:  VVKTPTI

OMO62750.1 hypothetical protein CCACVL1_22657 [Corchorus capsularis]1.5e-3726.54Show/hide
Query:  LRGQKALSGWVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLD
        LR  K +   +  S + A+   V S  TS + W+ L K+++   R+R+  L+  L STK+ +  +AEY   MKQ  + L L G  I  DD +LY+L GL 
Subjt:  LRGQKALSGWVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLD

Query:  SEYILIVCSIEDKDITTWQE-----LALFRLHLQMDFSSI---YSTLEI---------------LNDPK-------------------------------
         E+  I  +I  ++     E     L  F   L+ +  S     ST  +                N P                                
Subjt:  SEYILIVCSIEDKDITTWQE-----LALFRLHLQMDFSSI---YSTLEI---------------LNDPK-------------------------------

Query:  -------------------------WLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVN
                                 W  DSGA++H+ AD+ N+ + ++Y G E++ +G+G+ L+I   K   K++  +    F  Q  L + Q +Q++V+
Subjt:  -------------------------WLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVN

Query:  PNLFLVAASGLNKSSIVSSKFVSRCLGMFQLEN-----------------SKSI-VEPKSL---------WHRRLGHASESVVNTVIKAWNLSASFNEKF
         + F        KS+ VS +F +    +  L+                  SKS+ ++P  +         WH+RLGH+S+ ++N  IK ++L    NE F
Subjt:  PNLFLVAASGLNKSSIVSSKFVSRCLGMFQLEN-----------------SKSI-VEPKSL---------WHRRLGHASESVVNTVIKAWNLSASFNEKF

Query:  AFCAACQLGTT----FLLKDISAKVLVV---GF----------SFLVMLYSMKL-----SFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPS
          C +C    +    F +  +          GF          + L +L+   L     S+ F++ + + +  PS N+        QS+    +    P+
Subjt:  AFCAACQLGTT----FLLKDISAKVLVV---GF----------SFLVMLYSMKL-----SFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPS

Query:  PLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVV---VSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALG
        P    E    P N          +S+E+ + D S    IP+ VV      N      +HPM T   + I +P N   FL        I S+P  V +A+ 
Subjt:  PLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVV---VSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALG

Query:  SPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        +P W++AM+ EI A+ +N TW   P  P  N++G KW+ ++K + D S+ + KA +VAKGF+Q PGIDF ETFSP VK  TI
Subjt:  SPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

RVW41854.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]8.2e-3925.2Show/hide
Query:  VLTVKLNENNYLLWREMVLAILR--------------GQKALSG------------------WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRAR
        ++++KL++ NYL+W   ++ +L+                K+L G                  W+  +++  + A+V    +++EVW  L   +++  R R
Subjt:  VLTVKLNENNYLLWREMVLAILR--------------GQKALSG------------------WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRAR

Query:  VNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSI----EDKDIT---TWQELALFRLHL------------
        V+ L+  LQ+  +GSMK  ++L   K  S+ L   G P+  DDL+ Y++ GL+  +   + S+     DK+++      EL  + L L            
Subjt:  VNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSI----EDKDIT---TWQELALFRLHL------------

Query:  ------------------------------------------------------QMDFSSIYSTLEIL--------------------------------
                                                              Q  FS+  S  +I                                 
Subjt:  ------------------------------------------------------QMDFSSIYSTLEIL--------------------------------

Query:  ------NDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIY----------QDKASRKVILH--------ETLKDFLYQLELLLIQSSQS
               +  W ADSGA  HI A++ ++ ++  YTG E++VVGNG  L I           + K + K +LH         ++  F      L I +   
Subjt:  ------NDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIY----------QDKASRKVILH--------ETLKDFLYQLELLLIQSSQS

Query:  IVNPNLFLVAASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPK---SLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTFLLKDISAK
            ++   A     +S         R + + +     ++V  K   S+WH RLGHAS  +V+ ++   +L    +  K  FC  CQLG   + + +   
Subjt:  IVNPNLFLVAASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPK---SLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTFLLKDISAK

Query:  VLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLS------SQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTS
                       + +FP K GL  +   P  +     LP P S SLS      S P+   SP S  E   +PT  SP ++ L       S I G   
Subjt:  VLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLS------SQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTS

Query:  FPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTY-------VGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNL
         P P L +  F++     HHPM T + +G  +P+ +  F  Y         S+      P S ++A   P W VAM  E  A+  N+TW   P P   N+
Subjt:  FPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTY-------VGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNL

Query:  IGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        +G+KWV KVK  S G VDR KA +VA GF Q  GIDF ETFSPV+K  T+
Subjt:  IGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

RVW62129.1 Retrovirus-related Pol polyprotein from transposon RE2 [Vitis vinifera]2.5e-4327.8Show/hide
Query:  VLTVKLNENNYLLWREMVLAILRG--------------QKALSG------------------WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRAR
        ++++KL++ NYL+W   ++ +L+                K+L G                  W+  +++  + A V    +++EVW  L   +++  R R
Subjt:  VLTVKLNENNYLLWREMVLAILRG--------------QKALSG------------------WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRAR

Query:  VNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIY----------S
        V+ L+  LQ+  +GSMK  ++L   K  S+ L   G P+  DDL+ Y+ F   S +  + C  E+++I    E   F           Y          S
Subjt:  VNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIY----------S

Query:  TLEI----------LNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLV
        T  +            +  W ADSGA  HI A++ ++ ++  YTG E++ VGNG  L I    A+   +L    +  LY ++L                 
Subjt:  TLEI----------LNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLV

Query:  AASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTF--------LLKDISAKVLVV
         +  +NKS  +S+                 I  P S+WH RLGHAS  +V+ ++   +L    +  K  FC  CQLG  +         L  ++ KVL+ 
Subjt:  AASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTF--------LLKDISAKVLVV

Query:  GFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVS
               +   + +FP K GL                   QSS L     S+ SP S  E   +PT  SP ++ L       S I G    P P L +  
Subjt:  GFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVS

Query:  FNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSV
        F++     HHPM T + +G  +P+ +  F  Y          PL             A+  E  A+  N+TW   P P   N++G+KWV KVK  S G V
Subjt:  FNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSV

Query:  DRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        DR KA +VA GF Q  GIDF ETFSPV+K  T+
Subjt:  DRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

RZB67542.1 Retrovirus-related Pol polyprotein from transposon RE1 [Glycine soja]8.2e-4728.53Show/hide
Query:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKD
        M + IA  ++  +TSKE+W   + +  A  ++R   L+    +T+KG MKM +YL  MK  ++ L+LAG+PIS  DL++  L GLD++Y  +V  + D+ 
Subjt:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKD

Query:  ITTWQELAL------------------------FRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDK
           WQ L                            + L    S+  ++     D +W  DSGA+NH+      +   S+  G  SL+VGNG +L I    
Subjt:  ITTWQELAL------------------------FRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDK

Query:  ASRKVILHETLKDFLYQLELLL-IQSSQSIVNPNLFLVAASGLNKSSIVSSKFVSRCL-------GMFQLENSKSIVEP--------KSLWHRRLGHASE
        +++  + +  L + LY  E+   + S   +   N  LV          V  K   + L       G++QL + KS V          K  WHR+LGH + 
Subjt:  ASRKVILHETLKDFLYQLELLL-IQSSQSIVNPNLFLVAASGLNKSSIVSSKFVSRCL-------GMFQLENSKSIVEP--------KSLWHRRLGHASE

Query:  SVVNTVIKAWNLSASFNEKFAFCAACQLGTTFLL-----------------KDI--SAKVL-----------VVGFSFLVMLYSMK--------------
         V+  V+K  N+ AS N++F+FC ACQ G   LL                  D+   A +L           +  FS    ++ +K              
Subjt:  SVVNTVIKAWNLSASFNEKFAFCAACQLGTTFLL-----------------KDI--SAKVL-----------VVGFSFLVMLYSMK--------------

Query:  -LSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSID--------------GSTSFPIPQLV
         +   F   + +      D    H +  P  S ++ Q       L+V E     +  +   A   V   E +S+D                TS   P+  
Subjt:  -LSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSID--------------GSTSFPIPQLV

Query:  VVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVG--STDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTH
            N       H M+T + +GI++PK     L Y+G   T     +P +V EA   P WK AMDAE  A+  N TW   P     N+I SKWV K K  
Subjt:  VVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVG--STDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTH

Query:  SDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        +DGS++R KA +VAKGF Q  G+D+ ETFSPV+K+ T+
Subjt:  SDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

TrEMBL top hitse value%identityAlignment
A0A2N9G872 Uncharacterized protein5.0e-4224.55Show/hide
Query:  SSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQ------------------------------KALSGW----------VFRS
        SSS+D+ IS   T  S +   P   ++T+KL  +NYLLWR  ++  LRGQ                                 + W          +  S
Subjt:  SSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQ------------------------------KALSGW----------VFRS

Query:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDK-
        ++  + A+VV   T++EVW  L +++++  RAR  Q+   L + +KG + +A++       ++ L     P++  +L+ +++ GL SEY  +V S++ + 
Subjt:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDK-

Query:  DITTWQEL------------------------ALF------------------------------------------------------------RLHLQ
        D  + +EL                        A F                                                             LH  
Subjt:  DITTWQEL------------------------ALF------------------------------------------------------------RLHLQ

Query:  MDFSSIY------------STLEILNDPKWLADSGATNHIVADVGNMVVKS-KYTGGESLVVGNGTKLDIY----------QDKASRKVILH--ETLKDF
          F + Y            +T ++  DP W  D+GAT+H+ +D GN+ ++S +Y G E + VGNG  L I+          Q     + +LH  +  K+ 
Subjt:  MDFSSIY------------STLEILNDPKWLADSGATNHIVADVGNMVVKS-KYTGGESLVVGNGTKLDIY----------QDKASRKVILH--ETLKDF

Query:  LYQLELLLIQSSQSIVNPNLFLV--------AASGLNKSSIVSSKFVSR--CLGMFQLENSKSIVEPKSL--WHRRLGHASESVVNTVIKAWNLSASFNE
        +   +     ++    +P+ FLV           G +K  +      +   C   F    +  + E  SL  WH RLGH +  +V+ V+  + L  S N+
Subjt:  LYQLELLLIQSSQSIVNPNLFLV--------AASGLNKSSIVSSKFVSR--CLGMFQLENSKSIVEPKSL--WHRRLGHASESVVNTVIKAWNLSASFNE

Query:  KFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNG-SPRVA
           FC+AC    +  L   S+   +   S L ++YS             TS   +D  V+  +  P  ++   QP++R SP      +  PTN   P ++
Subjt:  KFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNG-SPRVA

Query:  PLLVSSVESSSI-DGSTSFPIPQLVVVSFNTHVVT--------GHHPMQTCANSGIFQPKNWGS---------FLTYVGSTDAIVSKPLSVKEALGSPIW
         +L S + S  +   S S P+P+   +   +  ++          HPM T + + I +PK +            L   G+ D + ++P    +A+  P W
Subjt:  PLLVSSVESSSI-DGSTSFPIPQLVVVSFNTHVVT--------GHHPMQTCANSGIFQPKNWGS---------FLTYVGSTDAIVSKPLSVKEALGSPIW

Query:  KVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        ++AM+ E  A+ RN+TW+  P     N+IG KWV ++K H++GS++R KA +VAKGF+Q PG+D+ ETFSPV+K  T+
Subjt:  KVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

A0A2N9HTS2 Uncharacterized protein5.0e-4224.55Show/hide
Query:  SSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQ------------------------------KALSGW----------VFRS
        SSS+D+ IS   T  S +   P   ++T+KL  +NYLLWR  ++  LRGQ                                 + W          +  S
Subjt:  SSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQ------------------------------KALSGW----------VFRS

Query:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDK-
        ++  + A+VV   T++EVW  L +++++  RAR  Q+   L + +KG + +A++       ++ L     P++  +L+ +++ GL SEY  +V S++ + 
Subjt:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDK-

Query:  DITTWQEL------------------------ALF------------------------------------------------------------RLHLQ
        D  + +EL                        A F                                                             LH  
Subjt:  DITTWQEL------------------------ALF------------------------------------------------------------RLHLQ

Query:  MDFSSIY------------STLEILNDPKWLADSGATNHIVADVGNMVVKS-KYTGGESLVVGNGTKLDIY----------QDKASRKVILH--ETLKDF
          F + Y            +T ++  DP W  D+GAT+H+ +D GN+ ++S +Y G E + VGNG  L I+          Q     + +LH  +  K+ 
Subjt:  MDFSSIY------------STLEILNDPKWLADSGATNHIVADVGNMVVKS-KYTGGESLVVGNGTKLDIY----------QDKASRKVILH--ETLKDF

Query:  LYQLELLLIQSSQSIVNPNLFLV--------AASGLNKSSIVSSKFVSR--CLGMFQLENSKSIVEPKSL--WHRRLGHASESVVNTVIKAWNLSASFNE
        +   +     ++    +P+ FLV           G +K  +      +   C   F    +  + E  SL  WH RLGH +  +V+ V+  + L  S N+
Subjt:  LYQLELLLIQSSQSIVNPNLFLV--------AASGLNKSSIVSSKFVSR--CLGMFQLENSKSIVEPKSL--WHRRLGHASESVVNTVIKAWNLSASFNE

Query:  KFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNG-SPRVA
           FC+AC    +  L   S+   +   S L ++YS             TS   +D  V+  +  P  ++   QP++R SP      +  PTN   P ++
Subjt:  KFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNG-SPRVA

Query:  PLLVSSVESSSI-DGSTSFPIPQLVVVSFNTHVVT--------GHHPMQTCANSGIFQPKNWGS---------FLTYVGSTDAIVSKPLSVKEALGSPIW
         +L S + S  +   S S P+P+   +   +  ++          HPM T + + I +PK +            L   G+ D + ++P    +A+  P W
Subjt:  PLLVSSVESSSI-DGSTSFPIPQLVVVSFNTHVVT--------GHHPMQTCANSGIFQPKNWGS---------FLTYVGSTDAIVSKPLSVKEALGSPIW

Query:  KVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        ++AM+ E  A+ RN+TW+  P     N+IG KWV ++K H++GS++R KA +VAKGF+Q PG+D+ ETFSPV+K  T+
Subjt:  KVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

A0A438FQA1 Retrovirus-related Pol polyprotein from transposon RE21.2e-4327.8Show/hide
Query:  VLTVKLNENNYLLWREMVLAILRG--------------QKALSG------------------WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRAR
        ++++KL++ NYL+W   ++ +L+                K+L G                  W+  +++  + A V    +++EVW  L   +++  R R
Subjt:  VLTVKLNENNYLLWREMVLAILRG--------------QKALSG------------------WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRAR

Query:  VNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIY----------S
        V+ L+  LQ+  +GSMK  ++L   K  S+ L   G P+  DDL+ Y+ F   S +  + C  E+++I    E   F           Y          S
Subjt:  VNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIY----------S

Query:  TLEI----------LNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLV
        T  +            +  W ADSGA  HI A++ ++ ++  YTG E++ VGNG  L I    A+   +L    +  LY ++L                 
Subjt:  TLEI----------LNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLV

Query:  AASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTF--------LLKDISAKVLVV
         +  +NKS  +S+                 I  P S+WH RLGHAS  +V+ ++   +L    +  K  FC  CQLG  +         L  ++ KVL+ 
Subjt:  AASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFN-EKFAFCAACQLGTTF--------LLKDISAKVLVV

Query:  GFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVS
               +   + +FP K GL                   QSS L     S+ SP S  E   +PT  SP ++ L       S I G    P P L +  
Subjt:  GFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVS

Query:  FNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSV
        F++     HHPM T + +G  +P+ +  F  Y          PL             A+  E  A+  N+TW   P P   N++G+KWV KVK  S G V
Subjt:  FNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSV

Query:  DRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        DR KA +VA GF Q  GIDF ETFSPV+K  T+
Subjt:  DRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

A0A445H1W7 Retrovirus-related Pol polyprotein from transposon RE14.0e-4728.53Show/hide
Query:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKD
        M + IA  ++  +TSKE+W   + +  A  ++R   L+    +T+KG MKM +YL  MK  ++ L+LAG+PIS  DL++  L GLD++Y  +V  + D+ 
Subjt:  MTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKD

Query:  ITTWQELAL------------------------FRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDK
           WQ L                            + L    S+  ++     D +W  DSGA+NH+      +   S+  G  SL+VGNG +L I    
Subjt:  ITTWQELAL------------------------FRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDIYQDK

Query:  ASRKVILHETLKDFLYQLELLL-IQSSQSIVNPNLFLVAASGLNKSSIVSSKFVSRCL-------GMFQLENSKSIVEP--------KSLWHRRLGHASE
        +++  + +  L + LY  E+   + S   +   N  LV          V  K   + L       G++QL + KS V          K  WHR+LGH + 
Subjt:  ASRKVILHETLKDFLYQLELLL-IQSSQSIVNPNLFLVAASGLNKSSIVSSKFVSRCL-------GMFQLENSKSIVEP--------KSLWHRRLGHASE

Query:  SVVNTVIKAWNLSASFNEKFAFCAACQLGTTFLL-----------------KDI--SAKVL-----------VVGFSFLVMLYSMK--------------
         V+  V+K  N+ AS N++F+FC ACQ G   LL                  D+   A +L           +  FS    ++ +K              
Subjt:  SVVNTVIKAWNLSASFNEKFAFCAACQLGTTFLL-----------------KDI--SAKVL-----------VVGFSFLVMLYSMK--------------

Query:  -LSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSID--------------GSTSFPIPQLV
         +   F   + +      D    H +  P  S ++ Q       L+V E     +  +   A   V   E +S+D                TS   P+  
Subjt:  -LSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSID--------------GSTSFPIPQLV

Query:  VVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVG--STDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTH
            N       H M+T + +GI++PK     L Y+G   T     +P +V EA   P WK AMDAE  A+  N TW   P     N+I SKWV K K  
Subjt:  VVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVG--STDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTH

Query:  SDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        +DGS++R KA +VAKGF Q  G+D+ ETFSPV+K+ T+
Subjt:  SDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

A0A803NUC9 Uncharacterized protein4.4e-4628.61Show/hide
Query:  SSSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQK--------------------------------------------ALSG
        +SSSS TP       IS+ F++  S   ++KL+ NN+ LW+ MV  I+RG +                                             L G
Subjt:  SSSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQK--------------------------------------------ALSG

Query:  WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCS
        W++ SMT AIA+ V+  +T+  +W  LE++Y A  RA ++ LR  +Q T+KGS  MA+YL + +  +++L  A        +I Y  +  D  Y      
Subjt:  WVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCS

Query:  IEDKDITTWQELALFRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDI-----------------------
        + ++  T   +           FS++ +T E+LND  W ADSGA+NH+ +D G +  K++Y G E + +G+G KL I                       
Subjt:  IEDKDITTWQELALFRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDI-----------------------

Query:  ------------------------------YQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLVAASGLNKSSIV---------SSKFVSRCL
                                       +D+++ KV+LHETLKD LYQL     QSSQS  +P   + A +       V          S+  S  +
Subjt:  ------------------------------YQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLVAASGLNKSSIV---------SSKFVSRCL

Query:  ---GMFQLENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFNEKF--AFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLIT
           G+    +  + +       R+  H  E  +  + +A      +++ F  A     +L TT L      +V          ++S    + F   ++IT
Subjt:  ---GMFQLENSKSIVEPKSLWHRRLGHASESVVNTVIKAWNLSASFNEKF--AFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLIT

Query:  SHSPSDNVVLHLLPF-PQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVE--SSSIDGSTSFPIPQLVVV-SFNTHVVTGHHPMQTCANSGI
           PS  +    LPF  QSS  S++P S P P +  +      +GSP      +S+V   S  + G+ S    Q+V      +  V  HHPM T    GI
Subjt:  SHSPSDNVVLHLLPF-PQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVE--SSSIDGSTSFPIPQLVVV-SFNTHVVTGHHPMQTCANSGI

Query:  FQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDF
        F+P+     L    ++   + +P SV+EAL    W  AM  E+ A+ +NKTW   P  P  +++G+KWV K+K ++DGSV R KA +VAKGF+Q PG+DF
Subjt:  FQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDF

Query:  KETFSPVVKTPTI
         ETF PV+K  T+
Subjt:  KETFSPVVKTPTI

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.2e-0941.33Show/hide
Query:  WKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVK
        W+ A++ E+ A   N TWT    P   N++ S+WV  VK +  G+  R KA +VA+GF Q   ID++ETF+PV +
Subjt:  WKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-0941.67Show/hide
Query:  KPLSVKEALGSP----IWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        +P S+KE L  P    + K AM  E+ ++ +N T+    LP     +  KWV K+K   D  + R KA +V KGF Q  GIDF E FSPVVK  +I
Subjt:  KPLSVKEALGSP----IWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

P92520 Uncharacterized mitochondrial protein AtMg008202.8e-2154.08Show/hide
Query:  TDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        T  I  +P SV  AL  P W  AM  E+ A+ RNKTW   P P   N++G KWV K K HSDG++DR KA +VAKGF+Q  GI F ET+SPVV+T TI
Subjt:  TDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-1933.5Show/hide
Query:  SSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSF----PIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTD
        S + S    +  SP  + +  + P   S        S+  SS+     S     P P   +V+ N       H M T A +GI +P    S    +    
Subjt:  SSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSF----PIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTD

Query:  AIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPP-YVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        A  S+P +  +AL    W+ AM +EI A   N TW   P PP +V ++G +W+   K +SDGS++R KA +VAKG+ Q PG+D+ ETFSPV+K+ +I
Subjt:  AIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPP-YVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.6e-0345Show/hide
Query:  WLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDI
        WL DSGAT+HI +D  N+ +   YTGG+ ++V +G+ + I
Subjt:  WLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.5e-2235.07Show/hide
Query:  TSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQP
        T +S S++ +L+  P P S S +S   + P P S      +PT  +    P   SS  +S+       P P ++ V+    V T  H M T A  GI +P
Subjt:  TSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPTNGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQP

Query:  KNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTF-GPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKE
            S+ T + +     S+P +  +A+    W+ AM +EI A   N TW    P PP V ++G +W+   K +SDGS++R KA +VAKG+ Q PG+D+ E
Subjt:  KNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTF-GPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKE

Query:  TFSPVVKTPTI
        TFSPV+K+ +I
Subjt:  TFSPVVKTPTI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-0340.91Show/hide
Query:  NDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDI
        N   WL DSGAT+HI +D  N+     YTGG+ +++ +G+ + I
Subjt:  NDPKWLADSGATNHIVADVGNMVVKSKYTGGESLVVGNGTKLDI

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)7.0e-0426.67Show/hide
Query:  VSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDK
        V+  TS+++W  ++  +   + AR  +L   L++   G M++A+Y   MK+ +++L+    P++  +L++YVL GL+ ++  I+  I+ +
Subjt:  VSFKTSKEVWKVLEKVYSATRRARVNQLRGVLQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDK

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.2e-1746.88Show/hide
Query:  AIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        A   +P +  EA    +W  AMD EI A+    TW    LPP    IG KWV K+K +SDG+++R KA +VAKG+ Q  GIDF ETFSPV K  ++
Subjt:  AIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.0e-2254.08Show/hide
Query:  TDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI
        T  I  +P SV  AL  P W  AM  E+ A+ RNKTW   P P   N++G KWV K K HSDG++DR KA +VAKGF+Q  GI F ET+SPVV+T TI
Subjt:  TDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGPLPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGACGAAAATTCTTCTTCGTCATCTTCTTCATCTAGTGACACTCCTATATCTTTAGCAACAACAGTTATCAGTTCTTCGTTCAACCATCCTCAGAGCATTGTGTT
GACAGTCAAACTCAATGAAAACAATTATTTGTTGTGGAGGGAAATGGTTCTAGCCATTCTTCGAGGCCAAAAGGCATTGTCTGGGTGGGTTTTTAGATCAATGACCCTAG
CAATTGCGGCGAATGTTGTGAGCTTCAAAACTTCCAAGGAAGTCTGGAAGGTGTTGGAAAAGGTTTATAGTGCCACTAGAAGAGCTAGGGTTAATCAACTTCGAGGAGTT
CTCCAAAGTACCAAAAAAGGATCGATGAAGATGGCAGAGTACCTGGCTATTATGAAACAAGCTTCTGAGAATCTTCAACTAGCGGGTAATCCCATCTCTCTCGATGATCT
TATCTTATATGTGTTATTTGGTCTGGATTCGGAATATATTTTGATAGTGTGTTCAATCGAAGATAAGGACATTACCACTTGGCAAGAGCTCGCTCTATTTCGATTACACC
TTCAAATGGACTTTAGCTCGATATACTCCACACTTGAAATCCTAAATGACCCCAAGTGGCTTGCGGACAGTGGAGCAACCAACCATATTGTAGCAGATGTTGGTAATATG
GTTGTTAAATCTAAGTACACTGGTGGAGAGTCATTGGTTGTTGGTAATGGGACTAAATTAGATATATATCAAGACAAGGCTTCAAGGAAGGTGATATTGCACGAAACGCT
TAAAGATTTTTTGTACCAACTCGAGTTGCTTTTAATTCAAAGTTCCCAGTCTATTGTCAATCCTAATTTGTTTCTTGTTGCTGCTTCTGGTTTAAATAAGTCGTCTATTG
TGTCTTCTAAGTTCGTTTCTAGATGTCTTGGTATGTTTCAACTTGAAAATTCCAAGTCTATTGTTGAGCCTAAGTCTTTGTGGCATCGTCGCCTTGGTCATGCTTCTGAG
TCAGTTGTGAATACTGTTATCAAAGCTTGGAATCTGAGTGCTTCCTTTAATGAGAAATTTGCCTTTTGTGCTGCTTGTCAACTTGGTACAACTTTTCTCCTAAAGGATAT
AAGTGCCAAAGTTCTAGTGGTCGGATTTTCTTTTCTCGTCATGTTGTATTCAATGAAACTGAGTTTCCCTTTTAAGTCTGGTCTTTTGATAACTTCTCATTCACCATCTG
ATAATGTCGTTCTCCATTTGCTTCCTTTTCCTCAGTCATCATCTCTGTCTTCTCAACCTATGTCTAGGCCTTCACCATTGTCTGTTACCGAGTTTGCTGCTATGCCTACG
AATGGTTCACCAAGGGTTGCTCCTCTTCTTGTCTCTTCAGTTGAATCGTCTTCTATTGATGGCTCAACCTCTTTTCCTATACCTCAGCTTGTTGTTGTTAGCTTCAATAC
GCATGTTGTTACAGGCCATCATCCTATGCAAACTTGTGCCAATAGTGGCATTTTTCAGCCCAAGAACTGGGGCTCCTTTCTGACTTATGTTGGTTCTACTGATGCCATTG
TCTCAAAACCTTTGTCAGTAAAGGAGGCTTTAGGTTCTCCTATATGGAAGGTTGCCATGGATGCTGAGATTTTTGCTATTTATCGAAACAAAACTTGGACGTTTGGGCCT
CTTCCACCTTATGTTAACTTGATTGGTAGCAAGTGGGTTGTCAAAGTCAAAACACATTCAGATGGTTCTGTTGATCGATGCAAAGCTTGCATGGTGGCCAAGGGGTTTTA
TCAAATTCCTGGCATTGATTTCAAGGAAACTTTTAGTCCGGTTGTCAAAACTCCTACCATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGACGAAAATTCTTCTTCGTCATCTTCTTCATCTAGTGACACTCCTATATCTTTAGCAACAACAGTTATCAGTTCTTCGTTCAACCATCCTCAGAGCATTGTGTT
GACAGTCAAACTCAATGAAAACAATTATTTGTTGTGGAGGGAAATGGTTCTAGCCATTCTTCGAGGCCAAAAGGCATTGTCTGGGTGGGTTTTTAGATCAATGACCCTAG
CAATTGCGGCGAATGTTGTGAGCTTCAAAACTTCCAAGGAAGTCTGGAAGGTGTTGGAAAAGGTTTATAGTGCCACTAGAAGAGCTAGGGTTAATCAACTTCGAGGAGTT
CTCCAAAGTACCAAAAAAGGATCGATGAAGATGGCAGAGTACCTGGCTATTATGAAACAAGCTTCTGAGAATCTTCAACTAGCGGGTAATCCCATCTCTCTCGATGATCT
TATCTTATATGTGTTATTTGGTCTGGATTCGGAATATATTTTGATAGTGTGTTCAATCGAAGATAAGGACATTACCACTTGGCAAGAGCTCGCTCTATTTCGATTACACC
TTCAAATGGACTTTAGCTCGATATACTCCACACTTGAAATCCTAAATGACCCCAAGTGGCTTGCGGACAGTGGAGCAACCAACCATATTGTAGCAGATGTTGGTAATATG
GTTGTTAAATCTAAGTACACTGGTGGAGAGTCATTGGTTGTTGGTAATGGGACTAAATTAGATATATATCAAGACAAGGCTTCAAGGAAGGTGATATTGCACGAAACGCT
TAAAGATTTTTTGTACCAACTCGAGTTGCTTTTAATTCAAAGTTCCCAGTCTATTGTCAATCCTAATTTGTTTCTTGTTGCTGCTTCTGGTTTAAATAAGTCGTCTATTG
TGTCTTCTAAGTTCGTTTCTAGATGTCTTGGTATGTTTCAACTTGAAAATTCCAAGTCTATTGTTGAGCCTAAGTCTTTGTGGCATCGTCGCCTTGGTCATGCTTCTGAG
TCAGTTGTGAATACTGTTATCAAAGCTTGGAATCTGAGTGCTTCCTTTAATGAGAAATTTGCCTTTTGTGCTGCTTGTCAACTTGGTACAACTTTTCTCCTAAAGGATAT
AAGTGCCAAAGTTCTAGTGGTCGGATTTTCTTTTCTCGTCATGTTGTATTCAATGAAACTGAGTTTCCCTTTTAAGTCTGGTCTTTTGATAACTTCTCATTCACCATCTG
ATAATGTCGTTCTCCATTTGCTTCCTTTTCCTCAGTCATCATCTCTGTCTTCTCAACCTATGTCTAGGCCTTCACCATTGTCTGTTACCGAGTTTGCTGCTATGCCTACG
AATGGTTCACCAAGGGTTGCTCCTCTTCTTGTCTCTTCAGTTGAATCGTCTTCTATTGATGGCTCAACCTCTTTTCCTATACCTCAGCTTGTTGTTGTTAGCTTCAATAC
GCATGTTGTTACAGGCCATCATCCTATGCAAACTTGTGCCAATAGTGGCATTTTTCAGCCCAAGAACTGGGGCTCCTTTCTGACTTATGTTGGTTCTACTGATGCCATTG
TCTCAAAACCTTTGTCAGTAAAGGAGGCTTTAGGTTCTCCTATATGGAAGGTTGCCATGGATGCTGAGATTTTTGCTATTTATCGAAACAAAACTTGGACGTTTGGGCCT
CTTCCACCTTATGTTAACTTGATTGGTAGCAAGTGGGTTGTCAAAGTCAAAACACATTCAGATGGTTCTGTTGATCGATGCAAAGCTTGCATGGTGGCCAAGGGGTTTTA
TCAAATTCCTGGCATTGATTTCAAGGAAACTTTTAGTCCGGTTGTCAAAACTCCTACCATCTAG
Protein sequenceShow/hide protein sequence
MRDENSSSSSSSSSDTPISLATTVISSSFNHPQSIVLTVKLNENNYLLWREMVLAILRGQKALSGWVFRSMTLAIAANVVSFKTSKEVWKVLEKVYSATRRARVNQLRGV
LQSTKKGSMKMAEYLAIMKQASENLQLAGNPISLDDLILYVLFGLDSEYILIVCSIEDKDITTWQELALFRLHLQMDFSSIYSTLEILNDPKWLADSGATNHIVADVGNM
VVKSKYTGGESLVVGNGTKLDIYQDKASRKVILHETLKDFLYQLELLLIQSSQSIVNPNLFLVAASGLNKSSIVSSKFVSRCLGMFQLENSKSIVEPKSLWHRRLGHASE
SVVNTVIKAWNLSASFNEKFAFCAACQLGTTFLLKDISAKVLVVGFSFLVMLYSMKLSFPFKSGLLITSHSPSDNVVLHLLPFPQSSSLSSQPMSRPSPLSVTEFAAMPT
NGSPRVAPLLVSSVESSSIDGSTSFPIPQLVVVSFNTHVVTGHHPMQTCANSGIFQPKNWGSFLTYVGSTDAIVSKPLSVKEALGSPIWKVAMDAEIFAIYRNKTWTFGP
LPPYVNLIGSKWVVKVKTHSDGSVDRCKACMVAKGFYQIPGIDFKETFSPVVKTPTI