; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007894 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007894
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:7482717..7485352
RNA-Seq ExpressionLag0007894
SyntenyLag0007894
Gene Ontology termsNA
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]6.0e-7830Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKATSEDCRF--TGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS
        + W +   V + SYS GHI+V+I   S D +F  TGFYG+P T  R+ SW+LL RLSY     WVV GDFNEIL   +K GG  RP  QM  F+ A+ + 
Subjt:  VIWNNELNVNIFSYSAGHINVMIKATSEDCRF--TGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS

Query:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPIL--ASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQ
        +L             +    GS     LDR + + V    Y      HL +   SDH PIL  A  D    GA      R   FEE WT+ PE   +I++
Subjt:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPIL--ASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQ

Query:  IWQATGRDRSLSHQARVQA------------YLRILATKVERGLVGVGG----GTGILSGFIPEHLNGISEKK----LKASRIV--------------QA
         W+ T    S+S+   + A             +R   T   + L  + G       +L   + E ++ + EK+     + SR+V              +A
Subjt:  IWQATGRDRSLSHQARVQA------------YLRILATKVERGLVGVGG----GTGILSGFIPEHLNGISEKK----LKASRIV--------------QA

Query:  NGSQIRNRA-------------------------------------------------------LLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQH
        +    RNR                                                        LL+ FTR E+E  L  M  TKAPG +G+ ALF+Q++
Subjt:  NGSQIRNRA-------------------------------------------------------LLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQH

Query:  WDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVI--------
        W ++ D+ +  CL ILNG  SV+E NHT + L+PK+    +V +FRPISLC+ VYK+ A  I  RLK VL  V++  QS FV  R+I DNV+        
Subjt:  WDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVI--------

Query:  ISG------------MGLSKVDDASDGVSNK--MGRVGDELCGVSSVLSFAE-------WEA--IVRISAFLGVRQGDPLSPYLFIF-------------
        I G            + ++K  D  + V  +  M ++G     VS V+           W+   +  I    G+RQG PLSPYLF+              
Subjt:  ISG------------MGLSKVDDASDGVSNK--MGRVGDELCGVSSVLSFAE-------WEA--IVRISAFLGVRQGDPLSPYLFIF-------------

Query:  --------------------------------------------------VRRVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAW
                                                            + INY KS   +S N        I   L V        YLGLP+    
Subjt:  --------------------------------------------------VRRVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAW

Query:  AKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWV-AWGFA--
         +  +F+ ++D++WK + GWKEK+    GKE+L+KAV QAIPTY+MSCF++P  LC E+NG+ A F+   +   KR I     + +  S +    GF   
Subjt:  AKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWV-AWGFA--

Query:  ---IYAFLIRPCW---LSSASVYPRTSLAFY---IEFCEG--GTFVQVLFFERHWGREVFRRGIRWRVGS
             A L + CW    +  S+  R   A Y   + F E   GT    ++    WG+E+  +G+RWRVGS
Subjt:  ---IYAFLIRPCW---LSSASVYPRTSLAFY---IEFCEG--GTFVQVLFFERHWGREVFRRGIRWRVGS

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]7.6e-8130.54Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKATSEDCRF--TGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS
        + W +   + + SYS GHI+V+I   S D +F  TGFYG+P T  R+ SW++L RLS+     WVV GDFNEIL   +K GG  RP  QM  F+ A+ + 
Subjt:  VIWNNELNVNIFSYSAGHINVMIKATSEDCRF--TGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS

Query:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVR--FEEGWTRFPECKDIIKQ
        +L             +    GS     LDR + + V    Y      HL L   SDH PIL          PE    R  R  FEE WT+ P+   +I++
Subjt:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVR--FEEGWTRFPECKDIIKQ

Query:  IWQATGRDRSLS----------------HQARVQAYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISEKK----LKASRIVQANGSQI---------
         W+ T   +S+S                H + V+  L+    ++      +     +L   + E ++ + EK+     + SR+ Q    +I         
Subjt:  IWQATGRDRSLS----------------HQARVQAYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISEKK----LKASRIVQANGSQI---------

Query:  -------------------------RNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPK
                                  N  LL+ FTR E+E  L  M  TKAPG +G+ ALF+Q++W ++ D+ +  CL ILNG  SV+E NHT + L+PK
Subjt:  -------------------------RNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPK

Query:  ITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISG------MGLSKVDDASDGVSNKMGRVGDE-----LCGVSS
        +    +V +FRPISLC+ VYK+ A  I  RLK VL  V++  QS FV  R+I DNV+ +        G+ K  D    +   M +  D      L  +  
Subjt:  ITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISG------MGLSKVDDASDGVSNKMGRVGDE-----LCGVSS

Query:  VLSF-AEWEAIV-------------------RISAFLGVRQGDPLSPYLF------------------------------------------IFVR----
         L F A W A V                    I    G+RQG PLSPYLF                                          +F++    
Subjt:  VLSF-AEWEAIV-------------------RISAFLGVRQGDPLSPYLF------------------------------------------IFVR----

Query:  -----------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVK
                         + INY KS F +S N        I+  L V       +YLGLP+     +  +F+ ++D++WK + GWKEK+    GKE+L+K
Subjt:  -----------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVK

Query:  AVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWV-AWGFA-----IYAFLIRPCW---LSSASVYPRTSLAFY---I
        AV QAIPTY+MSCF++P  LC E+NG+ A F+   +   KR I     + +  S +    GF        A L + CW    +  S+  R   A Y   +
Subjt:  AVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWV-AWGFA-----IYAFLIRPCW---LSSASVYPRTSLAFY---I

Query:  EFCEG--GTFVQVLFFERHWGREVFRRGIRWRVGS
         F E   GT    ++    WG+E+  +G+RWRVG+
Subjt:  EFCEG--GTFVQVLFFERHWGREVFRRGIRWRVGS

XP_023899813.1 uncharacterized protein LOC112011695 [Quercus suber]2.8e-7531.83Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKATSED--CRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS
        ++W   + V++ +YS  HI+  ++   +      TGFYGNP TT R  SW  L+ L   S LPW+  GDFNEI+  +EK GG  RP  QM+ F  AI + 
Subjt:  VIWNNELNVNIFSYSAGHINVMIKATSED--CRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS

Query:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIW
           D          +  +  G+     LDR L +   +  +    V HL L   SDH P+     + ++     +  +  +FE  W +   C+++++  W
Subjt:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIW

Query:  QATGRDRSLSHQAR-----VQAYLR------------------ILATKVERGLVGVGGGTGILSGFIPEHLNGISEKKLKASR---------IVQANGSQ
        +         H  R     +  +L+                     T               + G   E +     K   +SR          +Q   S 
Subjt:  QATGRDRSLSHQAR-----VQAYLR------------------ILATKVERGLVGVGGGTGILSGFIPEHLNGISEKKLKASR---------IVQANGSQ

Query:  IRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICA
          N  L R FT  EV  ALK M   KAPGP+G+  +F+QQ W    +  +   L+ LN G S    N T++VL+PKI   ++V DFRPISLC+V YKI +
Subjt:  IRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICA

Query:  NAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVLSFAEWEAIVRISAFLGVRQGDPLSPYLF-----
         AI  RLK  L S++S  QS FV GRLI+DNV+++   +  +     G   +M  V  ++ G+S+++  + W+   R+      R G  LS   F     
Subjt:  NAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVLSFAEWEAIVRISAFLGVRQGDPLSPYLF-----

Query:  IFVR---------------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLV
        IF +                     + +N  K+    S NT  ++Q  I+   G        +YLGLPS     K   F  I+++V + L GWKEK+   
Subjt:  IFVR---------------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLV

Query:  GGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQR
         GKEVL+KAVAQAIPTYTMSCFK+P SLC E+  +  NF+ GQR
Subjt:  GGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQR

XP_030963604.1 uncharacterized protein LOC115984727 [Quercus lobata]2.8e-7531.28Show/hide
Query:  VIWNNELNVNIFSYSAGHIN-VMIKATSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ
        + W  + N+ + S S  HI+ V+ + T +  RFTGFYG P T LRSASW LL  L     LPW+ GGDFNE+L   EK GG P P  QMQ FR  +   +
Subjt:  VIWNNELNVNIFSYSAGHIN-VMIKATSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ

Query:  LLDR---------VKNQRAGSSTWIHLDRFL-LSHVVVCF----YRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEG---RCLRPVRFEEGWTRFPECK
        L+D           KN  +G   W  LDR +  +  + CF     R F          SDH PI           PEG   +  RP RFE+ W     C 
Subjt:  LLDR---------VKNQRAGSSTWIHLDRFL-LSHVVVCF----YRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEG---RCLRPVRFEEGWTRFPECK

Query:  DIIKQIWQATGRD----------------------RSLSHQARVQAYLRILATKVERGLVGVGGGT-----------------------------GILSG
        D + + W A   D                       S+ + AR     + + +K E  +V + GG+                              +LSG
Subjt:  DIIKQIWQATGRD----------------------RSLSHQARVQAYLRILATKVERGLVGVGGGT-----------------------------GILSG

Query:  ------------------------------------------------FIPEHLNGISEKKLKASRIVQANGSQIRNRALLREFTRVEVEAALKGMSLTK
                                                        F   + +G+        R+V A+     N  L+ EFTR+EVE ALK M+  K
Subjt:  ------------------------------------------------FIPEHLNGISEKKLKASRIVQANGSQIRNRALLREFTRVEVEAALKGMSLTK

Query:  APGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGR
        APGPNG+  +F+Q +WD I D+   + L  LN  D +  LNHT++ L+PK+     + +F PI+LC+V+YK+ +  +  RLK VL  ++S  QS F   +
Subjt:  APGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGR

Query:  LISDNVIISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVLSFAEWEAIVRISAFLG-------------------VRQGD--PLSPYLFIFVR---RVI
         ISDN++++   L  +     G   K+G V  +L  +S      EWE + +I   +G                    R GD   +   L ++ +   ++I
Subjt:  LISDNVIISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVLSFAEWEAIVRISAFLG-------------------VRQGD--PLSPYLFIFVR---RVI

Query:  NYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSL
        N  K+    S+N     +  I+N LGV +     +YLGLP+     K A    I+DRVW  L+GWKEK+    GKEVL+K V QAIPT+ MSCF+LPV L
Subjt:  NYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSL

Query:  CTEINGLCANFYGGQR
        C +I  L   F+ GQR
Subjt:  CTEINGLCANFYGGQR

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata]2.7e-7829.22Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKATSEDC-RFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ
        ++W N++ + + S+S  HI+ ++   SE+  R TGFYG P T+ R   W +L  LS    L W   GDFNE+L  S+K+GG PR   QMQ+FR A+ +  
Subjt:  VIWNNELNVNIFSYSAGHINVMIKATSEDC-RFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ

Query:  LLD--------RVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIWQA
         +D            +R G   W  LDR + ++  +  +    V HL     SDHRP+L S D    G  +    +P RFE  W   P CK  + + W  
Subjt:  LLD--------RVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIWQA

Query:  TGRDRSLSHQ-------------------ARVQAYLRILATK--------VERGLVGV---------------------------------------GGG
          R   + +                      V+  ++++  K        V+RG  G                                        G  
Subjt:  TGRDRSLSHQ-------------------ARVQAYLRILATK--------VERGLVGV---------------------------------------GGG

Query:  T-----GILSGFIPEH-------------LNGISEKKLKAS------RI---VQANGSQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQH
        T       + G   E+             L    EK  K+S      R+   VQ   +   N  L + ++  EVE A+K M+  KAPGP+G+  LFYQ +
Subjt:  T-----GILSGFIPEH-------------LNGISEKKLKAS------RI---VQANGSQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQH

Query:  WDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSK
        W  +  + +   L+ LN G  +K +NHT++ L+PK+     V +FRPISLC+V+YKI + AI  RLK +L+S++S  QS F++ RLI+DNV+I+   L  
Subjt:  WDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSK

Query:  VDDASDGVSNKMGRVGDELCGVSSV-----------LSFAE-WEAIVR-------------------ISAFLGVRQGDPLSPYLFIFVR-----------
        + +   G +  M    D       V           L F E W +++                    I+   G+RQGDPLSPYLF+F             
Subjt:  VDDASDGVSNKMGRVGDELCGVSSV-----------LSFAE-WEAIVR-------------------ISAFLGVRQGDPLSPYLFIFVR-----------

Query:  ----------------------------------------------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAW
                                                            +++N DK+    S+NT+  +Q  I+N+LGV   H   +YLGLPS    
Subjt:  ----------------------------------------------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAW

Query:  AKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQ
         K A F  I++R+W  ++GWKEK+    GKE+++KAV Q+IPTY+MS FKLPV LC +I  +   F+ GQ
Subjt:  AKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQ

TrEMBL top hitse value%identityAlignment
A0A2N9EVR9 F-box domain-containing protein1.2e-7932.52Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKATSEDC-RFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ
        ++W   +N+ + S+S  HI+ ++   SE+  RFTGFYG P T  R  SW LL RL+  + LPW   GDFNE++   EK G   R   QMQ FR  +    
Subjt:  VIWNNELNVNIFSYSAGHINVMIKATSEDC-RFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ

Query:  LLD--------RVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIWQ-
         +D           N R G  TW  LDR + +   +  +    V HL  +  SDH+P+  S D   R +      +  RFEE WT    C+++I   W+ 
Subjt:  LLD--------RVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIWQ-

Query:  ---------ATGRDRSLSHQARV---QAY--LRILATKVERGL-----VGVGGGTGILSGFIPEHLNGISEKK------------LKAS-----------
                   G+  +   + RV   Q++  ++    +VE+ L     V + GG       + + L+ +  K+            LKA            
Subjt:  ---------ATGRDRSLSHQARV---QAY--LRILATKVERGL-----VGVGGGTGILSGFIPEHLNGISEKK------------LKAS-----------

Query:  ----------RIVQANG--------------SQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELN
                  R+   NG              ++  N  L REFT  EV  ALK M+  KAPGP+G+  LFY ++W LI DE +   L  LN G  ++  N
Subjt:  ----------RIVQANG--------------SQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELN

Query:  HTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVL
        HTY+ L+PK+   ++V DFRPISLC+V+YK+ +  +  RLK +L ++VS  QS FV GRLI+DN++++   L  +       + +MG +  +L  +S   
Subjt:  HTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVL

Query:  SFAEWEAIVRISAFLGVRQGDPLSPYLFIFVR--------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDR
           EW+ + R+   +G       S   +   +        + IN  K+    S++T    +  I+N LGV       RYLGLPS    AK + F  I+++
Subjt:  SFAEWEAIVRISAFLGVRQGDPLSPYLFIFVR--------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDR

Query:  VWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQ
        VW  L+GWKEK+     KE+L+K+VAQAIP Y MSCF+LP  L  +I  L   F+ GQ
Subjt:  VWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQ

A0A2N9GDB5 Reverse transcriptase domain-containing protein3.4e-7930.31Show/hide
Query:  RAMFHIDDRFGLVRFSSSIRFDSLHP----------YVIWNNELNVNIFSYSAGHINVMIKATSEDC-RFTGFYGNPATTLRSASWQLLERLSYLSPLPW
        RA  H +D   L R    ++F++              ++W  E+ + + SYS  HI+ ++    +D  RFTGFYG P T  R  SW LL RL+    +PW
Subjt:  RAMFHIDDRFGLVRFSSSIRFDSLHP----------YVIWNNELNVNIFSYSAGHINVMIKATSEDC-RFTGFYGNPATTLRSASWQLLERLSYLSPLPW

Query:  VVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQLLD--------RVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWD
           GDFNE++   EK G   R   QMQ FR  +    L+D           N R G  TW  LDR + +   +  +    V HL     SDH+PI  + +
Subjt:  VVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQLLD--------RVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWD

Query:  RGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIWQAT------------------GRD-------------------RSLSHQARVQ---------AYL
           +        RP RFEE WT    C+++IK  WQ T                  GRD                   R    ++RV+          Y 
Subjt:  RGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIWQAT------------------GRD-------------------RSLSHQARVQ---------AYL

Query:  RILATKVER-----GLVGVGGGTGILSGFIP----EHLNGISEKKL---------KASRIVQANGSQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGV
           AT+ +R      L    G        +P    +H N + +               R+V A      N  L++EFT  EVE ALK M+  KAPGP+G+
Subjt:  RILATKVER-----GLVGVGGGTGILSGFIP----EHLNGISEKKL---------KASRIVQANGSQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGV

Query:  NALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVI
          LFYQQ+W LI  + S+  L  LN G  +K +NHTY+ L+PK+     V +FRPISLC+V+YKI +  +  RLK +L  +VS  QS F+ GRLI+DN++
Subjt:  NALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVI

Query:  ISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVLSFAEWEAIVRISAFL-----------------------------------GVRQGDPLSPYLFIF-
        ++   L  +     G +  M    D    +S      EW+ + R+   +                                   G+RQGDPLSPYLF+  
Subjt:  ISGMGLSKVDDASDGVSNKMGRVGDELCGVSSVLSFAEWEAIVRISAFL-----------------------------------GVRQGDPLSPYLFIF-

Query:  ------------------------------------------------VRRV--------------INYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSL
                                                        V R+              IN  K+    S++T    Q  I++ L V      
Subjt:  ------------------------------------------------VRRV--------------INYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSL

Query:  GRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQ
         RYLGLPS    AK + F  I++RVW  L+GWK K+    G+EVL+K+VAQAIP+Y MSCF+LP  L  EI  +   F+ GQ
Subjt:  GRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQ

A0A2N9HTH6 Reverse transcriptase domain-containing protein2.0e-8729.8Show/hide
Query:  VIWNNELNVNIFSYSAGHI--NVMIKATSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS
        + WN+E+ + + +YS  HI  NV++  + ++ RFTGFYGNP T  R  SW LL+ L +L   PW+  GDFNE+L  +EK G   RP  Q++ FR A+   
Subjt:  VIWNNELNVNIFSYSAGHI--NVMIKATSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS

Query:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIW
        QL D           K +R   +    LDR L S   +  Y    V HL  +  SDH P+L             R  +   FE  W +  +CK +I+  W
Subjt:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIW

Query:  QATGRDRSLSHQARVQ------AYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISE-------KKLKAS---------RIVQANGSQI---RNRALL
            +D SL  +   +      A +    TK       +     +L   + E  +G S+       KK+  S         + +   G  +    N  L+
Subjt:  QATGRDRSLSHQARVQ------AYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISE-------KKLKAS---------RIVQANGSQI---RNRALL

Query:  REFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRL
         +FT  EV  ALK M  TKAPGP+G++A+FYQ +W+++  E +   L+IL+ G  ++++N T++VL+PKI     + D+RPI+LC+V+YKI +  +  RL
Subjt:  REFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRL

Query:  KGVLHSVVSTEQSTFVSGRLISDNVIIS--------------------GMGLSKVDDA-----SDGVSNKMGRVGDELCGVSSVLSFAEWEAIV------
        K VL  V+S  QS FV GRLI+DNV+++                     + +SK  D       + V  ++G V + +  +   LS   +  ++      
Subjt:  KGVLHSVVSTEQSTFVSGRLISDNVIIS--------------------GMGLSKVDDA-----SDGVSNKMGRVGDELCGVSSVLSFAEWEAIV------

Query:  RISAFLGVRQGDPLSPYLFIFV--------------RRV-------------------------------------------------INYDKSMFMVSR
          +A  G+RQGDPLSPY+F+                R++                                                 +N  K+    ++
Subjt:  RISAFLGVRQGDPLSPYLFIFV--------------RRV-------------------------------------------------INYDKSMFMVSR

Query:  NTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANF
        NT+  M+ +I+N   VL+  S  +YLGLPS    +K A F  I+DRVW+ + GWKEK+    G+E+L+KAVAQ+IPTY MSCFKLP SLC E+N + +NF
Subjt:  NTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANF

Query:  YGGQRMLGKRSIGVAGTDCVATSCWVAWGFA-----IYAFLIRPCWLSSASVYPRTSLAFYIEFCEGGTFVQVLFFER--------HWGREVFRRGIRWR
        + G +  G+    +       +      GF        A L +  W          +  F  ++     F+      R           REV   G+RW 
Subjt:  YGGQRMLGKRSIGVAGTDCVATSCWVAWGFA-----IYAFLIRPCWLSSASVYPRTSLAFYIEFCEGGTFVQVLFFER--------HWGREVFRRGIRWR

Query:  VG
        +G
Subjt:  VG

A0A2N9I9F4 Reverse transcriptase domain-containing protein3.7e-8130.25Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKA-TSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ
        + W   +NV+I SYS  HI+ +I   T    RFT FYG P + LR  SW LL  ++  + +PW   GDFNEI+ +SEKSG  P+   QMQ FR  I    
Subjt:  VIWNNELNVNIFSYSAGHINVMIKA-TSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNSQ

Query:  LLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIW-
         +D            N+   ++TW+ LDRF+ ++  V  +    V HL     SDH+P+   W   +         RP RFEE W   P C+  +   W 
Subjt:  LLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQIW-

Query:  ---------QATGRDRSLSHQARVQA------YLRILATKVER-------GLVGVGGGTGILSGFIPEHLNGISEKKL-----------KASRIVQANGS
                 Q   + R  S + R  +        R L  K E+        +VG G  T ++S      +  + E+K+              R       
Subjt:  ---------QATGRDRSLSHQARVQA------YLRILATKVER-------GLVGVGGGTGILSGFIPEHLNGISEKKL-----------KASRIVQANGS

Query:  QIR----------------------------------NRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKE
         IR                                  N  L   FT  E+E A+K M+  KAPGP+G+  +FYQ +W ++  + S   L  LN G     
Subjt:  QIR----------------------------------NRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKE

Query:  LNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDG-----------------
        LNHTY+ L+PK      V ++RPISLC+V+YK+ +  +  RLK VL  V+S  QS FV GRLI+DN++I+   L  + +   G                 
Subjt:  LNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDG-----------------

Query:  --------VSNKMGRVGDELCGVSSVLSFAEWEAIVR------ISAFLGVRQGDPLSPYLF---------------------------------------
                V  +MG        + + +S   +  ++       I+   G+RQGDP+SPYLF                                       
Subjt:  --------VSNKMGRVGDELCGVSSVLSFAEWEAIVR------ISAFLGVRQGDPLSPYLF---------------------------------------

Query:  ---IFVR---------------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKM
           +F R                     + +N +K+    SRNT   MQ  +++ LGV       +YLGLPS     K   F  I++RVW  ++GWKEK+
Subjt:  ---IFVR---------------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKM

Query:  FLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQR
            G+E+L+KAV QAIPTYTM+CFKLPV+LC EI GL   F+ GQ+
Subjt:  FLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQR

A0A5E4FZN9 PREDICTED: retrotransposon3.7e-8130.54Show/hide
Query:  VIWNNELNVNIFSYSAGHINVMIKATSEDCRF--TGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS
        + W +   + + SYS GHI+V+I   S D +F  TGFYG+P T  R+ SW++L RLS+     WVV GDFNEIL   +K GG  RP  QM  F+ A+ + 
Subjt:  VIWNNELNVNIFSYSAGHINVMIKATSEDCRF--TGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKSGGAPRPLLQMQAFRTAIGNS

Query:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVR--FEEGWTRFPECKDIIKQ
        +L             +    GS     LDR + + V    Y      HL L   SDH PIL          PE    R  R  FEE WT+ P+   +I++
Subjt:  QLLDR---------VKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVR--FEEGWTRFPECKDIIKQ

Query:  IWQATGRDRSLS----------------HQARVQAYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISEKK----LKASRIVQANGSQI---------
         W+ T   +S+S                H + V+  L+    ++      +     +L   + E ++ + EK+     + SR+ Q    +I         
Subjt:  IWQATGRDRSLS----------------HQARVQAYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISEKK----LKASRIVQANGSQI---------

Query:  -------------------------RNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPK
                                  N  LL+ FTR E+E  L  M  TKAPG +G+ ALF+Q++W ++ D+ +  CL ILNG  SV+E NHT + L+PK
Subjt:  -------------------------RNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPK

Query:  ITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISG------MGLSKVDDASDGVSNKMGRVGDE-----LCGVSS
        +    +V +FRPISLC+ VYK+ A  I  RLK VL  V++  QS FV  R+I DNV+ +        G+ K  D    +   M +  D      L  +  
Subjt:  ITGRRSVKDFRPISLCSVVYKICANAI-TRLKGVLHSVVSTEQSTFVSGRLISDNVIISG------MGLSKVDDASDGVSNKMGRVGDE-----LCGVSS

Query:  VLSF-AEWEAIV-------------------RISAFLGVRQGDPLSPYLF------------------------------------------IFVR----
         L F A W A V                    I    G+RQG PLSPYLF                                          +F++    
Subjt:  VLSF-AEWEAIV-------------------RISAFLGVRQGDPLSPYLF------------------------------------------IFVR----

Query:  -----------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVK
                         + INY KS F +S N        I+  L V       +YLGLP+     +  +F+ ++D++WK + GWKEK+    GKE+L+K
Subjt:  -----------------RVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWKVLRGWKEKMFLVGGKEVLVK

Query:  AVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWV-AWGFA-----IYAFLIRPCW---LSSASVYPRTSLAFY---I
        AV QAIPTY+MSCF++P  LC E+NG+ A F+   +   KR I     + +  S +    GF        A L + CW    +  S+  R   A Y   +
Subjt:  AVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWV-AWGFA-----IYAFLIRPCW---LSSASVYPRTSLAFY---I

Query:  EFCEG--GTFVQVLFFERHWGREVFRRGIRWRVGS
         F E   GT    ++    WG+E+  +G+RWRVG+
Subjt:  EFCEG--GTFVQVLFFERHWGREVFRRGIRWRVGS

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein1.6e-1236.03Show/hide
Query:  SQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKI
        S+ R   L    T  E+  AL+ M   K+PG +G+   F+Q  WD +  +   +       G+         L L+PK    R +K++RP+SL S  YKI
Subjt:  SQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKI

Query:  CANAIT-RLKGVLHSVVSTEQSTFVSGRLISDNVII
         A AI+ RLK VL  V+  +QS  V GR I DNV +
Subjt:  CANAIT-RLKGVLHSVVSTEQSTFVSGRLISDNVII

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.4e-1341.18Show/hide
Query:  EVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKI
        E+ AA+  M   KAPGP+   A F+ + W ++KD T A        G  +K  N T + L+PK+TG   +  FRP+S C+VVYKI
Subjt:  EVEAALKGMSLTKAPGPNGVNALFYQQHWDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTCCAAACGCGTGCGATGTTCCATATTGATGACCGGTTCGGTTTGGTCCGGTTCAGTTCATCGATTCGGTTCGACTCCTTACACCCCTACGTGATATGGAATAA
TGAGTTGAATGTTAACATTTTCTCTTATTCTGCAGGGCATATTAATGTTATGATCAAGGCAACTTCGGAGGATTGTCGGTTCACGGGGTTTTATGGAAACCCAGCCACGA
CACTTCGGTCTGCATCGTGGCAATTGCTGGAGCGCCTGAGCTATTTGTCCCCTCTCCCATGGGTTGTGGGTGGGGACTTCAACGAAATACTTGCAGATTCTGAAAAGTCG
GGTGGGGCTCCTAGGCCTCTTTTGCAAATGCAGGCTTTTAGGACGGCTATTGGTAACTCCCAACTTCTGGATCGGGTGAAGAACCAACGGGCAGGTAGCTCCACATGGAT
CCATCTGGATAGATTCCTTCTAAGTCATGTTGTGGTTTGCTTCTATAGGGACTTTTGTGTGGACCATTTGGGCTTGATGGAGGTGTCTGACCATCGCCCCATTCTAGCAA
GTTGGGACAGAGGAAGAAGAGGTGCACCAGAGGGGAGATGCCTTCGACCTGTCAGATTTGAGGAAGGTTGGACTCGATTCCCGGAGTGTAAGGATATCATTAAGCAAATA
TGGCAAGCCACAGGGAGAGACAGATCCCTCTCACACCAGGCAAGAGTGCAGGCCTACTTGCGCATACTGGCGACAAAGGTCGAGAGAGGATTGGTTGGTGTGGGGGGGGG
GACAGGGATTCTAAGTGGTTTCATACCTGAGCATCTCAACGGCATCAGTGAAAAAAAATTGAAGGCCTCGAGGATAGTGCAGGCCAATGGCAGTCAGATCCGAAACCGAG
CCTTACTTAGGGAGTTCACTCGGGTGGAGGTAGAGGCAGCTTTGAAAGGCATGAGCCTCACGAAAGCCCCAGGCCCCAATGGGGTTAATGCTTTGTTTTATCAACAACAC
TGGGACTTGATTAAGGATGAGACTTCTGCACTTTGCCTCAACATTCTGAATGGAGGGGACTCTGTCAAGGAATTGAATCATACATATCTAGTGTTGGTCCCAAAGATAAC
GGGTCGTAGGTCTGTTAAGGATTTTCGACCAATCAGCCTTTGTAGTGTTGTCTATAAAATTTGTGCCAATGCAATCACGAGACTAAAAGGAGTGCTACACTCTGTGGTTT
CTACTGAGCAGTCGACCTTTGTGTCTGGGCGATTGATCTCAGACAATGTGATCATCAGTGGAATGGGGCTTTCTAAGGTAGATGATGCTTCAGATGGGGTTTCAAACAAG
ATGGGTCGAGTTGGTGATGAGTTGTGTGGAGTTAGTTCGGTTCTCAGTTTTGCTGAATGGGAAGCCATCGTCAGAATTTCTGCCTTCCTGGGGGTTCGGCAGGGAGATCC
TCTTTCTCCATATCTCTTTATTTTTGTGCGCAGGGTTATTAATTATGATAAATCTATGTTCATGGTTAGCAGGAATACTAACCCAGACATGCAGCATTTCATTCGGAATA
CATTGGGGGTGCTTCAGACGCACTCACTTGGTCGATATTTGGGGCTCCCTTCACAGACTGCTTGGGCAAAGTGTGCAATCTTTCGATCGATTAGGGATCGAGTTTGGAAG
GTTTTACGGGGATGGAAGGAAAAAATGTTTTTGGTGGGTGGAAAGGAGGTCCTCGTCAAGGCCGTTGCTCAGGCCATTCCCACCTACACGATGAGTTGTTTTAAGCTACC
GGTTTCCCTATGTACTGAGATAAATGGGTTGTGTGCTAATTTTTATGGGGGTCAACGGATGCTGGGCAAAAGATCCATTGGCGTAGCTGGAACCGATTGTGTCGCCACAA
GCTGTTGGGTGGCATGGGGTTTCGCGATTTATGCATTTTTAATCAGGCCATGCTGGCTAAGTAGTGCTAGTGTCTATCCCAGGACCAGTCTAGCCTTCTATATAGAGTTT
TGCGAGGGAGGTACTTTCGTACAGGTTCTTTTCTTCGAGCGACATTGGGGGCGGGAAGTGTTTAGACGAGGTATTAGGTGGCGGGTTGGATCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTCCAAACGCGTGCGATGTTCCATATTGATGACCGGTTCGGTTTGGTCCGGTTCAGTTCATCGATTCGGTTCGACTCCTTACACCCCTACGTGATATGGAATAA
TGAGTTGAATGTTAACATTTTCTCTTATTCTGCAGGGCATATTAATGTTATGATCAAGGCAACTTCGGAGGATTGTCGGTTCACGGGGTTTTATGGAAACCCAGCCACGA
CACTTCGGTCTGCATCGTGGCAATTGCTGGAGCGCCTGAGCTATTTGTCCCCTCTCCCATGGGTTGTGGGTGGGGACTTCAACGAAATACTTGCAGATTCTGAAAAGTCG
GGTGGGGCTCCTAGGCCTCTTTTGCAAATGCAGGCTTTTAGGACGGCTATTGGTAACTCCCAACTTCTGGATCGGGTGAAGAACCAACGGGCAGGTAGCTCCACATGGAT
CCATCTGGATAGATTCCTTCTAAGTCATGTTGTGGTTTGCTTCTATAGGGACTTTTGTGTGGACCATTTGGGCTTGATGGAGGTGTCTGACCATCGCCCCATTCTAGCAA
GTTGGGACAGAGGAAGAAGAGGTGCACCAGAGGGGAGATGCCTTCGACCTGTCAGATTTGAGGAAGGTTGGACTCGATTCCCGGAGTGTAAGGATATCATTAAGCAAATA
TGGCAAGCCACAGGGAGAGACAGATCCCTCTCACACCAGGCAAGAGTGCAGGCCTACTTGCGCATACTGGCGACAAAGGTCGAGAGAGGATTGGTTGGTGTGGGGGGGGG
GACAGGGATTCTAAGTGGTTTCATACCTGAGCATCTCAACGGCATCAGTGAAAAAAAATTGAAGGCCTCGAGGATAGTGCAGGCCAATGGCAGTCAGATCCGAAACCGAG
CCTTACTTAGGGAGTTCACTCGGGTGGAGGTAGAGGCAGCTTTGAAAGGCATGAGCCTCACGAAAGCCCCAGGCCCCAATGGGGTTAATGCTTTGTTTTATCAACAACAC
TGGGACTTGATTAAGGATGAGACTTCTGCACTTTGCCTCAACATTCTGAATGGAGGGGACTCTGTCAAGGAATTGAATCATACATATCTAGTGTTGGTCCCAAAGATAAC
GGGTCGTAGGTCTGTTAAGGATTTTCGACCAATCAGCCTTTGTAGTGTTGTCTATAAAATTTGTGCCAATGCAATCACGAGACTAAAAGGAGTGCTACACTCTGTGGTTT
CTACTGAGCAGTCGACCTTTGTGTCTGGGCGATTGATCTCAGACAATGTGATCATCAGTGGAATGGGGCTTTCTAAGGTAGATGATGCTTCAGATGGGGTTTCAAACAAG
ATGGGTCGAGTTGGTGATGAGTTGTGTGGAGTTAGTTCGGTTCTCAGTTTTGCTGAATGGGAAGCCATCGTCAGAATTTCTGCCTTCCTGGGGGTTCGGCAGGGAGATCC
TCTTTCTCCATATCTCTTTATTTTTGTGCGCAGGGTTATTAATTATGATAAATCTATGTTCATGGTTAGCAGGAATACTAACCCAGACATGCAGCATTTCATTCGGAATA
CATTGGGGGTGCTTCAGACGCACTCACTTGGTCGATATTTGGGGCTCCCTTCACAGACTGCTTGGGCAAAGTGTGCAATCTTTCGATCGATTAGGGATCGAGTTTGGAAG
GTTTTACGGGGATGGAAGGAAAAAATGTTTTTGGTGGGTGGAAAGGAGGTCCTCGTCAAGGCCGTTGCTCAGGCCATTCCCACCTACACGATGAGTTGTTTTAAGCTACC
GGTTTCCCTATGTACTGAGATAAATGGGTTGTGTGCTAATTTTTATGGGGGTCAACGGATGCTGGGCAAAAGATCCATTGGCGTAGCTGGAACCGATTGTGTCGCCACAA
GCTGTTGGGTGGCATGGGGTTTCGCGATTTATGCATTTTTAATCAGGCCATGCTGGCTAAGTAGTGCTAGTGTCTATCCCAGGACCAGTCTAGCCTTCTATATAGAGTTT
TGCGAGGGAGGTACTTTCGTACAGGTTCTTTTCTTCGAGCGACATTGGGGGCGGGAAGTGTTTAGACGAGGTATTAGGTGGCGGGTTGGATCCTGA
Protein sequenceShow/hide protein sequence
MKFQTRAMFHIDDRFGLVRFSSSIRFDSLHPYVIWNNELNVNIFSYSAGHINVMIKATSEDCRFTGFYGNPATTLRSASWQLLERLSYLSPLPWVVGGDFNEILADSEKS
GGAPRPLLQMQAFRTAIGNSQLLDRVKNQRAGSSTWIHLDRFLLSHVVVCFYRDFCVDHLGLMEVSDHRPILASWDRGRRGAPEGRCLRPVRFEEGWTRFPECKDIIKQI
WQATGRDRSLSHQARVQAYLRILATKVERGLVGVGGGTGILSGFIPEHLNGISEKKLKASRIVQANGSQIRNRALLREFTRVEVEAALKGMSLTKAPGPNGVNALFYQQH
WDLIKDETSALCLNILNGGDSVKELNHTYLVLVPKITGRRSVKDFRPISLCSVVYKICANAITRLKGVLHSVVSTEQSTFVSGRLISDNVIISGMGLSKVDDASDGVSNK
MGRVGDELCGVSSVLSFAEWEAIVRISAFLGVRQGDPLSPYLFIFVRRVINYDKSMFMVSRNTNPDMQHFIRNTLGVLQTHSLGRYLGLPSQTAWAKCAIFRSIRDRVWK
VLRGWKEKMFLVGGKEVLVKAVAQAIPTYTMSCFKLPVSLCTEINGLCANFYGGQRMLGKRSIGVAGTDCVATSCWVAWGFAIYAFLIRPCWLSSASVYPRTSLAFYIEF
CEGGTFVQVLFFERHWGREVFRRGIRWRVGS