CuGenDBv2

Lag0031590 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031590
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase
Genome location: chr11:10469551..10480513
RNA-Seq Expression: Lag0031590
Synteny: Lag0031590
Gene Ontology terms: NA
InterPro domains:
IPR005162 - Retrotransposon gag domain
IPR015410 - Domain of unknown function DUF1985
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_017233063.1 PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus]; e value: 1.3e-70; % identity: 32.81
Query:  FELDLEIERTFKRKGESSVDNKIKWITYRVSHRAEN---PIL-----VANDRARAIRAYAFLMFDELNPGIARPQIEA----------------------
        F  D EIERTF R+ ++    KIK     +     N   PI+     + +D+ RAIR YA   F+ELN GI RP I+A                      
Subjt:  FELDLEIERTFKRKGESSVDNKIKWITYRVSHRAEN---PIL-----VANDRARAIRAYAFLMFDELNPGIARPQIEA----------------------

Query:  --------------------------------------------WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFK
                                                    WLNS   GS+ TWN+L EKFLSKYFPPN NAKLR+EI  F+Q +D +  +AWERFK
Subjt:  --------------------------------------------WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFK

Query:  ELLRKCPHHGLPHCIQMETFYNGLNGATQV----------------------------------------------------------------------
        ELLRKCPHHG+ HCIQMETFYNGLN  T++                                                                      
Subjt:  ELLRKCPHHGLPHCIQMETFYNGLNGATQV----------------------------------------------------------------------

Query:  FSHQQPPAVEPTTMVKQVAEEARVYCGEYHNYEFFPSNPAFVFFVG------------------------------------------------KASRET
          + Q      ++ + Q    + V+CGE H Y+  PSNP  VF++G                                                +A +  
Subjt:  FSHQQPPAVEPTTMVKQVAEEARVYCGEYHNYEFFPSNPAFVFFVG------------------------------------------------KASRET

Query:  SL-----------------------------------------------------DTEHPRREGKEHVKAMTLRSGKPLEERKETSKTQD-IEKNCDKNV
        SL                                                     DTE P+  G EH KAMTL+SGK L      +K  D +E + ++ +
Subjt:  SL-----------------------------------------------------DTEHPRREGKEHVKAMTLRSGKPLEERKETSKTQD-IEKNCDKNV

Query:  VVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPL-PFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGE
          +KE         S ND         V PP  S   ++ P  PFPQR +   Q+ QFKKFL++LKQLHINIPLVEA+EQMPNY KF++DILTKKRRLGE
Subjt:  VVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPL-PFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGE

Query:  FETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELG
        FETV+LT+ECS+ L + LPTK KDPGSFTIP +IG    G
Subjt:  FETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELG

XP_017970395.1 PREDICTED: uncharacterized protein LOC108660654 [Theobroma cacao]; e value: 5.6e-71; % identity: 27.71
Query:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFS
        L P   R + + WLNS   G I TW ELA+KFLSK+F P + AKLR++I  F Q +  +  EAWERFKE L++CPHH LP  +Q++TFYNGL G+ +   
Subjt:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFS

Query:  HQQPPAVEPTTMVKQVAEEARVYCGE--YHNYEFFPSNPAFVFFVGK-------------ASRETSLDT------------------------EHPRREG
             A     ++ + A +A     E   +NY++          VG              A+    +DT                         +  +  
Subjt:  HQQPPAVEPTTMVKQVAEEARVYCGE--YHNYEFFPSNPAFVFFVGK-------------ASRETSLDT------------------------EHPRREG

Query:  KEHVKAMTLRSGKPLEERKE-----TSKTQDIEKNCDKNVVVE-KELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKK
        KE  +A+TLR+GK +E   E      ++  D E+ C+K   VE KE    +  G S                 + PPP     PFPQR +    + QF+K
Subjt:  KEHVKAMTLRSGKPLEERKE-----TSKTQDIEKNCDKNVVVE-KELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKK

Query:  FLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGG----------------------KE
        F+ + K+LHINIP  EA+EQMP+Y KFL+ IL+KK +L EFET+SLTEECSAIL N LP K KD GSFTIP +IG                       ++
Subjt:  FLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGG----------------------KE

Query:  LG----------------------------------------FWITQL------------------------------LRQQYRIRQTSIWKIME-----
        LG                                        F I  +                              LR Q +    +I+K ++     
Subjt:  LG----------------------------------------FWITQL------------------------------LRQQYRIRQTSIWKIME-----

Query:  ----------------------------------------------------RFRSKK-----------APPIKPSLIEAPTLDLKPLPDHLK-------
                                                            RFR+              P  KPS+ E   L+LKPLP HL+       
Subjt:  ----------------------------------------------------RFRSKK-----------APPIKPSLIEAPTLDLKPLPDHLK-------

Query:  --------------YVYLGEDSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGW--------------------------------KTYYCFLDGYSR
                       +Y   DS+WVSP+QCV KKG + VV+N +NELIPTRTVTGW                                K YYCFL+GYS 
Subjt:  --------------YVYLGEDSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGW--------------------------------KTYYCFLDGYSR

Query:  YNQITIAPEDQEKTTFTCPYGTLAF--------------------------GECLLAFQ-----------------------------------------
        YNQI IAP+DQEKTTFTCPYGT AF                           +CL AF                                          
Subjt:  YNQITIAPEDQEKTTFTCPYGTLAF--------------------------GECLLAFQ-----------------------------------------

Query:  ---------CSNNISAVKA------------------------------------------------------------FETLKAALISTPILCAPNWNL
                  SN I   KA                                                            F+ LK  LI  PI+ +P+W  
Subjt:  ---------CSNNISAVKA------------------------------------------------------------FETLKAALISTPILCAPNWNL

Query:  PFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV
        PFE+MCDASD AVG  LG ++ +   ++           S  L ++  +Y+  +++L AV
Subjt:  PFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV

XP_030497826.1 LOW QUALITY PROTEIN: uncharacterized protein LOC115713483 [Cannabis sativa]; e value: 1.5e-71; % identity: 26.58
Query:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVF-
        L P   R + ++W  S    SI TW ELA KFLSK+FPP + AKLR++I  F Q +  +  EAWERFK+LLRKCP+HG+   +Q+  FYNGL   T+   
Subjt:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVF-

Query:  ------SHQQPPAVEPTTMVKQVAEEARVYCGE---------------------YHNYEFFPSNPAFVFFVGKASRETSLDTEH----------------
              +  +  A E   +++++A   + +  E                       N       P    + G +  +T L  +                 
Subjt:  ------SHQQPPAVEPTTMVKQVAEEARVYCGE---------------------YHNYEFFPSNPAFVFFVGKASRETSLDTEH----------------

Query:  -------PRREG----------KEHVKAMTLRSGKPLEERKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLP
                R +G          KE+ KA+TLRSGK  +   +     D               E  Q A          G       P +S   ++  +P
Subjt:  -------PRREG----------KEHVKAMTLRSGKPLEERKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLP

Query:  FPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKE------
        +PQR R  N D QF KFLE+ ++LHINIP  EA+EQMP+Y KF+++IL+KKR++ +FETV+LTEECSAIL   LP K KDPGSFTIP +IG  E      
Subjt:  FPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKE------

Query:  -LGFWITQLLRQQYRIRQTSIWK-------------------------------------IMERFRSKKAPPI---------------------------
         LG  I  +    ++  Q    K                                     +++       P I                           
Subjt:  -LGFWITQLLRQQYRIRQTSIWK-------------------------------------IMERFRSKKAPPI---------------------------

Query:  --------------------------------------------KPSLIEAPTLDLKPLPDHLKYVYLGE------------------------------
                                                     PS  + P L+LK LPDHL+Y YLGE                              
Subjt:  --------------------------------------------KPSLIEAPTLDLKPLPDHLKYVYLGE------------------------------

Query:  -------------------------------------------------------------DSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGWK--
                                                                     DS WVSP+Q VPKKG +TVV N+ NELIPTRTVTGW+  
Subjt:  -------------------------------------------------------------DSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGWK--

Query:  ------------------------------TYYCFLDGYSRYNQITIAPEDQEKTTFTCPYGTLA--------------FGECLLAF-------------
                                       YYCFLDGYS Y+QI IAPEDQEKTTFTCPYGT A              F  C++A              
Subjt:  ------------------------------TYYCFLDGYSRYNQITIAPEDQEKTTFTCPYGTLA--------------FGECLLAF-------------

Query:  -----------QCSNNISAV----------------------------------------------------------------KAFETLKAALISTPIL
                   QC +N+  V                                                                +AF+ LK  LIS PI+
Subjt:  -----------QCSNNISAV----------------------------------------------------------------KAFETLKAALISTPIL

Query:  CAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV
          PNW LPFE+MCDASD A+G  LG +  +  R +           S  L  + ++Y+  ++++ A+
Subjt:  CAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]; e value: 7.4e-71; % identity: 33.44
Query:  VSHRAENPILVANDRARAIRAYAFLMFDELNPGIARPQIEA-----------------------------------------------------------
        ++ +  +PI++ +DRARAIR YA  MF+ELNPGI RP+I+A                                                           
Subjt:  VSHRAENPILVANDRARAIRAYAFLMFDELNPGIARPQIEA-----------------------------------------------------------

Query:  -------WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQV-------
               WLN+ +P S+  WN+ AEKFL KYFPP RNAK RSEI+ F QLED + S+AWERFKELLRKCPHHG+PHCIQMETFYNGLN  +Q+       
Subjt:  -------WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQV-------

Query:  ------------------------FSHQQPPA-------------------------------------VEPTTMVKQVAEEARVYCGEYHNYEFFPSNP
                                +S+ + P                                      ++P   + Q  + + V+C E H +E  PSNP
Subjt:  ------------------------FSHQQPPA-------------------------------------VEPTTMVKQVAEEARVYCGEYHNYEFFPSNP

Query:  AFVFFVG---------------------------------------------------------------------------------------------
          V ++G                                                                                             
Subjt:  AFVFFVG---------------------------------------------------------------------------------------------

Query:  --------KASRETSL--DTEHPRREGKEHVKAMTLRSGKPLEERKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPY
                KA  + SL  DTE+PRR+GKE  K++ LRSGK L+  +E  K      +   +  + K  ++ Q    +     A G   + +         
Subjt:  --------KASRETSL--DTEHPRREGKEHVKAMTLRSGKPLEERKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPY

Query:  VPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKEL
         PPLPFPQR +   QDGQFKKFL++LKQLHINIPLVEA+EQMPNY KFL+DILTKKRRLGEFE+  LTE   A+L N +P K KDPGSFTIP+SIGG++L
Subjt:  VPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKEL

Query:  G
        G
Subjt:  G

XP_030508947.1 uncharacterized protein LOC115723603 [Cannabis sativa]; e value: 3.0e-72; % identity: 35.96
Query:  SHRAENPILVANDRARAIRAYAFLMFDELNPGIARPQIE-------------------------------------------------------------
        +H  +NPI +A+DRARA R YA L+F+ELNPG  RP+I+                                                             
Subjt:  SHRAENPILVANDRARAIRAYAFLMFDELNPGIARPQIE-------------------------------------------------------------

Query:  -----AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQV--------
             AWLN+  P  + +WN+LAEKFL KYFPP RNA  RSEI+ F+QLED T S+AWERFKELLRKCPHHG+PHCIQ+ETFYNGLN A+++        
Subjt:  -----AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQV--------

Query:  -----------------------FSHQQPP-----------------------------------AVEPTTMVKQVAEEARVYCGEYHNYEFFPSNPAF-
                               +S  + P                                   +V+P   + Q A+ + VYCG+ H +E +PSNPA  
Subjt:  -----------------------FSHQQPP-----------------------------------AVEPTTMVKQVAEEARVYCGEYHNYEFFPSNPAF-

Query:  ----------------------VFFVG-------------KASRETSL----------------------DTEHPRREGKEHVKAMTLRSGKPLEER---
                               F  G             + S+ +SL                      DT +PRR+GK+     TLRSGK LE     
Subjt:  ----------------------VFFVG-------------KASRETSL----------------------DTEHPRREGKEHVKAMTLRSGKPLEER---

Query:  ---KETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQM
           KE S  Q   +   K  +   E+ S         D         VE     PPP     PFPQR +    DGQF++FL++LKQL+INIPL EA+EQM
Subjt:  ---KETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQM

Query:  PNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELG
        P Y KFL+DILT+KRRLGEFETV+LTE  SA+L + +P K KDPGSFTIP+SIGG+++G
Subjt:  PNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELG

TrEMBL top hits (e value, % identity, alignment)
A0A2G9HWF8 Reverse transcriptase; e value: 7.7e-66; % identity: 25.58
Query:  WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGA
        W  S    SI TW +L E+F+SK+F P + A LR+EI+ FRQ    T  EAW RF+++LR CP+H +P  IQ+ TFY+G                 L+G 
Subjt:  WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNG-----------------LNGA

Query:  TQ---------VFSHQQ-------PPAVEPTTMVKQV-AEEARV----------YCGEYHNYEFFPSNPAFVFFVGKASRETS-----------------
        T          V +H +       PP       V QV A  A++           CGE H  +  P +   + FV  A +  +                 
Subjt:  TQ---------VFSHQQ-------PPAVEPTTMVKQV-AEEARV----------YCGEYHNYEFFPSNPAFVFFVGKASRETS-----------------

Query:  --------------------LDTEHPRREGKEHVKAMTLRSGKPLEE-RKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVS
                                +PR++GK   +A+TLR+G+ L+E  KE +K+++ E   ++    EKE+E+                     P  VS
Subjt:  --------------------LDTEHPRREGKEHVKAMTLRSGKPLEE-RKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVS

Query:  PPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPG---------
         P  + P PFPQR +      QF KFLE+ K+LHINIP  EA+EQMP+Y KF++DIL+KKRRLG++ETV+LTEECSAI+ N LP K KDPG         
Subjt:  PPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPG---------

Query:  ------------------------------SFT--------------------------------IPVSIGGKELGFWITQL--------LRQQYRIRQT
                                      S T                                +P+ +G   L    T +        +R Q +    
Subjt:  ------------------------------SFT--------------------------------IPVSIGGKELGFWITQL--------LRQQYRIRQT

Query:  SIWKIM---------------------------------------------------------ERFRS-------KKAPP--IKPSLIEAPTLDLKPLPD
        +++K M                                                         + F+S       + AP   +KPS+ E PTL+LKPLP 
Subjt:  SIWKIM---------------------------------------------------------ERFRS-------KKAPP--IKPSLIEAPTLDLKPLPD

Query:  HLKYVYLGE-------------------------------------------------------------------------------------------
        HL Y YLGE                                                                                           
Subjt:  HLKYVYLGE-------------------------------------------------------------------------------------------

Query:  DSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGW--------------------------------KTYYCFLDGYSRYNQITIAPEDQEKTTFTCPY
        D +W+SP+QCVPKKG +TVV N  NE IPT+TVTGW                                K +YCFLDGYS YNQI IAPEDQEKTTFTCPY
Subjt:  DSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGW--------------------------------KTYYCFLDGYSRYNQITIAPEDQEKTTFTCPY

Query:  GTLA--------------FGECLLAF------------------------QCSNNISAV-----------------------------------------
        GT A              F  C++A                         +C NN+S V                                         
Subjt:  GTLA--------------FGECLLAF------------------------QCSNNISAV-----------------------------------------

Query:  ---------------------------------------------------------KAFETLKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDK
                                                                  AF+ LK  LIS PI+  P+W+ PFE+MCDASD A+G  LG +
Subjt:  ---------------------------------------------------------KAFETLKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDK

Query:  RQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV
        + +  R++           S  L  + ++Y+  +++L AV
Subjt:  RQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV

A0A6J1EQ90 uncharacterized protein LOC111436411; e value: 2.2e-68; % identity: 32.14
Query:  MSDPPRARFELDLEIERTFKR---KGESSVDNKIKWITY--RVSHRAENPILVAN-------------DRARAIRAYAFLMFDELNPGIARPQIE-----
        M+ P    F LD EIERTF+R   K +   +  I+ I    +++   ENP ++AN             DR RAIRAYA    +ELNP I RP+I+     
Subjt:  MSDPPRARFELDLEIERTFKR---KGESSVDNKIKWITY--RVSHRAENPILVAN-------------DRARAIRAYAFLMFDELNPGIARPQIE-----

Query:  --------------------------------------------------------------------AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNA
                                                                            +WLN+ APG+I +WN LAE FL KYFPP RNA
Subjt:  --------------------------------------------------------------------AWLNSFAPGSIRTWNELAEKFLSKYFPPNRNA

Query:  KLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ----------------------------------------------
        + ++EIV F+Q ED T SEA ERFKE+LRKCPHHGLPHCIQMETFYNGLN  T+                                              
Subjt:  KLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQ----------------------------------------------

Query:  ----------------VFSHQQPPAVEPTTMVK----------QVAEEARVYCGEYHNYEFFPSNPAFVFFVGKASRETSLDT------------EHPRR
                        V +  Q  A+   +M+K          Q A E+ VYCGE H ++  PSNPA +F+VG  + + +L               HP  
Subjt:  ----------------VFSHQQPPAVEPTTMVK----------QVAEEARVYCGEYHNYEFFPSNPAFVFFVGKASRETSLDT------------EHPRR

Query:  EGK-------------------------EHVKAMTLRSGKPLEERKETSKT--------------------QDIEKNCDKNVVVEKELESGQGAGGSNND
          K                          +        GK   + + TS+T                    Q   +N +  +  EK  E G        D
Subjt:  EGK-------------------------EHVKAMTLRSGKPLEERKETSKT--------------------QDIEKNCDKNVVVEKELESGQGAGGSNND

Query:  ARAFGSVPDVEPPY------VSPPP--------------YVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLG
         +       V+  +      V   P              Y P  PFPQR +   ++  F+KF++ILK++HINIPLVEA++QMPNY KFL+D+L  +R+  
Subjt:  ARAFGSVPDVEPPY------VSPPP--------------YVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLG

Query:  EFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELG
        EF+ VSL EECSAIL N +P K KDPGSFTIPVSIGGKELG
Subjt:  EFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELG

A0A6P6TF62 Reverse transcriptase; e value: 1.4e-62; % identity: 24.49
Query:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFS
        L P   R + + WL+S AP +  TW++L+  FL+KYFPP + AKLR +I GF Q+E  +  EAWERF++LLRKCPHHGLP  + ++TFYNGL+ +T+   
Subjt:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFS

Query:  HQQPPA-----VEPTTM-------------------VKQVAEEARVYC-----GEY--------------HNY---------------------------
                   +E  T+                   V   A  A + C     GE+              +NY                           
Subjt:  HQQPPA-----VEPTTM-------------------VKQVAEEARVYC-----GEY--------------HNY---------------------------

Query:  ---EFFPSN--------------PAFVFFVGKASRETS---------------------------LDTEHPRREG----------KEHVKAMTLRSGKPL
           +  P+N              P +   V K ++ TS                             + + R +G          KEHVKA+TLRSGK L
Subjt:  ---EFFPSN--------------PAFVFFVGKASRETS---------------------------LDTEHPRREG----------KEHVKAMTLRSGKPL

Query:  EERKET-SKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQ
        E+   +  K  + E+N +K    E  +E     G S  + R      + +P   +  P  P +PFPQR + N  D  F+KF+++ KQLHINIP  +AI Q
Subjt:  EERKET-SKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQ

Query:  MPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIG----------------------GKELGFW----------------
        +P+YAKFL++I+T+KR+L + ET++LTEECSAI+ N LP K KDPGSF+IP +IG                       ++LG                  
Subjt:  MPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIG----------------------GKELGFW----------------

Query:  -------------------------------------------------------------------------------------------ITQLLRQ--
                                                                                                   +TQ L Q  
Subjt:  -------------------------------------------------------------------------------------------ITQLLRQ--

Query:  ------QYRIRQTSIW-------KIMERFRSKKAPP---------------IKPSLIEAPTLDLKPLPDHLKYVYLGEDS--------------------
              +Y +  + +        + + ++ + +AP                 +PS IE P L+LKPLP HL+Y +LGE+S                    
Subjt:  ------QYRIRQTSIW-------KIMERFRSKKAPP---------------IKPSLIEAPTLDLKPLPDHLKYVYLGEDS--------------------

Query:  ---------NW-------VSPIQCVPK------------------------KGSVTVVSNKDNELIPTRTVTGWKT------------------------
                  W       +SP  C+ +                        KG +T +  K++ELIP+R V GW+                         
Subjt:  ---------NW-------VSPIQCVPK------------------------KGSVTVVSNKDNELIPTRTVTGWKT------------------------

Query:  --------YYCFLDGYSRYNQITIAPEDQEKTTFTCPYGTLA--------------FGECLLAF------------------------QCSNNI------
                +YCFLDG+S YNQI IAPEDQEKTTFTCPYGT A              F  C++A                          C +N+      
Subjt:  --------YYCFLDGYSRYNQITIAPEDQEKTTFTCPYGTLA--------------FGECLLAF------------------------QCSNNI------

Query:  --------------------------------------------------------------------------------------------SAVKAFET
                                                                                                    + + AFE 
Subjt:  --------------------------------------------------------------------------------------------SAVKAFET

Query:  LKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV
        LK  LIS PI+ +P+W+LPFE+MCDASD AVG  LG K++  +  +           S LL ++ ++Y+  +++L AV
Subjt:  LKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV

A0A6P6UJL6 Reverse transcriptase; e value: 1.8e-67; % identity: 29.32
Query:  PGIARPQIEA-------------WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFY
        PGI   QI+              WL    PGSI TW++L +KFL KYFP +R A LR EI G +Q    +  E WERFK+L  KCP H +   + ++ FY
Subjt:  PGIARPQIEA-------------WLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFY

Query:  NGL---------------------NGATQVF-----SHQQPPAVE--PTTMVKQV-AEEARVYCGEYHNYEFFPSNPAFVFFVGKASRETSLDTEHPRR-
          L                      GA ++      + QQ  + E  PT  V +V     +    E  ++     N      +     E+ +  + P + 
Subjt:  NGL---------------------NGATQVF-----SHQQPPAVE--PTTMVKQV-AEEARVYCGEYHNYEFFPSNPAFVFFVGKASRETSLDTEHPRR-

Query:  -EGKEHVKAMTLRSGKPLEERKET-SKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQR-QRPNNQDGQFKKF
            ++V AMTLRSGK ++  +   SK +D E+       +EKELE  +G G  N        VP    P         P PFP R ++P  QD + K+ 
Subjt:  -EGKEHVKAMTLRSGKPLEERKET-SKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQR-QRPNNQDGQFKKF

Query:  LEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELGFWITQL---------------
        LEI +++ INIPL++AI+Q+P YAKFL D+   ++RL   E V + E  S IL   LP K  DPG FTIP  IG   +G  +  L               
Subjt:  LEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELGFWITQL---------------

Query:  --------------------------------------------LRQQYRIRQTSIWKIME---------------RFRSKKAP--------PIKPSLIE
                                                    L + +    TS  ++ E               + R   AP         + PS+++
Subjt:  --------------------------------------------LRQQYRIRQTSIWKIME---------------RFRSKKAP--------PIKPSLIE

Query:  APTLDLKPLPDHLKYVYLGE--------------------------------------------------------------------------------
        AP L+LKPLP HLKYVYLGE                                                                                
Subjt:  APTLDLKPLPDHLKYVYLGE--------------------------------------------------------------------------------

Query:  -----------DSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGW--------------------------------KTYYCFLDGYSRYNQITIAPE
                   DS WVSP+Q VPKK  VTV SN++ EL+P R  TGW                                + YYCFLDG+S Y QI IAP+
Subjt:  -----------DSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGW--------------------------------KTYYCFLDGYSRYNQITIAPE

Query:  DQEKTTFTCPYGTLA-----FGEC--LLAFQCSNNISAVKAFETLKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDP
        DQEKTTFTCP+GT A     FG C     FQ       + AF  LK  L ++PI+ +P+WNLPFE+MCDA+D AVG  LG +        +      +  
Subjt:  DQEKTTFTCPYGTLA-----FGEC--LLAFQCSNNISAVKAFETLKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDP

Query:  TSSLLKQSVISYSFPDEQLFAV
         S  L  + ++YS  +++L AV
Subjt:  TSSLLKQSVISYSFPDEQLFAV

A0A6P6X9H2 Reverse transcriptase; e value: 4.0e-62; % identity: 24.4
Query:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFS
        L P   R + + WL+S AP +  TW++L+  FL+KYFPP + AKLR +I GF Q+E  +  E WERF++LLRKCPHHGLP  + ++TFYNGL+ +T+   
Subjt:  LNPGIARPQIEAWLNSFAPGSIRTWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFS

Query:  HQQPPA-----VEPTTM-------------------VKQVAEEARVYC-----GEY--------------HNY---------------------------
                   +E  T+                   V   A  A + C     GE+              +NY                           
Subjt:  HQQPPA-----VEPTTM-------------------VKQVAEEARVYC-----GEY--------------HNY---------------------------

Query:  ---EFFPSN--------------PAFVFFVGKASRETS---------------------------LDTEHPRREG----------KEHVKAMTLRSGKPL
           +  P+N              P +   V K ++ TS                             + + R +G          KEHVKA+TLRSGK L
Subjt:  ---EFFPSN--------------PAFVFFVGKASRETS---------------------------LDTEHPRREG----------KEHVKAMTLRSGKPL

Query:  EERKET-SKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQ
        E+   +  K  + E+N +K    E  +E     G S  + R      + +P   +  P  P +PFPQR + N  D  F+KF+++ KQLHINIP  +AI Q
Subjt:  EERKET-SKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQRQRPNNQDGQFKKFLEILKQLHINIPLVEAIEQ

Query:  MPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIG----------------------GKELGFW----------------
        +P+YAKFL++I+T+KR+L + ET++LTEECSAI+ N LP K KDPGSF+IP +IG                       ++LG                  
Subjt:  MPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIG----------------------GKELGFW----------------

Query:  -------------------------------------------------------------------------------------------ITQLLRQ--
                                                                                                   +TQ L Q  
Subjt:  -------------------------------------------------------------------------------------------ITQLLRQ--

Query:  ------QYRIRQTSIW-------KIMERFRSKKAPP---------------IKPSLIEAPTLDLKPLPDHLKYVYLGEDS--------------------
              +Y +  + +        + + ++ + +AP                 +PS IE P L+LKPLP HL+Y +LGE+S                    
Subjt:  ------QYRIRQTSIW-------KIMERFRSKKAPP---------------IKPSLIEAPTLDLKPLPDHLKYVYLGEDS--------------------

Query:  ---------NW-------VSPIQCVPK------------------------KGSVTVVSNKDNELIPTRTVTGWKT------------------------
                  W       +SP  C+ +                        KG +T +  K++ELIP+R V GW+                         
Subjt:  ---------NW-------VSPIQCVPK------------------------KGSVTVVSNKDNELIPTRTVTGWKT------------------------

Query:  --------YYCFLDGYSRYNQITIAPEDQEKTTFTCPYGTLA--------------FGECLLAF------------------------QCSNNI------
                +YCFLDG+S YNQI IAPEDQEKTTFTCPYGT A              F  C++A                          C +N+      
Subjt:  --------YYCFLDGYSRYNQITIAPEDQEKTTFTCPYGTLA--------------FGECLLAF------------------------QCSNNI------

Query:  --------------------------------------------------------------------------------------------SAVKAFET
                                                                                                    + + AFE 
Subjt:  --------------------------------------------------------------------------------------------SAVKAFET

Query:  LKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV
        LK  LIS PI+ +P+W+LPFE+MCDASD AVG  LG K++  +  +           S LL ++ ++Y+  +++L AV
Subjt:  LKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G31150.1 Domain of unknown function (DUF1985); e value: 5.2e-06; % identity: 28.57
Query:  QCKPKRASELYFKIGGKILKFGLREFTLITGLNCGPLPQLGRDRLQESSRFK---NEYF---------DDGEGVRRKTLN---------------IVFKA
        Q   K+  EL+F  GG  ++F +REF ++TGL CG LP     +  + S++    N  F         D  E +++K L+               +V   
Subjt:  QCKPKRASELYFKIGGKILKFGLREFTLITGLNCGPLPQLGRDRLQESSRFK---NEYF---------DDGEGVRRKTLN---------------IVFKA

Query:  IKHGVEADLVKMAHYQELFNTYSWGRVAFTLSI
         +  V  D V+M +  + F  Y WGR AF  +I
Subjt:  IKHGVEADLVKMAHYQELFNTYSWGRVAFTLSI


Sequences
CDS sequence
ATGACGGTTTCTCAAGCAATCCCGAATCTCCCGATTAACGAATTACCTAAGTATGCAATTGATGGTCGAGACGCTGTTGACATAGCGTCTCGACGCTGTGTCAATGCGCG
CCCTAGTAAATGCGGAAAGTTTCGCAGCGTCGAGACGCTGTGGAGCACAATAAATTGCGCTATCAAGGAGGCTTCTAGAAAACAGAAAAGTGGAAAAAAGGACAAAAAGA
TGAAGAAGGCCATGGTTGAAGAAGGTGACACTGTTCGAGTGCATCAATGCAAACCCAAACGAGCATCAGAGTTATATTTCAAGATTGGTGGAAAAATTCTAAAGTTTGGT
CTACGGGAGTTCACGTTAATTACGGGATTGAATTGTGGCCCATTGCCACAACTTGGCAGAGACAGACTACAAGAATCTTCCAGATTCAAGAATGAGTATTTTGACGACGG
CGAGGGGGTCAGAAGAAAGACCCTTAATATAGTATTCAAAGCAATCAAGCATGGGGTTGAGGCAGACCTCGTAAAGATGGCACATTACCAAGAGTTGTTCAACACCTACT
CTTGGGGGCGTGTCGCCTTCACACTATCGATCAACTATATGCAGAAAGCATCAAATTTATTGCTGAGCGACTTGAGGGAGCAAAATCTGTGTTGGAGCAAAGCAAGGAGC
AAAACTGCCACGTCACAGCTTGTTAGGCAATTTGATGAACTAAATTCTGTGATTGTTTGGTGCATGAGCGATCCGCCTCGGGCAAGGTTCGAGCTTGATCTAGAAATCGA
GAGGACATTCAAGAGAAAAGGAGAGAGCAGCGTAGACAACAAAATCAAATGGATAACGTACCGCGTCTCCCACAGGGCTGAGAATCCTATCTTGGTAGCAAACGATAGGG
CCAGAGCCATTCGAGCGTATGCTTTTCTAATGTTTGATGAGTTAAATCCAGGGATTGCACGCCCTCAAATTGAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGG
ACATGGAATGAGTTAGCAGAAAAATTTCTTAGTAAATATTTCCCACCAAATAGGAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGGAACTTT
TAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATCCAAATGGAAACATTTTACAATGGGTTAAATGGAGCAACCC
AAGTGTTTAGTCATCAGCAACCGCCAGCTGTGGAGCCTACTACGATGGTGAAACAAGTTGCAGAGGAAGCACGTGTCTATTGTGGTGAATATCACAACTACGAGTTTTTC
CCCAGCAATCCAGCTTTTGTGTTTTTTGTAGGCAAGGCCTCAAGGGAAACTTCCTTAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCATGTAAAGGCAATGACTCT
TAGGAGTGGTAAGCCACTAGAGGAAAGAAAAGAGACTAGTAAAACTCAGGATATAGAAAAGAATTGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAAG
GTGCTGGAGGCAGCAATAATGATGCTAGAGCATTTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGA
CAAAGGCCTAATAATCAGGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAATGCCTAATTATGC
TAAATTTCTTGAGGATATTTTAACTAAAAAGAGGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTTAATAATGGGCTACCAACCAAGG
CTAAGGATCCAGGATCATTTACTATACCTGTGTCAATAGGTGGAAAAGAGTTAGGATTCTGGATTACACAATTGTTGAGACAACAATACAGGATTCGGCAAACAAGCATT
TGGAAGATCATGGAGAGATTTAGATCAAAGAAAGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCAGATCATCTAAAGTATGT
GTATCTTGGGGAAGACAGCAATTGGGTAAGCCCTATCCAATGTGTTCCTAAGAAAGGAAGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTGATCCCAACCAGGACAG
TAACTGGCTGGAAGACCTACTACTGTTTCTTAGATGGTTATTCTAGGTATAACCAGATTACCATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTGCCCTTATGGG
ACGCTTGCTTTTGGCGAATGCCTTTTGGCATTTCAATGCTCCAACAACATTTCAGCGGTGAAGGCTTTTGAAACTTTAAAGGCTGCTTTAATCTCAACACCCATTCTTTG
TGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCGGTAGGAATTCGACTTGGAGATAAAAGACAAGAAGAGATCAGAAATGTCATTGCAG
ATCATTTGTCTCGTCTTGATCCAACATCATCTTTGCTGAAGCAATCTGTCATTTCATATTCTTTTCCAGATGAACAACTTTTTGCTGTTGAGGTGATAAGCAAAGGAAAT
CCTGTAGCAATGTCACTCTTCGCCGTATGGAGGCCATTTCAGCGGTTAGAGGACAACTATGAGGATTTTGCATTGTGGATTCTTTGGCTTCCTTATTTAAGGATGCACAT
TGCAGCGTCGAGACGCTTAGCAAGGCAGCGTCGCGACGCTGCCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGCAGCGTCGCGACGCTTAGCAAGGCAGCGTCGCGACGCT
ACCCATATTCCAGGCCTATAAATAGGCGCCCTTGTGGGAAATTTTGGGGATTTTGGGAAGCAATCATAGAGGCTCGAGGAGTAGTCAGCGGGGGCTCCAAGCGTCAGCAA
GGGGTTTTCACGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACGGGCAGCATCAGTACTTGGGACGAGCTTGCCCAGGCTTTTCTGACAAAATTTTTTCCACCAGC
TAAGACTACCAAGCTGCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTATACGAAGCGTGGGAAAGATATAAGGAAATGCTTAGGCGATGCCCCCAAC
ACGGATATCCTGATTGGCTTCAGTTATCAATGGCCGACGGAGAGAGGAACAGTTACAAAAAGGCTGGATTATATGAATTGGATGAGTCAAGTTCACTGAAAGCGCAACTG
GCATCTCTGACCAATGCACTAAACAAATTGACGTCATCTGAGGTGGTTAAGTCCATTTCCACCTTAGCTGAAGGACATTCGAAGAAAGAAGCCACCCCCAGGTTTTGCAT
CAACAGTACTCCTGAGAAGAAAAATAATCTGGAGGAGATGGTGGCTTTATTCATCAAGGAACAAAGAATATTGAATGTGAATCTCCAGACATCAGTAAACAACCACGACG
CAGCTCTAAAGAATATGGAAGTGCAGATAGGTCAGATTGCTTCAGCAGTAAATGCCCTGCAGAAGGGAAAATTTCCAAGCGATACTGAGCCTAACCCGAAAGAGCAGTGT
AAGATGGTGGTTCTGAGAAGTGGCAGAAGACTGGAGGACAGTTAG
mRNA sequence
ATGACGGTTTCTCAAGCAATCCCGAATCTCCCGATTAACGAATTACCTAAGTATGCAATTGATGGTCGAGACGCTGTTGACATAGCGTCTCGACGCTGTGTCAATGCGCG
CCCTAGTAAATGCGGAAAGTTTCGCAGCGTCGAGACGCTGTGGAGCACAATAAATTGCGCTATCAAGGAGGCTTCTAGAAAACAGAAAAGTGGAAAAAAGGACAAAAAGA
TGAAGAAGGCCATGGTTGAAGAAGGTGACACTGTTCGAGTGCATCAATGCAAACCCAAACGAGCATCAGAGTTATATTTCAAGATTGGTGGAAAAATTCTAAAGTTTGGT
CTACGGGAGTTCACGTTAATTACGGGATTGAATTGTGGCCCATTGCCACAACTTGGCAGAGACAGACTACAAGAATCTTCCAGATTCAAGAATGAGTATTTTGACGACGG
CGAGGGGGTCAGAAGAAAGACCCTTAATATAGTATTCAAAGCAATCAAGCATGGGGTTGAGGCAGACCTCGTAAAGATGGCACATTACCAAGAGTTGTTCAACACCTACT
CTTGGGGGCGTGTCGCCTTCACACTATCGATCAACTATATGCAGAAAGCATCAAATTTATTGCTGAGCGACTTGAGGGAGCAAAATCTGTGTTGGAGCAAAGCAAGGAGC
AAAACTGCCACGTCACAGCTTGTTAGGCAATTTGATGAACTAAATTCTGTGATTGTTTGGTGCATGAGCGATCCGCCTCGGGCAAGGTTCGAGCTTGATCTAGAAATCGA
GAGGACATTCAAGAGAAAAGGAGAGAGCAGCGTAGACAACAAAATCAAATGGATAACGTACCGCGTCTCCCACAGGGCTGAGAATCCTATCTTGGTAGCAAACGATAGGG
CCAGAGCCATTCGAGCGTATGCTTTTCTAATGTTTGATGAGTTAAATCCAGGGATTGCACGCCCTCAAATTGAGGCATGGTTAAATTCTTTTGCTCCAGGATCAATTAGG
ACATGGAATGAGTTAGCAGAAAAATTTCTTAGTAAATATTTCCCACCAAATAGGAATGCTAAATTGAGGAGTGAAATAGTAGGGTTTAGGCAACTTGAAGATGGAACTTT
TAGTGAGGCTTGGGAGAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCACATTGTATCCAAATGGAAACATTTTACAATGGGTTAAATGGAGCAACCC
AAGTGTTTAGTCATCAGCAACCGCCAGCTGTGGAGCCTACTACGATGGTGAAACAAGTTGCAGAGGAAGCACGTGTCTATTGTGGTGAATATCACAACTACGAGTTTTTC
CCCAGCAATCCAGCTTTTGTGTTTTTTGTAGGCAAGGCCTCAAGGGAAACTTCCTTAGATACTGAACACCCTAGAAGGGAAGGTAAGGAGCATGTAAAGGCAATGACTCT
TAGGAGTGGTAAGCCACTAGAGGAAAGAAAAGAGACTAGTAAAACTCAGGATATAGAAAAGAATTGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAAG
GTGCTGGAGGCAGCAATAATGATGCTAGAGCATTTGGTTCTGTTCCAGATGTGGAACCACCTTATGTGTCGCCCCCACCTTATGTACCACCTCTACCTTTTCCACAAAGA
CAAAGGCCTAATAATCAGGATGGTCAATTTAAGAAGTTTTTAGAGATTCTTAAGCAATTGCATATAAATATCCCTTTAGTAGAAGCTATAGAGCAAATGCCTAATTATGC
TAAATTTCTTGAGGATATTTTAACTAAAAAGAGGAGGTTAGGTGAGTTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTTAATAATGGGCTACCAACCAAGG
CTAAGGATCCAGGATCATTTACTATACCTGTGTCAATAGGTGGAAAAGAGTTAGGATTCTGGATTACACAATTGTTGAGACAACAATACAGGATTCGGCAAACAAGCATT
TGGAAGATCATGGAGAGATTTAGATCAAAGAAAGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCAGATCATCTAAAGTATGT
GTATCTTGGGGAAGACAGCAATTGGGTAAGCCCTATCCAATGTGTTCCTAAGAAAGGAAGTGTCACTGTGGTGAGCAATAAAGACAATGAGTTGATCCCAACCAGGACAG
TAACTGGCTGGAAGACCTACTACTGTTTCTTAGATGGTTATTCTAGGTATAACCAGATTACCATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTGCCCTTATGGG
ACGCTTGCTTTTGGCGAATGCCTTTTGGCATTTCAATGCTCCAACAACATTTCAGCGGTGAAGGCTTTTGAAACTTTAAAGGCTGCTTTAATCTCAACACCCATTCTTTG
TGCACCTAATTGGAATTTACCATTTGAGGTAATGTGTGATGCGAGTGATGCTGCGGTAGGAATTCGACTTGGAGATAAAAGACAAGAAGAGATCAGAAATGTCATTGCAG
ATCATTTGTCTCGTCTTGATCCAACATCATCTTTGCTGAAGCAATCTGTCATTTCATATTCTTTTCCAGATGAACAACTTTTTGCTGTTGAGGTGATAAGCAAAGGAAAT
CCTGTAGCAATGTCACTCTTCGCCGTATGGAGGCCATTTCAGCGGTTAGAGGACAACTATGAGGATTTTGCATTGTGGATTCTTTGGCTTCCTTATTTAAGGATGCACAT
TGCAGCGTCGAGACGCTTAGCAAGGCAGCGTCGCGACGCTGCCCTTTCTTGTTGCCCAAGCCGCGTGCGCAGCAGCGTCGCGACGCTTAGCAAGGCAGCGTCGCGACGCT
ACCCATATTCCAGGCCTATAAATAGGCGCCCTTGTGGGAAATTTTGGGGATTTTGGGAAGCAATCATAGAGGCTCGAGGAGTAGTCAGCGGGGGCTCCAAGCGTCAGCAA
GGGGTTTTCACGGATAAAGCAAAAGATTGGCTCGAATCAGTCGAGACGGGCAGCATCAGTACTTGGGACGAGCTTGCCCAGGCTTTTCTGACAAAATTTTTTCCACCAGC
TAAGACTACCAAGCTGCGGACTGAAATTGGAACATTCAGACAGCTTGACGAGGAGCAGTTATACGAAGCGTGGGAAAGATATAAGGAAATGCTTAGGCGATGCCCCCAAC
ACGGATATCCTGATTGGCTTCAGTTATCAATGGCCGACGGAGAGAGGAACAGTTACAAAAAGGCTGGATTATATGAATTGGATGAGTCAAGTTCACTGAAAGCGCAACTG
GCATCTCTGACCAATGCACTAAACAAATTGACGTCATCTGAGGTGGTTAAGTCCATTTCCACCTTAGCTGAAGGACATTCGAAGAAAGAAGCCACCCCCAGGTTTTGCAT
CAACAGTACTCCTGAGAAGAAAAATAATCTGGAGGAGATGGTGGCTTTATTCATCAAGGAACAAAGAATATTGAATGTGAATCTCCAGACATCAGTAAACAACCACGACG
CAGCTCTAAAGAATATGGAAGTGCAGATAGGTCAGATTGCTTCAGCAGTAAATGCCCTGCAGAAGGGAAAATTTCCAAGCGATACTGAGCCTAACCCGAAAGAGCAGTGT
AAGATGGTGGTTCTGAGAAGTGGCAGAAGACTGGAGGACAGTTAG
Protein sequence
MTVSQAIPNLPINELPKYAIDGRDAVDIASRRCVNARPSKCGKFRSVETLWSTINCAIKEASRKQKSGKKDKKMKKAMVEEGDTVRVHQCKPKRASELYFKIGGKILKFG
LREFTLITGLNCGPLPQLGRDRLQESSRFKNEYFDDGEGVRRKTLNIVFKAIKHGVEADLVKMAHYQELFNTYSWGRVAFTLSINYMQKASNLLLSDLREQNLCWSKARS
KTATSQLVRQFDELNSVIVWCMSDPPRARFELDLEIERTFKRKGESSVDNKIKWITYRVSHRAENPILVANDRARAIRAYAFLMFDELNPGIARPQIEAWLNSFAPGSIR
TWNELAEKFLSKYFPPNRNAKLRSEIVGFRQLEDGTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNGATQVFSHQQPPAVEPTTMVKQVAEEARVYCGEYHNYEFF
PSNPAFVFFVGKASRETSLDTEHPRREGKEHVKAMTLRSGKPLEERKETSKTQDIEKNCDKNVVVEKELESGQGAGGSNNDARAFGSVPDVEPPYVSPPPYVPPLPFPQR
QRPNNQDGQFKKFLEILKQLHINIPLVEAIEQMPNYAKFLEDILTKKRRLGEFETVSLTEECSAILNNGLPTKAKDPGSFTIPVSIGGKELGFWITQLLRQQYRIRQTSI
WKIMERFRSKKAPPIKPSLIEAPTLDLKPLPDHLKYVYLGEDSNWVSPIQCVPKKGSVTVVSNKDNELIPTRTVTGWKTYYCFLDGYSRYNQITIAPEDQEKTTFTCPYG
TLAFGECLLAFQCSNNISAVKAFETLKAALISTPILCAPNWNLPFEVMCDASDAAVGIRLGDKRQEEIRNVIADHLSRLDPTSSLLKQSVISYSFPDEQLFAVEVISKGN
PVAMSLFAVWRPFQRLEDNYEDFALWILWLPYLRMHIAASRRLARQRRDAALSCCPSRVRSSVATLSKAASRRYPYSRPINRRPCGKFWGFWEAIIEARGVVSGGSKRQQ
GVFTDKAKDWLESVETGSISTWDELAQAFLTKFFPPAKTTKLRTEIGTFRQLDEEQLYEAWERYKEMLRRCPQHGYPDWLQLSMADGERNSYKKAGLYELDESSSLKAQL
ASLTNALNKLTSSEVVKSISTLAEGHSKKEATPRFCINSTPEKKNNLEEMVALFIKEQRILNVNLQTSVNNHDAALKNMEVQIGQIASAVNALQKGKFPSDTEPNPKEQC
KMVVLRSGRRLEDS
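
The protein sequence above appears to be the direct translation of the mRNA sequence listed for this gene. The following is a minimal sketch of how that correspondence could be checked, assuming the standard genetic code (NCBI translation table 1) and reading frame 1 starting at the first base. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with amino acids listed
# in TCAG codon order: TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(mrna: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines can
    be pasted in as-is. Unknown codons (e.g. containing N) become 'X'.
    """
    seq = "".join(mrna.split()).upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases of the mRNA sequence and its final wrapped line, copied from above.
mrna_start = "ATGACGGTTTCTCAAGCAATCCCGAATCTCCCGATTAAC"
mrna_end = "AAGATGGTGGTTCTGAGAAGTGGCAGAAGACTGGAGGACAGTTAG"

print(translate(mrna_start))  # compare with the start of the protein sequence
print(translate(mrna_end))    # compare with the last line of the protein sequence
```

Pasting the full mRNA sequence into `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.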