; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036443 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036443
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:46547182..46550888
RNA-Seq ExpressionLag0036443
SyntenyLag0036443
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]4.5e-5728.89Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------
        M P+KAPG DG PALF+QKYW  VG+     CL ILN   S +++N+T I LIPKVK P  VS+FRPI+LC    K+ AK                    
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------

Query:  ------------------------------------------------------------------------------------------------GDPL
                                                                                                        G PL
Subjt:  ------------------------------------------------------------------------------------------------GDPL

Query:  SSYLFLLFSEVLSSLLSG-------------------------ENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM
        S YLFL+ +E  S LL G                         ++S   + A+      L  L Q YE  S Q+IN +KSA   S N +  +  M++ ++
Subjt:  SSYLFLLFSEVLSSLLSG-------------------------ENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM

Query:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR-------------WT
         + VV     YLG+P+   K RK+ F  +K ++W+ + GWK    S  GKE+L+K+V QAI ++ MSCFR+P  LC +L+ +M R             W 
Subjt:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR-------------WT

Query:  K--------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPW
        K              F DL +FN+ALLAK+ WRI   P  LV+R+ +ARY      L A   +N SF WRS  W + LL++                D W
Subjt:  K--------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPW

Query:  IPKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLN-SSEFEKACI
        +P    F   +  P Q   ++LV +  T+   W++  L+D  WD +V     I +      D  IWHY  N   +  S +  AC+
Subjt:  IPKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLN-SSEFEKACI

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.6e-6225.19Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------
        M P+KAPGPDG PA F+QK+W  VGE     CL ILN+  +    N+TFI LIPKV++P++V +FRPI+LCNV  +I AK                    
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------

Query:  ------------------------------------------------------------------------------------------------GDPL
                                                                                                        G PL
Subjt:  ------------------------------------------------------------------------------------------------GDPL

Query:  SSYLFLLFSEVLSSLLS------------------------GENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQ
        S YLF+L +E  S+LL+                         ++S     AS      L+ +   Y  AS Q  N  KS+MFFS   SS + + +++I Q
Subjt:  SSYLFLLFSEVLSSLLS------------------------GENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQ

Query:  MSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR-------------WTK
        + VV     YLG+P    +N+   F  VK +V   +  W    FS GGKE+LIK+VAQA+ ++ MS F+LP  LC+D+ + + R             W +
Subjt:  MSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR-------------WTK

Query:  --------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWA----------------RSLLSQDPWI
                      F DL SFN+AL+AK+ WR+   PN L++RV++ARY   +    A   SN SF WRS +W                 + L+ +D WI
Subjt:  --------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWA----------------RSLLSQDPWI

Query:  PKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYS-----------------------ENANLNSSE
        P+  TF PI   P      ++VA+ I + N W + +L  +   +D+  I  I +     ED+ +WH+                        E++N +S  
Subjt:  PKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYS-----------------------ENANLNSSE

Query:  F---------EKACIAFWSIWNN----------RNSLRE-------------SKPIVE-------WSL--------------------------------
        +         EK  I  W    N          R SL+E             S  ++E       W L                                
Subjt:  F---------EKACIAFWSIWNN----------RNSLRE-------------SKPIVE-------WSL--------------------------------

Query:  ------------------------------QTESIINYWKETT---HKKGAQD-SIDQNPTQQPIILSPNSVQVYTDAVVRPNRTGAGLGVVI-IGEGNI
                                      + +S++  ++  +   +  GA+D  IDQ   + P   S N +++  DA V       GLG ++   EG I
Subjt:  ------------------------------QTESIINYWKETT---HKKGAQD-SIDQNPTQQPIILSPNSVQVYTDAVVRPNRTGAGLGVVI-IGEGNI

Query:  LHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWI-DSIQEMCKDFHAISFNFVPRDQNLRADVLAKHAL
        L   ++  +  +   L AE  A+   +++ +++    + V SD    + ++N    + T++ HWI   ++   K+F  + F+F+PR  N  A  LAK AL
Subjt:  LHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWI-DSIQEMCKDFHAISFNFVPRDQNLRADVLAKHAL

Query:  EHSSTMLWISNFP
         +SST +W+  FP
Subjt:  EHSSTMLWISNFP

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]8.8e-6125.11Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------
        M P+KA GPDGFPALFYQ YW  VG   +  CL+ LN     K WN+T+I LIPK+KQP+ +SDFRPI+LCNV+ KI +K                    
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------

Query:  ------------------------------------------------------------------------------------------------GDPL
                                                                                                        GDPL
Subjt:  ------------------------------------------------------------------------------------------------GDPL

Query:  SSYLFLLFSEVLSSLLSGENSCGVITA-----STTQVT--------------------TLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM
        S YLFLL +E LS+L++ EN+ G +T      + T +T                     LR+LL +Y  AS Q IN +KSA+ FS NV    +  LQ I+
Subjt:  SSYLFLLFSEVLSSLLSGENSCGVITA-----STTQVT--------------------TLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM

Query:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTRWTKFCDLVSFNKALL
         + +V   G YLG+PS F++ R E                K H+   G                   C+      C  L+        F DL  FN+AL+
Subjt:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTRWTKFCDLVSFNKALL

Query:  AKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPWIPKEITFTPIVKDPSQNHENSLVAEFI
        AK VWR   +PNLLVS+V++ +Y   T LL A   S  S+FW+  +W R LL +                DPW+P+  TF P+    +    ++ VA FI
Subjt:  AKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPWIPKEITFTPIVKDPSQNHENSLVAEFI

Query:  TTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENAN------------------------------------------------------
        T    WD++ +     ++D  LI ++ I   N++D W+WHY +  N                                                      
Subjt:  TTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENAN------------------------------------------------------

Query:  --------------------------------------------------------------------LNSSEFEKACIAFWSIWNNRNSLRESKPIVEW
                                                                            L   +   A I  W IWN+RNSL   K +   
Subjt:  --------------------------------------------------------------------LNSSEFEKACIAFWSIWNNRNSLRESKPIVEW

Query:  SLQTESIINYWKETTHKKGAQDSIDQNPTQQPII-----LSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIR
          + E +  +    +  + +  S       +P++      S  S+++ TDA  R   T    G +I      L  +  +     L+PL AE+  +L  ++
Subjt:  SLQTESIINYWKETTHKKGAQDSIDQNPTQQPII-----LSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIR

Query:  LIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHAL-EHSSTMLWISNFPTWLVSMDRKVEPLYF
                 + V SDS+ AI++I  E+ T  D  +W+  IQ +   F  ISF+   R  N  A  LAK  +   S+T  W+ NFPTWL+ + ++  P  F
Subjt:  LIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHAL-EHSSTMLWISNFPTWLVSMDRKVEPLYF

XP_023897447.1 uncharacterized protein LOC112009345 [Quercus suber]4.8e-5925.03Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------
        M+P K+PGP G P LF+Q +W  +G T     LD LN   +   +N+T + LIPK K PK +SD+RPI+LCNV  KIA+K                    
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------

Query:  ------------------------------------------------------------------------------------------------GDPL
                                                                                                        GDPL
Subjt:  ------------------------------------------------------------------------------------------------GDPL

Query:  SSYLFLLFSEVLSSLLSGENSCGVI-------------------------TASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM
        S YLFL+ +E LSSL+      G +                          A+     TL+++ Q YE AS Q++N  K+++FFS N S   +  ++   
Subjt:  SSYLFLLFSEVLSSLLSGENSCGVI-------------------------TASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM

Query:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR---------------
           V+     YLG+PS   +N++  F+ +K ++ + L GWKG      GKEVLIK+VAQAI ++ MSCF++P +LC++L  M+ +               
Subjt:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR---------------

Query:  -WTKFCD-----------LVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPW
         W K C+           L  FN ALLAK+ WR+    N L+ RV++A+Y      L A A +N S+ WRS + A+S++ +                D W
Subjt:  -WTKFCD-----------LVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPW

Query:  IPKEITFTPIVKDPSQNHENSLVAEFITTLN-GWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIW--------------------HYSENANLNSS--
        +P+  ++  ++      H  + V+EFI      W    +R   +  D+ +I  I +     ED+ IW                    + +ENA   SS  
Subjt:  IPKEITFTPIVKDPSQNHENSLVAEFITTLN-GWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIW--------------------HYSENANLNSS--

Query:  -------------------EFEKACIAF---WSIWNNRNSLRES------KPIVEWSLQTESIINYWKETTHKKGAQDSIDQNPTQQPIILSPNSVQVYT
                           + EK  +     W+ W NRN +R        + IV+W      ++ Y   T      ++ +       P    P+ ++V  
Subjt:  -------------------EFEKACIAF---WSIWNNRNSLRES------KPIVEWSLQTESIINYWKETTHKKGAQDSIDQNPTQQPIILSPNSVQVYT

Query:  DAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHA
        D     N    G+G V+  E   +  +M       L PL  EV A    ++L   M  + + +  DS+  +R +     +++ +   I  +Q  C DF  
Subjt:  DAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHA

Query:  ISFNFVPRDQNLRADVLAKHALEHSSTMLWISNFP
        +  + V R +N  A VLAK+AL  + +++WI   P
Subjt:  ISFNFVPRDQNLRADVLAKHALEHSSTMLWISNFP

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina]4.1e-5830.17Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAKG-------------DPLSS-
        M P+KAPGPDG PA+F+QK+W  V +  +S CL ILN+      +N+T+I LI K  +P++V+DFRPI+LCNV  +I AK               P+ S 
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAKG-------------DPLSS-

Query:  --------------------------------------------------------------YLFLLFSEVLSS----------------------LLSG
                                                                      ++ L+ S +LSS                      LL  
Subjt:  --------------------------------------------------------------YLFLLFSEVLSS----------------------LLSG

Query:  ENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGH
        ++S     AS T    L+K+   Y   S Q  N  KS+MF + N+S+G+ + ++ I Q+++V     YLG+PS   + R   FN +K ++   +  W+  
Subjt:  ENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGH

Query:  FFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMM----------------TRWTK-----------FCDLVSFNKALLAKKVWRIFINPNLLVS
        FFS GGKEVLIK+  QAI ++ MS F++P  +CDD+ R++                +RW K           F D  SFN+AL+AK+ WRI   P+ LV+
Subjt:  FFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMM----------------TRWTK-----------FCDLVSFNKALLAKKVWRIFINPNLLVS

Query:  RVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLS----------------QDPWIPKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVW
        +++QARY      + A   SN SF WRS +W R ++S                ++ W+P+ +TF PI K PS    ++LVAE I   + W+   +  +  
Subjt:  RVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLS----------------QDPWIPKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVW

Query:  DDDVRLITTIHIGLANVEDKWIWHYSE
          D   I  I +    ++D+ IWHY +
Subjt:  DDDVRLITTIHIGLANVEDKWIWHYSE

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248744.2e-6125.11Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------
        M P+KA GPDGFPALFYQ YW  VG   +  CL+ LN     K WN+T+I LIPK+KQP+ +SDFRPI+LCNV+ KI +K                    
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------

Query:  ------------------------------------------------------------------------------------------------GDPL
                                                                                                        GDPL
Subjt:  ------------------------------------------------------------------------------------------------GDPL

Query:  SSYLFLLFSEVLSSLLSGENSCGVITA-----STTQVT--------------------TLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM
        S YLFLL +E LS+L++ EN+ G +T      + T +T                     LR+LL +Y  AS Q IN +KSA+ FS NV    +  LQ I+
Subjt:  SSYLFLLFSEVLSSLLSGENSCGVITA-----STTQVT--------------------TLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM

Query:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTRWTKFCDLVSFNKALL
         + +V   G YLG+PS F++ R E                K H+   G                   C+      C  L+        F DL  FN+AL+
Subjt:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTRWTKFCDLVSFNKALL

Query:  AKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPWIPKEITFTPIVKDPSQNHENSLVAEFI
        AK VWR   +PNLLVS+V++ +Y   T LL A   S  S+FW+  +W R LL +                DPW+P+  TF P+    +    ++ VA FI
Subjt:  AKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPWIPKEITFTPIVKDPSQNHENSLVAEFI

Query:  TTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENAN------------------------------------------------------
        T    WD++ +     ++D  LI ++ I   N++D W+WHY +  N                                                      
Subjt:  TTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENAN------------------------------------------------------

Query:  --------------------------------------------------------------------LNSSEFEKACIAFWSIWNNRNSLRESKPIVEW
                                                                            L   +   A I  W IWN+RNSL   K +   
Subjt:  --------------------------------------------------------------------LNSSEFEKACIAFWSIWNNRNSLRESKPIVEW

Query:  SLQTESIINYWKETTHKKGAQDSIDQNPTQQPII-----LSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIR
          + E +  +    +  + +  S       +P++      S  S+++ TDA  R   T    G +I      L  +  +     L+PL AE+  +L  ++
Subjt:  SLQTESIINYWKETTHKKGAQDSIDQNPTQQPII-----LSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIR

Query:  LIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHAL-EHSSTMLWISNFPTWLVSMDRKVEPLYF
                 + V SDS+ AI++I  E+ T  D  +W+  IQ +   F  ISF+   R  N  A  LAK  +   S+T  W+ NFPTWL+ + ++  P  F
Subjt:  LIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHAL-EHSSTMLWISNFPTWLVSMDRKVEPLYF

A0A803NGQ8 Uncharacterized protein2.5e-6126.38Show/hide
Query:  KAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK------------------------
        KAPGPDG  A FYQK W  VG +     LD+LN        NNT + L+PK K    + DFRPI+LC    KI +K                        
Subjt:  KAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK------------------------

Query:  --------------------------------------------------------------------------------------------GDPLSSYL
                                                                                                    GDPLS YL
Subjt:  --------------------------------------------------------------------------------------------GDPLSSYL

Query:  FLLFSEVLSS-------------------------LLSGENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSV
        FLL SE LS+                         LL  ++S    T S     +++++L  Y  A+ Q +N++KS++ FS N S  ++      +Q+  
Subjt:  FLLFSEVLSS-------------------------LLSGENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSV

Query:  VDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMT----------------RWTK
           +  YLGVP  F ++++  F+ + Q+    LQ W   FFS  GKE LIK+V QAI S+ MSCFR+P ++C  L R+                  +W+ 
Subjt:  VDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMT----------------RWTK

Query:  FC-----------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNC-SFFWRSCIWARSLLS----------------QDPWIP-
         C           + V  N+ALLAK+ WRI  NP+ L++R+++A+Y      L+A +K +C S+ W S +W R LLS                +  WIP 
Subjt:  FC-----------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNC-SFFWRSCIWARSLLS----------------QDPWIP-

Query:  -KEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLNS------SEFE-------KACIA-FW
         K IT+   V  P     N  V+ FI+    WD+ KL  Y   D V+ I TI I ++   D  IW    + NL        + FE         C+A  W
Subjt:  -KEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLNS------SEFE-------KACIA-FW

Query:  SIWNNRN-SLRESKPIVEWSLQTESIINYWKETTHKK---GAQDSIDQNPTQQPIILSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESM
        SIWN RN +L +SK      ++ +S+++Y KE ++ +    A   I      QP +       +Y+DA +   R+  G G  I      +  ++    + 
Subjt:  SIWNNRN-SLRESKPIVEWSLQTESIINYWKETTHKK---GAQDSIDQNPTQQPIILSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESM

Query:  DLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLW
         L+P  AE  A+L  I+    + +    + +D ++ ++ I+K     +  +  +  I      F     + V R+ N  A  LA  ALE  + ++W
Subjt:  DLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLW

A0A803PVM0 Uncharacterized protein3.0e-5932.18Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCN----------------------------
        ++P K+PG DG  A+FY KYW  VG+      L +LN   S +  N++ ITLIPK K P  + DFRPI+LCN                            
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCN----------------------------

Query:  -----------------VACKIAAK---------------------GDPLSSYLFLLFSEVLSSLLSGE-------------------------NSCGVI
                         ++C  + K                     GDPLS YLFL+ SE LS LL  E                         +S    
Subjt:  -----------------VACKIAAK---------------------GDPLSSYLFLLFSEVLSSLLSGE-------------------------NSCGVI

Query:  TASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGK
         A+      ++K+L  Y  AS Q +N TKS M FS N +   +    N + M + +    YLG+P+   +++KE F+ VK+R+WQ L  W    FS+GGK
Subjt:  TASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGK

Query:  EVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR----------------WTKFC-----------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARY
        EVL+K+V Q+I ++ MSCFRLPST C  L  MM                  W   C             V FNKALLAK+ WRIF  PN L+SR+++ RY
Subjt:  EVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR----------------WTKFC-----------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARY

Query:  AHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPWIPKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLI
              L A    + S  W++  W R LL Q                D WIP    F  I     Q   N  VA FIT    W+I+ L  Y    DV  I
Subjt:  AHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPWIPKEITFTPIVKDPSQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLI

Query:  TTIHIGLANVEDKWIWHYS
         TI +      D  IWH++
Subjt:  TTIHIGLANVEDKWIWHYS

A0A803PWX1 Uncharacterized protein2.3e-5925.49Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------
        M+P+KAPG DG PALFYQK+W  + +  I+ CL++LN        N+T + LIPKV +P+++ +FRPI+LCNV  KI +K                    
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK--------------------

Query:  ------------------------------------------------------------------------------------------------GDPL
                                                                                                        GDPL
Subjt:  ------------------------------------------------------------------------------------------------GDPL

Query:  SSYLFLLFSEVLSSLLS-------------------------GENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM
        S +LFLL +E  S L+                           ++S   + A+  +    ++LL+ Y  AS Q +N  KS M F + V+   RT L NIM
Subjt:  SSYLFLLFSEVLSSLLS-------------------------GENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIM

Query:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR-------------WT
         + VVD+ G YLG+PS   + +K+ F  +K +VW  L+GWKG FFS  GKEVLIK+V QAI ++ MSCFRLP    + +H M  R             W 
Subjt:  QMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMTR-------------WT

Query:  K--------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPW
        K              F DL  FN+ALLAK+VWR    PN L SRV++A Y     ++ A + ++ SF WRS +W + ++ +                DPW
Subjt:  K--------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQ----------------DPW

Query:  IPKEITFTPIVKDP-------------------------------SQNHENSLVAEFITTLNGWDIS---KLRDYVWDDDVRLITT------IHI-----
        +P+ +TF    K P                               S++ ++   A        W +    K++ +VW      I T       HI     
Subjt:  IPKEITFTPIVKDP-------------------------------SQNHENSLVAEFITTLNGWDIS---KLRDYVWDDDVRLITT------IHI-----

Query:  -------GLANV----------------------------EDKWIWHYSENANLNSSEFEKACIAFWSIWNNRNSLRES--KPIVEWSLQTESIINYWKE
                  NV                            ED   +    ++     EFE   I  W++W  RNS+     KP      Q  +I+++  +
Subjt:  -------GLANV----------------------------EDKWIWHYSENANLNSSEFEKACIAFWSIWNNRNSLRES--KPIVEWSLQTESIINYWKE

Query:  TTHK-KGAQDSIDQNPTQQPIILSP---NSVQVYTDAVVRPNRTGAGLGVVIIG-EGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVC
          H+ +    ++     ++    +P    S +V  DA ++     AGL  V+   EG ++  +    E    +PL  E+ A+L  I+   +  +    V 
Subjt:  TTHK-KGAQDSIDQNPTQQPIILSP---NSVQVYTDAVVRPNRTGAGLGVVIIG-EGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVC

Query:  SDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLWISNFP
        SD + A+ ++ KE +   DV   I  I+E+ +        FV R+ N  A VLA  AL + ++ +W+   P
Subjt:  SDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLWISNFP

A0A803Q7V4 Uncharacterized protein8.8e-5928.83Show/hide
Query:  KAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK----------GDPLS---SYLFLL
        KAPGPDG    F+QK WD+ G    S  L  LN        N+T + LIPKVK   R+ D+RPI+LC+   K+ +K           D +S    Y++  
Subjt:  KAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAK----------GDPLS---SYLFLL

Query:  FSEVLSSLLSGENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQR
         +  +S LL  ++S      ST+  + ++ +L  Y+ A+ Q +N+ KS++ FS N SS +R + ++ + M     +  YLGVP  FS+++K  F  + Q+
Subjt:  FSEVLSSLLSGENSCGVITASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQR

Query:  VWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMT----------------RWTKFC-----------DLVSFNKALLAKKVW
            L  W  + FS  GKE+L+KSV QAI S+ MSCFR+P ++C     +M+                +W   C           ++V  N+A+LAK+ W
Subjt:  VWQTLQGWKGHFFSMGGKEVLIKSVAQAISSFIMSCFRLPSTLCDDLHRMMT----------------RWTKFC-----------DLVSFNKALLAKKVW

Query:  RIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLL----------------SQDPWIP--KEITFTPIVKDPSQNHENSLVAEFITTL
        RIF +P+ LV+ +++A+Y        AP   + SF  RS +W R LL                S+  W+   + + F P +  PS       V+ FIT  
Subjt:  RIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLL----------------SQDPWIP--KEITFTPIVKDPSQNHENSLVAEFITTL

Query:  NGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLNSSEFEKACIAFWSIWNNRNSLRESKPIVE---WSLQTESIINYWKETTHKKGAQD
          WDI+KL+ Y  +     I ++ I   +  D  IW+Y  +     +       A+    +N ++L  S P V    WS    S I    +    + AQ 
Subjt:  NGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLNSSEFEKACIAFWSIWNNRNSLRESKPIVE---WSLQTESIINYWKETTHKKGAQD

Query:  SIDQNPTQQPIILSPNS------VQVYTDAVVRPNRTGAGLGVVIIG-EGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAI
        S+  +P     +++ NS       Q+  DA + P  T  G G V+   +GNI+ G   +  +  L P+ AE  A+   +     +Q     + SD +  +
Subjt:  SIDQNPTQQPIILSPNS------VQVYTDAVVRPNRTGAGLGVVIIG-EGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAI

Query:  RMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLW
        R + K     + ++ +I +I      F   S  F+PR +N     LA+  L     M+W
Subjt:  RMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLW

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein2.7e-0439.51Show/hide
Query:  KAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTF----ITLIPK-VKQPKRVSDFRPINLCNVACKIAAK
        K+PGPDGF A FYQ +     E  I     + ++        N+F    ITLIPK  K P ++ +FRPI+L N+  KI  K
Subjt:  KAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTF----ITLIPK-VKQPKRVSDFRPINLCNVACKIAAK

P93295 Uncharacterized mitochondrial protein AtMg003104.0e-0833.33Show/hide
Query:  AISSFIMSCFRLPSTLCDDLHRMMTR----------------WTKFC------------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLT
        A+  + MSCFRL   LC  L   MT                 W K C            DL  FN+ALLAK+ +RI   P+ L+SR++++RY   + ++ 
Subjt:  AISSFIMSCFRLPSTLCDDLHRMMTR----------------WTKFC------------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLT

Query:  APAKSNCSFFWRSCIWARSLLSQ
            +  S+ WRS I  R LLS+
Subjt:  APAKSNCSFFWRSCIWARSLLSQ

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.3e-0940.26Show/hide
Query:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKI
        M  +KAPGPD F A F+ + W  V ++ I+   +        K +N T ITLIPKV    ++S FRP++ C V  KI
Subjt:  MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKI

AT3G25270.1 Ribonuclease H-like superfamily protein5.9e-0722.27Show/hide
Query:  ANLNSSEFEKACIAFWSIWNNRNSLRESKPIVEWS---LQTESIINYWKET-THKKGAQDSIDQNPTQQPIIL------SPNS-VQVYTDAVVRPNRTGA
        AN     F  A    W +W +RN L   +  + W     +  + +  W++T T+ +     +  +  QQP +        P++ ++   D         A
Subjt:  ANLNSSEFEKACIAFWSIWNNRNSLRESKPIVEWS---LQTESIINYWKET-THKKGAQDSIDQNPTQQPIIL------SPNS-VQVYTDAVVRPNRTGA

Query:  GLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQN
          G ++  E  +  GS +   S   + L +E  A++  ++       R+V    DS     ++N E        +WI   +   K F    F +VPR  N
Subjt:  GLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDSIQEMCKDFHAISFNFVPRDQN

Query:  LRADVLAKHALEHSSTMLWISNFPTWLVS
          AD+LAKH L+ + +  +    P ++ S
Subjt:  LRADVLAKHALEHSSTMLWISNFPTWLVS

AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein4.3e-0531.82Show/hide
Query:  VQVYTDAVVRPNRTGAGLGVVI--IGEGNILHGSMEMFESMDLN---PLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDS
        V ++TDA  +      G G VI    E   LH     ++S   N   PL AE +A+   ++    + I ++ + SDS   I  I  E   +T+    I  
Subjt:  VQVYTDAVVRPNRTGAGLGVVI--IGEGNILHGSMEMFESMDLN---PLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVAHWIDS

Query:  IQEMCKDFHAISFNFVPRDQNLRADVLAKHAL
        I  +   F  +SF+FVPR +N  AD LAK +L
Subjt:  IQEMCKDFHAISFNFVPRDQNLRADVLAKHAL

AT4G29090.1 Ribonuclease H-like superfamily protein2.4e-0828.69Show/hide
Query:  AISSFIMSCFRLPSTLCDDLHRMMT-------------RWTK--------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTA
        A+ ++ M+CF LP T+C  +  ++               W                F D+ +FN ALL K++WR+   P  L+++V ++RY H +  L A
Subjt:  AISSFIMSCFRLPSTLCDDLHRMMT-------------RWTK--------------FCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTA

Query:  PAKSNCSFFWRSCIWARSLLSQ
        P  S  SF W+S   ++ +L Q
Subjt:  PAKSNCSFFWRSCIWARSLLSQ

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-0933.33Show/hide
Query:  AISSFIMSCFRLPSTLCDDLHRMMTR----------------WTKFC------------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLT
        A+  + MSCFRL   LC  L   MT                 W K C            DL  FN+ALLAK+ +RI   P+ L+SR++++RY   + ++ 
Subjt:  AISSFIMSCFRLPSTLCDDLHRMMTR----------------WTKFC------------DLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLT

Query:  APAKSNCSFFWRSCIWARSLLSQ
            +  S+ WRS I  R LLS+
Subjt:  APAKSNCSFFWRSCIWARSLLSQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATCCTTCCAAGGCCCCTGGACCGGATGGCTTTCCAGCTCTCTTTTATCAAAAGTATTGGGATGCGGTAGGTGAAACAAATATCTCGAATTGCTTGGATATTTTAAA
CCAGTGTCGATCTGCTAAAGACTGGAATAATACTTTTATCACCCTTATCCCAAAGGTAAAGCAACCCAAACGTGTCTCTGATTTTCGACCCATCAATTTATGTAATGTTG
CTTGCAAGATTGCAGCCAAGGGAGACCCACTTTCTTCTTATTTATTCCTGCTTTTCTCTGAGGTGTTGTCATCTCTATTATCTGGGGAAAACAGTTGTGGCGTTATCACA
GCATCGACTACCCAAGTCACTACTTTACGTAAGCTACTACAAGCATATGAGGTGGCATCTGTTCAAGAAATTAATGTGACAAAGTCAGCAATGTTCTTTTCCCAAAATGT
AAGTAGTGGCGAGCGCACTATGCTTCAAAATATCATGCAGATGTCGGTTGTGGATTCTTTAGGGACATATCTGGGTGTCCCTTCTTCTTTTTCTAAAAATAGAAAAGAAG
ATTTCAATGGAGTCAAACAACGTGTTTGGCAAACTCTACAAGGTTGGAAGGGACATTTTTTCTCCATGGGAGGGAAAGAGGTTCTTATTAAAAGTGTGGCTCAAGCTATC
TCATCTTTCATTATGAGTTGCTTTCGTTTGCCAAGTACCCTCTGCGATGATTTACACAGAATGATGACCAGGTGGACTAAATTTTGCGATCTAGTAAGCTTCAACAAGGC
TTTATTGGCAAAAAAAGTTTGGAGAATATTCATTAATCCAAACCTTCTCGTCTCTAGAGTTATTCAGGCCAGGTACGCTCATGGTACTTTGTTATTAACTGCTCCTGCTA
AATCAAATTGTTCTTTCTTTTGGAGAAGCTGTATTTGGGCTCGTAGTTTGCTTTCACAAGATCCCTGGATCCCAAAAGAAATAACTTTTACACCAATCGTCAAAGATCCA
TCACAAAATCATGAAAATAGTCTAGTGGCAGAGTTTATCACAACCTTAAATGGGTGGGATATATCAAAGTTGAGAGATTATGTGTGGGATGACGATGTTCGATTAATTAC
TACAATCCACATTGGCTTGGCAAATGTTGAAGATAAGTGGATCTGGCATTACTCTGAAAACGCAAATTTAAATTCAAGTGAGTTCGAAAAGGCTTGCATCGCCTTCTGGT
CTATCTGGAATAATCGTAATTCTCTACGTGAAAGCAAGCCTATTGTTGAATGGTCTCTTCAGACTGAATCGATTATTAATTATTGGAAGGAAACAACTCACAAGAAAGGA
GCACAAGATAGTATCGACCAAAATCCGACACAACAACCCATTATTTTATCTCCTAATTCCGTACAAGTATATACTGATGCAGTTGTAAGGCCTAACCGTACGGGTGCGGG
ACTGGGTGTTGTGATAATTGGGGAGGGAAATATCCTACATGGTTCTATGGAGATGTTCGAAAGTATGGATTTAAATCCTTTAGCGGCTGAAGTCCTAGCAGTTCTCCACG
TAATTAGACTCATTCATCGAATGCAAATTCGAGAAGTTCATGTGTGTTCTGATTCTGTTAATGCCATTAGGATGATCAATAAGGAATTGGATACGACTACAGATGTGGCA
CATTGGATAGACAGTATCCAGGAAATGTGTAAGGATTTTCATGCTATTTCTTTCAATTTTGTTCCTAGAGATCAAAATTTAAGAGCTGATGTTTTAGCTAAACATGCTTT
AGAACATAGCAGTACCATGTTGTGGATATCAAATTTTCCAACATGGTTGGTGTCTATGGACCGAAAGGTTGAACCACTATATTTCTTGGTGTTCTTTTCTCTTTTGCTTA
ATTATTTGCTGCTGGAGATTTTCGTTCTGATTTGGCATATTTACATTCGCTTAAAATTCTACAAGTGGTATCAGAGCCATGTCTTGGGCGTAGCCGTGTCGGGTGGAATC
CTCGGGTGCCGAACAAAGAAAGTGTTGAGCCTTGAAGTAGTCATGGGGAGATCTGTGGTTGAGCTTTGGGTAGGCGAGTATACTTCGGAGAAGATGAGTCGTCCTGGATA
A
mRNA sequenceShow/hide mRNA sequence
ATGCATCCTTCCAAGGCCCCTGGACCGGATGGCTTTCCAGCTCTCTTTTATCAAAAGTATTGGGATGCGGTAGGTGAAACAAATATCTCGAATTGCTTGGATATTTTAAA
CCAGTGTCGATCTGCTAAAGACTGGAATAATACTTTTATCACCCTTATCCCAAAGGTAAAGCAACCCAAACGTGTCTCTGATTTTCGACCCATCAATTTATGTAATGTTG
CTTGCAAGATTGCAGCCAAGGGAGACCCACTTTCTTCTTATTTATTCCTGCTTTTCTCTGAGGTGTTGTCATCTCTATTATCTGGGGAAAACAGTTGTGGCGTTATCACA
GCATCGACTACCCAAGTCACTACTTTACGTAAGCTACTACAAGCATATGAGGTGGCATCTGTTCAAGAAATTAATGTGACAAAGTCAGCAATGTTCTTTTCCCAAAATGT
AAGTAGTGGCGAGCGCACTATGCTTCAAAATATCATGCAGATGTCGGTTGTGGATTCTTTAGGGACATATCTGGGTGTCCCTTCTTCTTTTTCTAAAAATAGAAAAGAAG
ATTTCAATGGAGTCAAACAACGTGTTTGGCAAACTCTACAAGGTTGGAAGGGACATTTTTTCTCCATGGGAGGGAAAGAGGTTCTTATTAAAAGTGTGGCTCAAGCTATC
TCATCTTTCATTATGAGTTGCTTTCGTTTGCCAAGTACCCTCTGCGATGATTTACACAGAATGATGACCAGGTGGACTAAATTTTGCGATCTAGTAAGCTTCAACAAGGC
TTTATTGGCAAAAAAAGTTTGGAGAATATTCATTAATCCAAACCTTCTCGTCTCTAGAGTTATTCAGGCCAGGTACGCTCATGGTACTTTGTTATTAACTGCTCCTGCTA
AATCAAATTGTTCTTTCTTTTGGAGAAGCTGTATTTGGGCTCGTAGTTTGCTTTCACAAGATCCCTGGATCCCAAAAGAAATAACTTTTACACCAATCGTCAAAGATCCA
TCACAAAATCATGAAAATAGTCTAGTGGCAGAGTTTATCACAACCTTAAATGGGTGGGATATATCAAAGTTGAGAGATTATGTGTGGGATGACGATGTTCGATTAATTAC
TACAATCCACATTGGCTTGGCAAATGTTGAAGATAAGTGGATCTGGCATTACTCTGAAAACGCAAATTTAAATTCAAGTGAGTTCGAAAAGGCTTGCATCGCCTTCTGGT
CTATCTGGAATAATCGTAATTCTCTACGTGAAAGCAAGCCTATTGTTGAATGGTCTCTTCAGACTGAATCGATTATTAATTATTGGAAGGAAACAACTCACAAGAAAGGA
GCACAAGATAGTATCGACCAAAATCCGACACAACAACCCATTATTTTATCTCCTAATTCCGTACAAGTATATACTGATGCAGTTGTAAGGCCTAACCGTACGGGTGCGGG
ACTGGGTGTTGTGATAATTGGGGAGGGAAATATCCTACATGGTTCTATGGAGATGTTCGAAAGTATGGATTTAAATCCTTTAGCGGCTGAAGTCCTAGCAGTTCTCCACG
TAATTAGACTCATTCATCGAATGCAAATTCGAGAAGTTCATGTGTGTTCTGATTCTGTTAATGCCATTAGGATGATCAATAAGGAATTGGATACGACTACAGATGTGGCA
CATTGGATAGACAGTATCCAGGAAATGTGTAAGGATTTTCATGCTATTTCTTTCAATTTTGTTCCTAGAGATCAAAATTTAAGAGCTGATGTTTTAGCTAAACATGCTTT
AGAACATAGCAGTACCATGTTGTGGATATCAAATTTTCCAACATGGTTGGTGTCTATGGACCGAAAGGTTGAACCACTATATTTCTTGGTGTTCTTTTCTCTTTTGCTTA
ATTATTTGCTGCTGGAGATTTTCGTTCTGATTTGGCATATTTACATTCGCTTAAAATTCTACAAGTGGTATCAGAGCCATGTCTTGGGCGTAGCCGTGTCGGGTGGAATC
CTCGGGTGCCGAACAAAGAAAGTGTTGAGCCTTGAAGTAGTCATGGGGAGATCTGTGGTTGAGCTTTGGGTAGGCGAGTATACTTCGGAGAAGATGAGTCGTCCTGGATA
A
Protein sequenceShow/hide protein sequence
MHPSKAPGPDGFPALFYQKYWDAVGETNISNCLDILNQCRSAKDWNNTFITLIPKVKQPKRVSDFRPINLCNVACKIAAKGDPLSSYLFLLFSEVLSSLLSGENSCGVIT
ASTTQVTTLRKLLQAYEVASVQEINVTKSAMFFSQNVSSGERTMLQNIMQMSVVDSLGTYLGVPSSFSKNRKEDFNGVKQRVWQTLQGWKGHFFSMGGKEVLIKSVAQAI
SSFIMSCFRLPSTLCDDLHRMMTRWTKFCDLVSFNKALLAKKVWRIFINPNLLVSRVIQARYAHGTLLLTAPAKSNCSFFWRSCIWARSLLSQDPWIPKEITFTPIVKDP
SQNHENSLVAEFITTLNGWDISKLRDYVWDDDVRLITTIHIGLANVEDKWIWHYSENANLNSSEFEKACIAFWSIWNNRNSLRESKPIVEWSLQTESIINYWKETTHKKG
AQDSIDQNPTQQPIILSPNSVQVYTDAVVRPNRTGAGLGVVIIGEGNILHGSMEMFESMDLNPLAAEVLAVLHVIRLIHRMQIREVHVCSDSVNAIRMINKELDTTTDVA
HWIDSIQEMCKDFHAISFNFVPRDQNLRADVLAKHALEHSSTMLWISNFPTWLVSMDRKVEPLYFLVFFSLLLNYLLLEIFVLIWHIYIRLKFYKWYQSHVLGVAVSGGI
LGCRTKKVLSLEVVMGRSVVELWVGEYTSEKMSRPG