; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008280 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008280
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr9:16532305..16536067
RNA-Seq ExpressionLag0008280
SyntenyLag0008280
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]1.3e-11435.04Show/hide
Query:  SEILRQENLGLIRDCESMRVQINNVVERIIVIARRVGVFVDWELELRGGFTPSVPQHSGASATSVQPI-------FYTHAPQMPPLGGGAGNLGNGLENS
        S+I+ +++    +D + MR +INN+ E++  I   + +        +G       Q S     +  PI       ++ + P++ PL              
Subjt:  SEILRQENLGLIRDCESMRVQINNVVERIIVIARRVGVFVDWELELRGGFTPSVPQHSGASATSVQPI-------FYTHAPQMPPLGGGAGNLGNGLENS

Query:  ENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-----------------------------------
            SK KLD +E+RLRAIE  D +G+IDA QLCLVP ++IP KFK+PEF KYDG +CPR+HLI+                                   
Subjt:  ENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-----------------------------------

Query:  -AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIE
         AH+  WKDLA+ FLKQYK N+DM PDRLDLQR+EKKS+E+FKEYAQRWR+MAA++QPPLTDKE+T+MF +TLR+P+Y+ MIGN+S NFS+II IGERIE
Subjt:  -AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIE

Query:  YGVKYGRIVDTVSD---------SSQKKGRQAVQG----------------------------QVNYSSDAQNSNPSSAQRLFRNNSR------------
        YG+K+GR+ +  ++         S +K+G     G                             + Y+S       S   +   +NS             
Subjt:  YGVKYGRIVDTVSD---------SSQKKGRQAVQG----------------------------QVNYSSDAQNSNPSSAQRLFRNNSR------------

Query:  ----NYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNP
             +D IP+TYT+LLPQLIQ++QL  I M  +QPPYPKWY  N+RCDYH+  V                             E  +V +NPL +  NP
Subjt:  ----NYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNP

Query:  EVNVI-HVSSKCEG-------GSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPLLVKHSEYKVIEQ--------LFLNSK
        +VNV+  +  KC+            V+   + G + +  +   +  E + E  +           KD   + P++ K  EYK +            +  K
Subjt:  EVNVI-HVSSKCEG-------GSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPLLVKHSEYKVIEQ--------LFLNSK

Query:  PHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSST
        PHRK L+DILN+AHV H I VEK  GI+GNIT+SN+I FTD+EI P+                                           D SHIK S+ 
Subjt:  PHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSST

Query:  VVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASEC
        VV+AFDG+R+EV+G+I+LP+KIGP  FN+ FQ+      Y     RP                               D L+TK   TPYVE  EEA EC
Subjt:  VVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASEC

Query:  SFQSFEIANATIFTHAEDRSLE
        SF SFEIA+AT+     D  ++
Subjt:  SFQSFEIANATIFTHAEDRSLE

KAA0036947.1 uncharacterized protein E6C27_scaffold86G00270 [Cucumis melo var. makuwa]9.8e-11537.66Show/hide
Query:  LENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-------------------------------
        +E +ENT +K KLD +E+RLRAIE  D +G+IDA QLCLVP ++IP KFK+PEF+KYDG + PR+HLI+                               
Subjt:  LENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-------------------------------

Query:  -----AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIG
             AH+  WKDLA+ FLKQYKHN+DM PDRLDLQR+EKKS+E+FKEYAQRWR+M A++QPPLTDKE+T+MF +TLR+P+Y+ MIGN+S NFS+II IG
Subjt:  -----AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIG

Query:  ERIEYGVKYGRIVDTVSD---------SSQKKGR-QAVQGQVNYSSDAQNSNPSSAQRLFRNNSRNYDL-----IPITYTKLLPQLIQSKQLVPIHMELV
        ERIEYG+K+GR+ +  ++         S +K+G   A+   V+ +    NSN        + +  N+D      IP+TYT+LLPQLIQ++Q+  I M  +
Subjt:  ERIEYGVKYGRIVDTVSD---------SSQKKGR-QAVQGQVNYSSDAQNSNPSSAQRLFRNNSRNYDL-----IPITYTKLLPQLIQSKQLVPIHMELV

Query:  QPPYPKWYKLNSRCDYHSRAVEEA----------------------------PDVTQNPLSNHGNPEVNVI-HVSSKCEGGSNFV---YQCSKDGCLPYS
        QPPYPKWY  N+RCDYH+  V  +                            P+V +NPL +H NP+VNV+  +  KC+   + +    +   +      
Subjt:  QPPYPKWYKLNSRCDYHSRAVEEA----------------------------PDVTQNPLSNHGNPEVNVI-HVSSKCEGGSNFV---YQCSKDGCLPYS

Query:  EVVMCLMQENEREINSV----------NMII------------GEDTSVKDSFKQE----PLLVKHSEYKVI-------EQLFLNSKPHRKTLMDILNQA
         V+   + +N   I+ +          N+ +             E  +VK+  K +    P++ K  EYK +       E L +  +   ++++  LN+A
Subjt:  EVVMCLMQENEREINSV----------NMII------------GEDTSVKDSFKQE----PLLVKHSEYKVI-------EQLFLNSKPHRKTLMDILNQA

Query:  HVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVV
        HV H I VEK  GI+GNIT+SN+I FTD+EI P+                                           D SHIK S+ VV+AFDG+R+EV+
Subjt:  HVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVV

Query:  GEIKLPIKIGPRTFNVTFQIKVCFREY--------------------------FGEHI----RPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIF
        G+I+LP+KIGP  FN+ FQ+      Y                           G  +       D L+TK   TPYVE  EEA ECSF+SFEIA+AT+ 
Subjt:  GEIKLPIKIGPRTFNVTFQIKVCFREY--------------------------FGEHI----RPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIF

Query:  THAEDRSLE
            D  ++
Subjt:  THAEDRSLE

XP_022157707.1 uncharacterized protein LOC111024360 [Momordica charantia]5.7e-11538.15Show/hide
Query:  GFTPSVPQHSGASATSVQPIFYTHAPQMPPLGGGA-------------------------------GNLGNGLENSENTTSKGKLDFIEQRLRAIEGADT
        GFTP V  H+G +    QPIFYT  P M   G GA                                N G   E +E   SKGKLDF+E+RL+AIEGAD 
Subjt:  GFTPSVPQHSGASATSVQPIFYTHAPQMPPLGGGA-------------------------------GNLGNGLENSENTTSKGKLDFIEQRLRAIEGADT

Query:  FGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------------AHVQKWKDLANLFLKQYKHNLDM
        F SIDA QLCL+ D+VIP KFK+ EFEKYDG SCPR+HLI+                                    AHV KWK    +FL    +   +
Subjt:  FGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------------AHVQKWKDLANLFLKQYKHNLDM

Query:  VPDRLDLQR-LEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIEYGVKYGRIVDTVSDSSQKK----
            + +   LE KSTENFKEYAQRWR+MAAQIQ PLT+KELTTMF +TLRSPYYD MIGN+S +FS+IITI ERIEYGVKYGRI+DT SD+S+KK    
Subjt:  VPDRLDLQR-LEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIEYGVKYGRIVDTVSDSSQKK----

Query:  ------------GRQAVQGQV-------------NYS------SDAQNSNPSSAQRLFRNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQ------
                    G++ + G +             +YS      ++  +SN S  QR +RNNSR YD IP+TY +LLPQLI + Q VP+ ++L        
Subjt:  ------------GRQAVQGQV-------------NYS------SDAQNSNPSSAQRLFRNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQ------

Query:  --------PPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDT
                    +W  +  +   HS+     PDVTQNP  NHGNP VNVIH SS+ +   +FVYQ   D CL YSE   CL   N+ E + VNM+  E+ 
Subjt:  --------PPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDT

Query:  SVKDSFKQEPL-----------------------------------------------------------------------------------------
        S K SFK EPL                                                                                         
Subjt:  SVKDSFKQEPL-----------------------------------------------------------------------------------------

Query:  ---------------------------------------LVKHSEYKVIEQL------------FLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNI
                                               L+KH+EYKVIEQL            F+NSKPHRK LMD+LNQ+HVN+ I +EKL GIVGNI
Subjt:  ---------------------------------------LVKHSEYKVIEQL------------FLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNI

Query:  TASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTF
        T+SNTITF DEEI P+                                           D +HIKHSS VVR F GT + VVG+I+LPIKIGPRTFNVTF
Subjt:  TASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTF

Query:  QI
        Q+
Subjt:  QI

XP_031737553.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116402446 [Cucumis sativus]2.7e-11234.39Show/hide
Query:  SENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII----------------------------------
        +ENT +K KLD +E+RLRAIEG D +G+IDA QLCLVP ++IP KFK+PEF+KYDG SCPR+HLI+                                  
Subjt:  SENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII----------------------------------

Query:  --AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERI
          AH+  WKDLA+ FLKQYKHN+DM PDRLDLQR+EKKS+E+FKEYAQRWR++AA++QPPLTDKE+T MF +TLR+P+YD MIGN++ NFS+II IGERI
Subjt:  --AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERI

Query:  EYGVKYGRIVDTVSD-SSQKKGR--QAVQGQV----------NYSSDAQNSNPSSAQRLFRN-------------NSR--NYDLIPITYTKLLPQLIQSK
        EYG+K+GR+++T ++    KKG   +  +G+V          + S+  Q  +  S      N             NS    +D IP+TYT+LLPQL+ ++
Subjt:  EYGVKYGRIVDTVSD-SSQKKGR--QAVQGQV----------NYSSDAQNSNPSSAQRLFRN-------------NSR--NYDLIPITYTKLLPQLIQSK

Query:  QLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNPEVNVIH-VSSKCEG----------
        QL PI +  +QPPYPKWY  N+RCDYH+  V                             E PDV  NPL NH N +VN I     KC+           
Subjt:  QLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNPEVNVIH-VSSKCEG----------

Query:  -----------------GSNFVYQCSKDG--CLPY--------SEVVMCLMQ---ENEREINSVNMIIGEDTSVKDSFKQEPL-----------------
                           N  Y+   +G  C+ +        ++V++ + +   ++E + N +  ++GE    ++ F   PL                 
Subjt:  -----------------GSNFVYQCSKDG--CLPY--------SEVVMCLMQ---ENEREINSVNMIIGEDTSVKDSFKQEPL-----------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------LVKHSEYKVIEQ------------LFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPK----------------
               +VK SEYK+IEQ            LFLNS+PHRK L+DILN+AHV H I VEK  GI+G+IT+SN+I FTD+EI P+                
Subjt:  -------LVKHSEYKVIEQ------------LFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPK----------------

Query:  ---------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRP----------------
                                   D S+IK S+ VVRAFDG+R+EV+G+I+LPIKIGP TFN+ FQ+      Y     RP                
Subjt:  ---------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRP----------------

Query:  --------------RDVLVTKSTPTPYVETAEEASECSFQSFEIANATI
                       D L+TK   TPYVE  EEA ECSF+SFEIA+AT+
Subjt:  --------------RDVLVTKSTPTPYVETAEEASECSFQSFEIANATI

XP_031742474.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116404360 [Cucumis sativus]1.3e-11939.71Show/hide
Query:  SENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII----------------------------------
        +ENT +K KLD +E+RLRAIEG D +G+IDA QLCLVP ++IP KFK+PEF+KYDG SCPR+HLI+                                  
Subjt:  SENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII----------------------------------

Query:  --AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERI
          AH+  WKDLA+ FLKQYKHN+DM PDRLDLQR+EKKS+E+FKEYAQRWR++AA++QPPLTDKE+T MF +TLR+P+YD MIGN++ NFS+II IGERI
Subjt:  --AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERI

Query:  EYGVKYGRIVDTVSD-SSQKKGR--QAVQGQVN----------------------------------YSSDAQNSNPSSAQRLFRNN-------------
        EYG+K+GR+++T ++    KKG   +  +G+V+                                  Y++     +PS A +L  +N             
Subjt:  EYGVKYGRIVDTVSD-SSQKKGR--QAVQGQVN----------------------------------YSSDAQNSNPSSAQRLFRNN-------------

Query:  ---SRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCL
           +  +D IP+TYT+LLPQL+ ++QL PI +  +QPPYPKWY  N+RCDYH+  V  +   T+N L+      +     S K                 
Subjt:  ---SRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCL

Query:  PYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPL-----LVKHSEYKV-----------IEQLFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGI
                 + E+++E +    +  +D   K     E       +VK SEYK            +  LFLNS+PHRK L+DILN+AHV H I VEK  GI
Subjt:  PYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPL-----LVKHSEYKV-----------IEQLFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGI

Query:  VGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTF
        +G+IT+SN+I FTD+EI P+                                           D S+IK S+ VVRAFDG+R+EV+G+I+LPIKIGP TF
Subjt:  VGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTF

Query:  NVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASECSFQSFEIANATI
        N+ FQ+      Y     RP                               D L+TK   TPYVE  EEA ECSF+SFEIA+AT+
Subjt:  NVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASECSFQSFEIANATI

TrEMBL top hitse value%identityAlignment
A0A5A7T1W2 Retrotrans_gag domain-containing protein6.2e-11535.04Show/hide
Query:  SEILRQENLGLIRDCESMRVQINNVVERIIVIARRVGVFVDWELELRGGFTPSVPQHSGASATSVQPI-------FYTHAPQMPPLGGGAGNLGNGLENS
        S+I+ +++    +D + MR +INN+ E++  I   + +        +G       Q S     +  PI       ++ + P++ PL              
Subjt:  SEILRQENLGLIRDCESMRVQINNVVERIIVIARRVGVFVDWELELRGGFTPSVPQHSGASATSVQPI-------FYTHAPQMPPLGGGAGNLGNGLENS

Query:  ENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-----------------------------------
            SK KLD +E+RLRAIE  D +G+IDA QLCLVP ++IP KFK+PEF KYDG +CPR+HLI+                                   
Subjt:  ENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-----------------------------------

Query:  -AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIE
         AH+  WKDLA+ FLKQYK N+DM PDRLDLQR+EKKS+E+FKEYAQRWR+MAA++QPPLTDKE+T+MF +TLR+P+Y+ MIGN+S NFS+II IGERIE
Subjt:  -AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIE

Query:  YGVKYGRIVDTVSD---------SSQKKGRQAVQG----------------------------QVNYSSDAQNSNPSSAQRLFRNNSR------------
        YG+K+GR+ +  ++         S +K+G     G                             + Y+S       S   +   +NS             
Subjt:  YGVKYGRIVDTVSD---------SSQKKGRQAVQG----------------------------QVNYSSDAQNSNPSSAQRLFRNNSR------------

Query:  ----NYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNP
             +D IP+TYT+LLPQLIQ++QL  I M  +QPPYPKWY  N+RCDYH+  V                             E  +V +NPL +  NP
Subjt:  ----NYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNP

Query:  EVNVI-HVSSKCEG-------GSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPLLVKHSEYKVIEQ--------LFLNSK
        +VNV+  +  KC+            V+   + G + +  +   +  E + E  +           KD   + P++ K  EYK +            +  K
Subjt:  EVNVI-HVSSKCEG-------GSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPLLVKHSEYKVIEQ--------LFLNSK

Query:  PHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSST
        PHRK L+DILN+AHV H I VEK  GI+GNIT+SN+I FTD+EI P+                                           D SHIK S+ 
Subjt:  PHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSST

Query:  VVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASEC
        VV+AFDG+R+EV+G+I+LP+KIGP  FN+ FQ+      Y     RP                               D L+TK   TPYVE  EEA EC
Subjt:  VVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASEC

Query:  SFQSFEIANATIFTHAEDRSLE
        SF SFEIA+AT+     D  ++
Subjt:  SFQSFEIANATIFTHAEDRSLE

A0A5A7T5W2 Retrotrans_gag domain-containing protein4.7e-11537.66Show/hide
Query:  LENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-------------------------------
        +E +ENT +K KLD +E+RLRAIE  D +G+IDA QLCLVP ++IP KFK+PEF+KYDG + PR+HLI+                               
Subjt:  LENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII-------------------------------

Query:  -----AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIG
             AH+  WKDLA+ FLKQYKHN+DM PDRLDLQR+EKKS+E+FKEYAQRWR+M A++QPPLTDKE+T+MF +TLR+P+Y+ MIGN+S NFS+II IG
Subjt:  -----AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIG

Query:  ERIEYGVKYGRIVDTVSD---------SSQKKGR-QAVQGQVNYSSDAQNSNPSSAQRLFRNNSRNYDL-----IPITYTKLLPQLIQSKQLVPIHMELV
        ERIEYG+K+GR+ +  ++         S +K+G   A+   V+ +    NSN        + +  N+D      IP+TYT+LLPQLIQ++Q+  I M  +
Subjt:  ERIEYGVKYGRIVDTVSD---------SSQKKGR-QAVQGQVNYSSDAQNSNPSSAQRLFRNNSRNYDL-----IPITYTKLLPQLIQSKQLVPIHMELV

Query:  QPPYPKWYKLNSRCDYHSRAVEEA----------------------------PDVTQNPLSNHGNPEVNVI-HVSSKCEGGSNFV---YQCSKDGCLPYS
        QPPYPKWY  N+RCDYH+  V  +                            P+V +NPL +H NP+VNV+  +  KC+   + +    +   +      
Subjt:  QPPYPKWYKLNSRCDYHSRAVEEA----------------------------PDVTQNPLSNHGNPEVNVI-HVSSKCEGGSNFV---YQCSKDGCLPYS

Query:  EVVMCLMQENEREINSV----------NMII------------GEDTSVKDSFKQE----PLLVKHSEYKVI-------EQLFLNSKPHRKTLMDILNQA
         V+   + +N   I+ +          N+ +             E  +VK+  K +    P++ K  EYK +       E L +  +   ++++  LN+A
Subjt:  EVVMCLMQENEREINSV----------NMII------------GEDTSVKDSFKQE----PLLVKHSEYKVI-------EQLFLNSKPHRKTLMDILNQA

Query:  HVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVV
        HV H I VEK  GI+GNIT+SN+I FTD+EI P+                                           D SHIK S+ VV+AFDG+R+EV+
Subjt:  HVNHHIDVEKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVV

Query:  GEIKLPIKIGPRTFNVTFQIKVCFREY--------------------------FGEHI----RPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIF
        G+I+LP+KIGP  FN+ FQ+      Y                           G  +       D L+TK   TPYVE  EEA ECSF+SFEIA+AT+ 
Subjt:  GEIKLPIKIGPRTFNVTFQIKVCFREY--------------------------FGEHI----RPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIF

Query:  THAEDRSLE
            D  ++
Subjt:  THAEDRSLE

A0A5A7T5W6 RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein2.1e-11033.3Show/hide
Query:  RDCESMRVQINNVVERIIVIARRVGV-----FVDWELELR-----------GGFTP---SVPQHSGASATSVQPIFYTHAPQMPPLGGGAGNLG-----N
        +D + MR +INN+ E++  I   + +      VD                  GFTP   +VPQ    S T+   +       +PP+  G  +L       
Subjt:  RDCESMRVQINNVVERIIVIARRVGV-----FVDWELELR-----------GGFTP---SVPQHSGASATSVQPIFYTHAPQMPPLGGGAGNLG-----N

Query:  GLENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------
         +E +ENT +K KLD +E+RLRAIE  D +G+IDA QLCLVP ++IP KFK+PEF+KYDG +CPR+HLI+                              
Subjt:  GLENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------

Query:  ------AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITI
              AH+  WKDLA+ FLKQYKHN+DM PD LDLQR+EKKS+E+FKEYAQRWR+MAA++QPPLTDKE+T+MF +TLR+P+Y+ MIGN+S NFS+II I
Subjt:  ------AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITI

Query:  GERIEYGVKYGRIVDTVSD---------SSQKKG------------RQAVQGQVNYS------------------------SDAQNSNPSSAQRLF----
        GERIEYG+K+GR+ +  ++         S +K+G             +++ GQ  Y                         S+      S++ R F    
Subjt:  GERIEYGVKYGRIVDTVSD---------SSQKKG------------RQAVQGQVNYS------------------------SDAQNSNPSSAQRLF----

Query:  ----RNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLS
             +++  +D IP+TYT++LPQLIQ++QL  I M  +QPPY KWY  N+RCDYH+  V                             E  +V +NPL 
Subjt:  ----RNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLS

Query:  NHGNPEVNVI-HVSSKCEG------------------------------------------------GSNFVYQC----SKDGCLPYSEVVMCL---MQE
        +H NP+VN +     KC+                                                   + V QC    SK   L  S+++  +   + +
Subjt:  NHGNPEVNVI-HVSSKCEG------------------------------------------------GSNFVYQC----SKDGCLPYSEVVMCL---MQE

Query:  NEREINSV----------NMIIGEDTSV----------------KDSFKQEPLLVKHSEYKVIEQ--------LFLNSKPHRKTLMDILNQAHVNHHIDV
        N   I+ +          N+ +  +  +                KD   + P++ K  EYK +            +  KP RK L+DILN+AHV H I V
Subjt:  NEREINSV----------NMIIGEDTSV----------------KDSFKQEPLLVKHSEYKVIEQ--------LFLNSKPHRKTLMDILNQAHVNHHIDV

Query:  EKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIK
        EK  GI+GNIT+SN+I FTD+EI P+                                           D SHIK S+ VV+AFDG+R+EV+G+I+LP+K
Subjt:  EKLGGIVGNITASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIK

Query:  IGPRTFNVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASECSFQSFEIANATIFTHAEDRSL
        IGP  FN+ FQ+      Y     RP                               D L+TK   TPYVE  EEA ECSF+SFEIA+AT+     D  +
Subjt:  IGPRTFNVTFQIKVCFREYFGEHIRP------------------------------RDVLVTKSTPTPYVETAEEASECSFQSFEIANATIFTHAEDRSL

Query:  E
        +
Subjt:  E

A0A5A7V681 Retrotrans_gag domain-containing protein4.7e-10733.56Show/hide
Query:  GLENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------
        G + SE   S+ +L+F+E+RLR IEGAD +GSIDA QLCL+ DVVIP KFK P+FEKY+G SCP++HL++                              
Subjt:  GLENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------

Query:  ------AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITI
              + V +WKDLA+ FLKQYK+N+DM PDRLDLQR+EKK+ E FKEYAQRWRE+AAQ+QPP TDKELT MF +TLR+PYYD M+G++S NFS++ITI
Subjt:  ------AHVQKWKDLANLFLKQYKHNLDMVPDRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITI

Query:  GERIEYGVKYGRIVDTVSDS------SQKKGR---------------QAVQGQVNYSSDAQNSNPS----SAQRLFRNN--SRNYDLIPITYTKLLPQLI
        GERIE+GVK  RI D  S++       +K+G                  + GQ N+S   QN   S    S QR  RNN     +D IP++YT+LLPQLI
Subjt:  GERIEYGVKYGRIVDTVSDS------SQKKGR---------------QAVQGQVNYSSDAQNSNPS----SAQRLFRNN--SRNYDLIPITYTKLLPQLI

Query:  QSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNPEVNVI----------------
        +S Q+  +  E +QPPYPKWY  N++C+YH+ AV                             E PDV QNPL NH  P +N +                
Subjt:  QSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAV----------------------------EEAPDVTQNPLSNHGNPEVNVI----------------

Query:  -------------HVSSKCEG------GSNFVYQC---------SKDGCLPYSEVVMCLMQE----------NEREINSV-NMIIGEDTSVKDSFKQEPL
                     ++S +         G     QC         S + C  +   V  LM             E E+N + N    E TS + +F  +P 
Subjt:  -------------HVSSKCEG------GSNFVYQC---------SKDGCLPYSEVVMCLMQE----------NEREINSV-NMIIGEDTSVKDSFKQEPL

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------LVKHSEYKVIEQ------------LFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEE
                                   L+K SEYKVIEQ            LF+ S+PHRK L+DILN+AHV H I V  L  IV NI A+N I+FTDEE
Subjt:  ---------------------------LVKHSEYKVIEQ------------LFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEE

Query:  ISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQI----------
        I P+                                           DPS+++ S+ VVRAFDG R+EV+G+I +P+KIGP TFNV+FQ+          
Subjt:  ISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQI----------

Query:  ------------------KVCFREYFGEHI--RPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIF
                          ++ F    G+ I     D+ VTK++  PYVE  EEA ECS++SFEIANATIF
Subjt:  ------------------KVCFREYFGEHI--RPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIF

A0A6J1DV70 uncharacterized protein LOC1110243602.8e-11538.15Show/hide
Query:  GFTPSVPQHSGASATSVQPIFYTHAPQMPPLGGGA-------------------------------GNLGNGLENSENTTSKGKLDFIEQRLRAIEGADT
        GFTP V  H+G +    QPIFYT  P M   G GA                                N G   E +E   SKGKLDF+E+RL+AIEGAD 
Subjt:  GFTPSVPQHSGASATSVQPIFYTHAPQMPPLGGGA-------------------------------GNLGNGLENSENTTSKGKLDFIEQRLRAIEGADT

Query:  FGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------------AHVQKWKDLANLFLKQYKHNLDM
        F SIDA QLCL+ D+VIP KFK+ EFEKYDG SCPR+HLI+                                    AHV KWK    +FL    +   +
Subjt:  FGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLII------------------------------------AHVQKWKDLANLFLKQYKHNLDM

Query:  VPDRLDLQR-LEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIEYGVKYGRIVDTVSDSSQKK----
            + +   LE KSTENFKEYAQRWR+MAAQIQ PLT+KELTTMF +TLRSPYYD MIGN+S +FS+IITI ERIEYGVKYGRI+DT SD+S+KK    
Subjt:  VPDRLDLQR-LEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIEYGVKYGRIVDTVSDSSQKK----

Query:  ------------GRQAVQGQV-------------NYS------SDAQNSNPSSAQRLFRNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQ------
                    G++ + G +             +YS      ++  +SN S  QR +RNNSR YD IP+TY +LLPQLI + Q VP+ ++L        
Subjt:  ------------GRQAVQGQV-------------NYS------SDAQNSNPSSAQRLFRNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQ------

Query:  --------PPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDT
                    +W  +  +   HS+     PDVTQNP  NHGNP VNVIH SS+ +   +FVYQ   D CL YSE   CL   N+ E + VNM+  E+ 
Subjt:  --------PPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCLPYSEVVMCLMQENEREINSVNMIIGEDT

Query:  SVKDSFKQEPL-----------------------------------------------------------------------------------------
        S K SFK EPL                                                                                         
Subjt:  SVKDSFKQEPL-----------------------------------------------------------------------------------------

Query:  ---------------------------------------LVKHSEYKVIEQL------------FLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNI
                                               L+KH+EYKVIEQL            F+NSKPHRK LMD+LNQ+HVN+ I +EKL GIVGNI
Subjt:  ---------------------------------------LVKHSEYKVIEQL------------FLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNI

Query:  TASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTF
        T+SNTITF DEEI P+                                           D +HIKHSS VVR F GT + VVG+I+LPIKIGPRTFNVTF
Subjt:  TASNTITFTDEEISPK-------------------------------------------DPSHIKHSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTF

Query:  QI
        Q+
Subjt:  QI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAGAGCATCGAATTCGAAGGGTTCAAGAGTCAGCAGCGAAGATTCTATCAGAGCATAACCAGTTGAAAGGGAGATGCGATTTACTGTTGAGCAATCAAGCTAA
TTCAGAGATTCTTCGTCAAGAAAACCTCGGTTTAATCAGAGATTGTGAAAGTATGAGAGTCCAAATAAATAACGTTGTGGAAAGAATAATAGTAATAGCCAGAAGAGTTG
GAGTTTTCGTTGATTGGGAACTTGAGCTCAGAGGAGGTTTTACCCCATCAGTACCTCAGCATAGTGGGGCATCAGCTACATCTGTACAACCGATATTCTATACTCATGCA
CCTCAAATGCCTCCATTAGGAGGAGGGGCAGGAAATTTGGGAAATGGGTTGGAGAATAGTGAAAACACTACATCCAAAGGGAAGTTGGATTTCATAGAGCAAAGATTGAG
GGCAATAGAAGGTGCTGACACATTTGGGAGCATAGATGCAATACAACTATGTTTGGTTCCTGATGTTGTTATTCCTCAAAAATTCAAGTTACCCGAATTCGAGAAGTATG
ATGGGATATCTTGTCCAAGAAACCATTTGATTATAGCTCATGTTCAGAAGTGGAAGGACTTGGCAAATCTATTTCTCAAACAATATAAACATAATCTTGATATGGTTCCT
GACCGTTTAGATCTCCAGCGACTAGAAAAGAAGAGTACTGAGAATTTTAAGGAGTATGCGCAAAGATGGAGAGAAATGGCAGCCCAAATTCAACCGCCACTTACAGATAA
AGAATTAACGACCATGTTCAAACATACGCTTCGATCCCCTTATTACGATAGTATGATTGGGAACTCTTCCATGAATTTCTCAAATATCATAACTATCGGAGAAAGAATCG
AGTATGGAGTGAAATATGGAAGAATTGTTGACACAGTATCGGACTCGAGCCAGAAAAAAGGTCGTCAGGCGGTACAGGGGCAAGTGAATTATTCATCAGATGCTCAAAAT
TCCAACCCGTCGAGTGCTCAACGACTTTTTAGAAACAACTCAAGGAATTATGATCTCATCCCTATAACTTATACTAAACTTCTCCCACAATTGATTCAAAGTAAGCAACT
TGTTCCAATACATATGGAACTCGTGCAACCACCATATCCAAAATGGTACAAGCTAAATTCAAGATGTGACTATCATTCGAGAGCGGTAGAAGAGGCACCAGATGTCACAC
AAAATCCACTCTCTAATCATGGCAACCCAGAAGTCAATGTCATTCATGTATCATCAAAATGTGAAGGAGGGTCGAACTTTGTATATCAATGTTCAAAGGATGGTTGTTTA
CCTTACAGTGAAGTAGTGATGTGCCTAATGCAAGAAAATGAACGAGAAATTAATTCTGTAAACATGATCATTGGAGAAGATACTTCAGTAAAAGATTCATTTAAGCAAGA
ACCCCTGTTGGTCAAGCACAGTGAATATAAGGTGATCGAACAACTGTTCTTAAATTCAAAGCCACATCGTAAAACTTTAATGGATATTTTGAACCAAGCACATGTGAATC
ATCACATCGATGTAGAAAAGTTGGGTGGAATCGTAGGAAACATTACTGCTTCCAACACTATTACATTTACAGACGAAGAGATTTCGCCAAAAGATCCATCTCATATTAAA
CACAGTAGCACTGTAGTAAGAGCTTTCGATGGGACTCGTAAAGAAGTTGTGGGGGAAATCAAATTACCCATTAAAATTGGACCTCGTACCTTCAACGTGACATTTCAAAT
TAAAGTTTGTTTTCGAGAATACTTTGGTGAGCATATTCGGCCAAGAGATGTGTTGGTCACTAAATCTACCCCAACCCCTTATGTGGAGACAGCAGAGGAGGCTTCGGAAT
GTTCTTTTCAATCTTTCGAGATCGCAAATGCTACTATTTTTACCCATGCGGAGGATCGAAGTCTAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAGAGCATCGAATTCGAAGGGTTCAAGAGTCAGCAGCGAAGATTCTATCAGAGCATAACCAGTTGAAAGGGAGATGCGATTTACTGTTGAGCAATCAAGCTAA
TTCAGAGATTCTTCGTCAAGAAAACCTCGGTTTAATCAGAGATTGTGAAAGTATGAGAGTCCAAATAAATAACGTTGTGGAAAGAATAATAGTAATAGCCAGAAGAGTTG
GAGTTTTCGTTGATTGGGAACTTGAGCTCAGAGGAGGTTTTACCCCATCAGTACCTCAGCATAGTGGGGCATCAGCTACATCTGTACAACCGATATTCTATACTCATGCA
CCTCAAATGCCTCCATTAGGAGGAGGGGCAGGAAATTTGGGAAATGGGTTGGAGAATAGTGAAAACACTACATCCAAAGGGAAGTTGGATTTCATAGAGCAAAGATTGAG
GGCAATAGAAGGTGCTGACACATTTGGGAGCATAGATGCAATACAACTATGTTTGGTTCCTGATGTTGTTATTCCTCAAAAATTCAAGTTACCCGAATTCGAGAAGTATG
ATGGGATATCTTGTCCAAGAAACCATTTGATTATAGCTCATGTTCAGAAGTGGAAGGACTTGGCAAATCTATTTCTCAAACAATATAAACATAATCTTGATATGGTTCCT
GACCGTTTAGATCTCCAGCGACTAGAAAAGAAGAGTACTGAGAATTTTAAGGAGTATGCGCAAAGATGGAGAGAAATGGCAGCCCAAATTCAACCGCCACTTACAGATAA
AGAATTAACGACCATGTTCAAACATACGCTTCGATCCCCTTATTACGATAGTATGATTGGGAACTCTTCCATGAATTTCTCAAATATCATAACTATCGGAGAAAGAATCG
AGTATGGAGTGAAATATGGAAGAATTGTTGACACAGTATCGGACTCGAGCCAGAAAAAAGGTCGTCAGGCGGTACAGGGGCAAGTGAATTATTCATCAGATGCTCAAAAT
TCCAACCCGTCGAGTGCTCAACGACTTTTTAGAAACAACTCAAGGAATTATGATCTCATCCCTATAACTTATACTAAACTTCTCCCACAATTGATTCAAAGTAAGCAACT
TGTTCCAATACATATGGAACTCGTGCAACCACCATATCCAAAATGGTACAAGCTAAATTCAAGATGTGACTATCATTCGAGAGCGGTAGAAGAGGCACCAGATGTCACAC
AAAATCCACTCTCTAATCATGGCAACCCAGAAGTCAATGTCATTCATGTATCATCAAAATGTGAAGGAGGGTCGAACTTTGTATATCAATGTTCAAAGGATGGTTGTTTA
CCTTACAGTGAAGTAGTGATGTGCCTAATGCAAGAAAATGAACGAGAAATTAATTCTGTAAACATGATCATTGGAGAAGATACTTCAGTAAAAGATTCATTTAAGCAAGA
ACCCCTGTTGGTCAAGCACAGTGAATATAAGGTGATCGAACAACTGTTCTTAAATTCAAAGCCACATCGTAAAACTTTAATGGATATTTTGAACCAAGCACATGTGAATC
ATCACATCGATGTAGAAAAGTTGGGTGGAATCGTAGGAAACATTACTGCTTCCAACACTATTACATTTACAGACGAAGAGATTTCGCCAAAAGATCCATCTCATATTAAA
CACAGTAGCACTGTAGTAAGAGCTTTCGATGGGACTCGTAAAGAAGTTGTGGGGGAAATCAAATTACCCATTAAAATTGGACCTCGTACCTTCAACGTGACATTTCAAAT
TAAAGTTTGTTTTCGAGAATACTTTGGTGAGCATATTCGGCCAAGAGATGTGTTGGTCACTAAATCTACCCCAACCCCTTATGTGGAGACAGCAGAGGAGGCTTCGGAAT
GTTCTTTTCAATCTTTCGAGATCGCAAATGCTACTATTTTTACCCATGCGGAGGATCGAAGTCTAGAATAG
Protein sequenceShow/hide protein sequence
MKEEHRIRRVQESAAKILSEHNQLKGRCDLLLSNQANSEILRQENLGLIRDCESMRVQINNVVERIIVIARRVGVFVDWELELRGGFTPSVPQHSGASATSVQPIFYTHA
PQMPPLGGGAGNLGNGLENSENTTSKGKLDFIEQRLRAIEGADTFGSIDAIQLCLVPDVVIPQKFKLPEFEKYDGISCPRNHLIIAHVQKWKDLANLFLKQYKHNLDMVP
DRLDLQRLEKKSTENFKEYAQRWREMAAQIQPPLTDKELTTMFKHTLRSPYYDSMIGNSSMNFSNIITIGERIEYGVKYGRIVDTVSDSSQKKGRQAVQGQVNYSSDAQN
SNPSSAQRLFRNNSRNYDLIPITYTKLLPQLIQSKQLVPIHMELVQPPYPKWYKLNSRCDYHSRAVEEAPDVTQNPLSNHGNPEVNVIHVSSKCEGGSNFVYQCSKDGCL
PYSEVVMCLMQENEREINSVNMIIGEDTSVKDSFKQEPLLVKHSEYKVIEQLFLNSKPHRKTLMDILNQAHVNHHIDVEKLGGIVGNITASNTITFTDEEISPKDPSHIK
HSSTVVRAFDGTRKEVVGEIKLPIKIGPRTFNVTFQIKVCFREYFGEHIRPRDVLVTKSTPTPYVETAEEASECSFQSFEIANATIFTHAEDRSLE