CuGenDBv2

Lag0032313 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032313
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr11:30308480..30311705
RNA-Seq Expression: Lag0032313
Synteny: Lag0032313
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR025558 - Domain of unknown function DUF4283
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_017217082.1 PREDICTED: uncharacterized protein LOC108194640 [Daucus carota subsp. sativus] (e value: 2.0e-61, % identity: 34.26)
Query:  QHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQD
        + + W       ++           + K+   F +E+ W +   C ++I++     G  NS    +  L  C ++LK W       + R I+ +++ + D
Subjt:  QHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQD

Query:  LYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSE
        L     P  + EIK  E +L+  LE++E+YW+QRSR  WL+ GDKNTR+FH++A+ R+K+NEI+ ++D +G   V++ ++      +F            
Subjt:  LYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSE

Query:  DIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHG
         +D+   D+          +  CL +LN    +  +NDT +ALI K++ P+ + +FRPISLCNV YKI++K + NR+K +L E+ISENQSAFV  R IH 
Subjt:  DIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHG

Query:  NVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF
        NVIIG+E LH  K  R  +G  +ALKLDM+KAYDRVEW F+E +++K+G++  WV      C  S++++    G        +R    G  +S + F
Subjt:  NVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 3.1e-83, % identity: 30.27)
Query:  DLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLI-MEKLEANLFLFSLRLEVDQMRVLRQEPCLFD
        DLLE W+ F L++ EE T +DVD  A   T   L   L GKL     I   VM  T + AW + N    ++ L  NLFLFS    +D+ ++ +  P  FD
Subjt:  DLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLI-MEKLEANLFLFSLRLEVDQMRVLRQEPCLFD

Query:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGS-----------------------------------------------VWTPIK
        + L+ ++KP+ ++ P+ ++F     WV F +LP+    R MA RLG+                                                W PI+
Subjt:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGS-----------------------------------------------VWTPIK

Query:  YEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLT-VETGTRLDNSRKSSPMTGS
        YE+LPD C  CG                 SS ++ +YG W+ +          P+   +   Q  +   + N S   S + V  G++     +S+P TG 
Subjt:  YEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLT-VETGTRLDNSRKSSPMTGS

Query:  GSRPMDISPVME---EDGPVSAIGNIP-SINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLA------------DGPMKRAKE
         + PM+ SPV E   +    S  G  P  I+ GE  +    +S + P LK      + +   S++         G                   + R   
Subjt:  GSRPMDISPVME---EDGPVSAIGNIP-SINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLA------------DGPMKRAKE

Query:  DGDAT---TNGSVQHLNWAQSNHRPILFNTCQVHG----------------NGYQNKRPRLFRFEEVWTQHPK--CKEIITH-----QGCWTGQGNSSSR
        + DA+     G +  + W         ++T Q+                   G        F  +++W +  +  C +   +      G W+   ++ S 
Subjt:  DGDAT---TNGSVQHLNWAQSNHRPILFNTCQVHG----------------NGYQNKRPRLFRFEEVWTQHPK--CKEIITH-----QGCWTGQGNSSSR

Query:  FESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIR
        F   +Q+  S L+HWGR     +++QI+  +  + D Y++P P DF  I  +E+ L   LE +EI+WKQRSRE+WL+WG         +A I      I 
Subjt:  FESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIR

Query:  EVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNV
           ++N  L+      +E  EL    MF       +    AL      + VG  T+  CL+ LN    I+ WN T+IALI K+K P+ ISDFRPISLCNV
Subjt:  EVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNV

Query:  SYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSD
        SYKII+K + NR+K  +  +IS+ QSAFVPSR+I  NVIIG+ECLHTI S ++   G  ALKLD+SKA+DRVEW +LE ++ K+GF+  W++     C  
Subjt:  SYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSD

Query:  SILFNI
        ++ F+I
Subjt:  SILFNI

XP_024038343.1 uncharacterized protein LOC112097373 [Citrus clementina] (e value: 7.2e-64, % identity: 35.73)
Query:  DATTNGSVQHLNWAQSNHRPILFNTCQVHGNG--YQNKRPRLFRFEEVWTQHPKCKEII----THQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSS
        D  ++G+  +++   S+H P++    QV G+G  +  +R  L  +E++W+ +  CKEII    + QGCW    N  S F+   ++  +RL  W +  F  
Subjt:  DATTNGSVQHLNWAQSNHRPILFNTCQVHGNG--YQNKRPRLFRFEEVWTQHPKCKEII----THQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSS

Query:  IWRQIETNQRILQDL-YSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFE
          +++E     L+ L  S+      ++IK VE Q+   L +DEIYWKQRSR +WL+ GDKNT++FH++A+ R+K+N I  +++  GN I +   +E  F 
Subjt:  IWRQIETNQRILQDL-YSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFE

Query:  LYFSNMFSLSNPNSEDIDIALQDIPVR---------------------------------------------NEVGYITVLNCLDMLNMVRSIRPWNDTF
         YF+N+F+ S PN + I  AL  I  R                                               V    +  CL +LN    + P+N T+
Subjt:  LYFSNMFSLSNPNSEDIDIALQDIPVR---------------------------------------------NEVGYITVLNCLDMLNMVRSIRPWNDTF

Query:  IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCF
        I LISK   P+ ++DFRPISLCNV Y+I+AK + NR+K  L  +IS  QSAF+P+  I  N+I+GYECLH I+  +    G +ALKLD+SKAYD++EW F
Subjt:  IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCF

Query:  LERLLLKIGFHSQWVKV
        LE+ +  +GF   WV +
Subjt:  LERLLLKIGFHSQWVKV

XP_042962496.1 uncharacterized protein LOC122296768 [Carya illinoinensis] (e value: 1.1e-61, % identity: 25.6)
Query:  MVYEDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPC
        M  EDL++ WER  L+  EE+    V+ + +    +     + G+ +    +  +    T    W +   +   ++    F+   +   D+ +VL   P 
Subjt:  MVYEDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPC

Query:  LFDKFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSM----AERLGSV-----------------------------------------WTPI
         FD+ LL L +    V     +F++  FWV    LP+   N  +    A  +G V                                         W   
Subjt:  LFDKFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSM----AERLGSV-----------------------------------------WTPI

Query:  KYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGS
        KYE+L + C  CG ++H  ++C     E+  + +       + F  R +S            Q L +  P       P +        +         G+
Subjt:  KYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGS

Query:  GSRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLADGPMKRAKEDGDATTNGSVQHLNWA
        GSRP      +EE   V  + ++  I+      T             W    R             K+ I   LA+      KE  +  +N S   L   
Subjt:  GSRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLADGPMKRAKEDGDATTNGSVQHLNWA

Query:  QSNHRP---ILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQG---NSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQD
        QS+H P   +L N+ +V+         R FR+E  W     CK+++ +   W G       ++     L +C+  L  W +   +   + I    R ++ 
Subjt:  QSNHRP---ILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQG---NSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQD

Query:  LYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSE
        L          E++ ++  ++ AL  +E+ W+QR++++W++ GD+NT +FH QA+ RRK N I  ++D  G ++  Q  +  AF  YFS++F+ S+P++ 
Subjt:  LYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSE

Query:  DIDIALQDIPVRNE-------------------------------------------VGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFR
        +  +   +  + NE                                           VG       L +LN   S    NDT I+LI KVK+P  I DFR
Subjt:  DIDIALQDIPVRNE-------------------------------------------VGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFR

Query:  PISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKV
        PISLCNV YKII+K + NR K  L ++IS NQ+AFVP R I  NV++ YE LH++ ++     G +ALKLDMSKAYDR+EW F+E ++ K+GF  QW+ +
Subjt:  PISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKV

Query:  DYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF
            C  SI ++I   G  + N    R    G  +S + F
Subjt:  DYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF

XP_042962692.1 uncharacterized protein LOC122296963 [Carya illinoinensis] (e value: 7.4e-61, % identity: 24.19)
Query:  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFD
        EDL E W+   L   EE   +++D + +         SL GKL +  +I  +VM  T    W I       ++  N F        D+ +V    P LFD
Subjt:  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFD

Query:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGS------------------------------------------VWTPIKYEKLP
          L+ L +         + F   +FWV F  LP+        E++GS                                          +W P  YEK+P
Subjt:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGS------------------------------------------VWTPIKYEKLP

Query:  DICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMW--------------MAFNDRTSSVFRSPSNSPI--------------------------------
         IC  CG I+HG+ +C       G  +   +YG W              M  N+  +  ++   N P                                 
Subjt:  DICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMW--------------MAFNDRTSSVFRSPSNSPI--------------------------------

Query:  ---GNQQ--------------------------LMVASPNRNPSLQPSLT----------VETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGPVSAIG
           GN++                          +++    +  +++  L            E G+ L +      + GS S  + +  V      +SA  
Subjt:  ---GNQQ--------------------------LMVASPNRNPSLQPSLT----------VETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGPVSAIG

Query:  NIPSINAGEATVTAMYLSKIGPK-------LKHWKRKARKN---VGGSISSEASEKKCIGEVLADGPMKRAKE---DGDATTNG----------------
         +      +  +T  Y +    K       LK  K K  +    VG       +++K  G+V  DG M+  +E   +GD    G                
Subjt:  NIPSINAGEATVTAMYLSKIGPK-------LKHWKRKARKN---VGGSISSEASEKKCIGEVLADGPMKRAKE---DGDATTNG----------------

Query:  --------SVQHLNWAQ--------------SNHRPIL--FNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRS
                +V +  W                S+H+P+L   N           ++ R F++E  W    +C+ ++  +  W   G  +      L + R 
Subjt:  --------SVQHLNWAQ--------------SNHRPIL--FNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRS

Query:  RLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLI
         L+ W +       ++IE   + L+ L ++    +  EI++    L   LE+++++WKQR++ NW ++GD+NT++FH  A  R+KRN I+E++D + N++
Subjt:  RLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLI

Query:  VDQRQMEEAFELYFSNMFSLSNPNSEDIDIAL-----------------------------QDIPVR-------------NEVGYITV---LNCLDMLNM
           +Q+EE F  YF  +F    P+  +I+  L                             Q  P++             N  G ++       L +LN 
Subjt:  VDQRQMEEAFELYFSNMFSLSNPNSEDIDIAL-----------------------------QDIPVR-------------NEVGYITV---LNCLDMLNM

Query:  VRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDM
               N T++ALI K K P+ ++++RPISLCNV YKI++K + NR K  L EIIS  QSAF+P R I+ N++I YE LH++++++    G +A+KLDM
Subjt:  VRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDM

Query:  SKAYDRVEWCFLERLLLKIGFHSQWV
        SKAYDRVEW FLE +L K+GF +QWV
Subjt:  SKAYDRVEWCFLERLLLKIGFHSQWV

TrEMBL top hits (e value, % identity, alignment)
A0A2N9FMJ0 Reverse transcriptase domain-containing protein (e value: 1.4e-65, % identity: 25.64)
Query:  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFD
        E+L   W+RFSL      TE + D+     +  S  F+L  K     II  + + RTFKP W    G +   +  N+ LF    + D  RVL  EP  +D
Subjt:  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFD

Query:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGSV----------------------------------------------WTPIKY
        K+L+   +    ++  ++ F+ A FWV    LP+    R +AE LGS                                               W   KY
Subjt:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGSV----------------------------------------------WTPIKY

Query:  EKLPDICAFCGRIDHGMRDCTFNYLESGS--SSRRQEYGMWM-AFNDRTSSVF--------RSPSNSPIGNQQLMVA----------SPNRN--------
        E+LP+ C +CG + HG +DC + +L +    +   Q YG W+ A  DR +           +S +N P  +Q+   A          SPN          
Subjt:  EKLPDICAFCGRIDHGMRDCTFNYLESGS--SSRRQEYGMWM-AFNDRTSSVF--------RSPSNSPIGNQQLMVA----------SPNRN--------

Query:  ----PSLQPSLT----VETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGP--VSAIGNIPSINAGEATVTAMYLSKIG---------PKLKHWKRKARK
            P L P  +    +   T  +  R+     G     +D+  V E   P  ++    IPS+       T + L ++          P    WK+KAR 
Subjt:  ----PSLQPSLT----VETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGP--VSAIGNIPSINAGEATVTAMYLSKIG---------PKLKHWKRKARK

Query:  NVGGS-----------------ISSEASEKK---------------------------------------------CIG---EVLADGPMKRAK------
           G                  I S+A                                                 C+G   E++ +  M+  +      
Subjt:  NVGGS-----------------ISSEASEKK---------------------------------------------CIG---EVLADGPMKRAK------

Query:  ----------------------------EDGDATT------------------NGSVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHP
                                     D   TT                  +  ++H++   S+H+ +L  T + H   +  ++P  FRFEEVWT   
Subjt:  ----------------------------EDGDATT------------------NGSVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHP

Query:  KCKEIITHQGCWTGQGNSSSRFE--SCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDL-YSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWL
         C+  IT +  W  + + ++ F+  + L  C+  L +W R +F +I +Q+   +R+L+          +   +K ++ +++  L ++E  W+QRSR NWL
Subjt:  KCKEIITHQGCWTGQGNSSSRFE--SCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDL-YSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWL

Query:  QWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIP-----------VR-----------------
        + GD+NTR+FH +A+ RR+RN I  ++D  G       ++      Y+ ++F  S P  + ID A++ +P           VR                 
Subjt:  QWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIP-----------VR-----------------

Query:  -----------------NEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPS
                         + VG       L  LN  + +   N T+I LI KVK+P+ +++FRPISLCNV YK+++KV+ NR+K  L +IISE+QSAFVP 
Subjt:  -----------------NEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPS

Query:  RSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFNI
        R I  N+++ +E LH + S R    G +ALKLDMSKAYDRVEW FLE+++ K+GFH++++ +    C  S+ ++I
Subjt:  RSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFNI

A0A2N9INH4 Reverse transcriptase domain-containing protein (e value: 3.4e-67, % identity: 27.5)
Query:  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFD
        E L + W+RFSLS  ++  +VD+    A  T+QS    L  K L   ++  D + RTFKP W      +++ L  N         +D  RVL  EP  FD
Subjt:  EDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLEVDQMRVLRQEPCLFD

Query:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGSV----------------------------------------------WTPIKY
        KFL+   +    +    + F    FWV    LP+       A  +G                                                W   +Y
Subjt:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGSV----------------------------------------------WTPIKY

Query:  EKLPDICAFCGRIDHGMRDCTFNYLESG-SSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGSG
        E+LP+ C  CG +DH  +DC     +    SS   +YG WM      + + R P  + I +   ++A+ + +  L  S+ +   T +     S  ++ S 
Subjt:  EKLPDICAFCGRIDHGMRDCTFNYLESG-SSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGSG

Query:  SRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKR-KARKNVGGSISSEASEKKCIGEVLADGPMKRA----KEDGDATTNG----
                            ++P    G+          IG + ++ ++ +  +NV           + +     +     A    + D   TTN     
Subjt:  SRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKR-KARKNVGGSISSEASEKKCIGEVLADGPMKRA----KEDGDATTNG----

Query:  ----SVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETN
             V HL   +S+H+PI   T  +     Q  +P+LFRF+E+W     C+E IT       +G+   + +  +  C   L  W R  F S+ + I   
Subjt:  ----SVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETN

Query:  QRIL-----QDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFS
        +  L     Q + S+       ++  +  +L+    ++   W+QRSR  WL+ GD+NT++FHN+AT R++RN +  ++D  G LI D  ++ + F  Y+ 
Subjt:  QRIL-----QDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFS

Query:  NMFSLS------------NPN--------------SEDIDIALQDI-PVR----------------NEVGYITVLNCLDMLNMVRSIRPWNDTFIALISK
        ++F  +            NP+               +++++AL+ + P++                N VG  TV   L  +N    +   N TF+ALI K
Subjt:  NMFSLS------------NPN--------------SEDIDIALQDI-PVR----------------NEVGYITVLNCLDMLNMVRSIRPWNDTFIALISK

Query:  VKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLL
        VK+P+ ++D+R ISLCNV YK+I+KVL N +K  L  +I+E QSAFVP R I  NV+I +E LH + + R    G +ALKLDMSKAYDRVEW FL  +++
Subjt:  VKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLL

Query:  KIGFHSQWVKV
        K+GFHS+WV +
Subjt:  KIGFHSQWVKV

A0A2N9IWN7 Uncharacterized protein (e value: 4.8e-66, % identity: 36.64)
Query:  VQHLNWAQSNHRPILFNTCQVHGNGYQNKRP---RLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFE--SCLQSCRSRLKHWGRGTFSSIWRQIETN
        V HL+   S+H PI      +  +   N RP   R+FRF+E+W  H  CKE IT    W  Q + ++ F+    L+SCR+ L+ W R +F ++ R+++  
Subjt:  VQHLNWAQSNHRPILFNTCQVHGNGYQNKRP---RLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFE--SCLQSCRSRLKHWGRGTFSSIWRQIETN

Query:  QRILQDLYSKPPPWDFHE-IKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFS
          +L++  S+      HE    ++ ++   L  +E  W+QRSR+ WL+WGDK+T +FH+ AT RR+RN I E+QDI+GN       +   FE +F  +FS
Subjt:  QRILQDLYSKPPPWDFHE-IKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFS

Query:  LSNP--------------------------NSEDIDIALQDIPVRNEVG--------------------YITVLNCLDMLNMVRSIRPWNDTFIALISKV
         S+P                           +E++D AL+ +      G                       +L+CL+  ++++++   N T+I LI K 
Subjt:  LSNP--------------------------NSEDIDIALQDIPVRNEVG--------------------YITVLNCLDMLNMVRSIRPWNDTFIALISKV

Query:  KHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLK
        + P+ +SDFRPISLCNV YKI++KV+ NR+K  L +IISE QSAFVP R I  N+++ +E LH +K+ R+   G +ALKLDMSKAYDRVEW FL++++LK
Subjt:  KHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLK

Query:  IGFHSQWVKVDYGMCSDSILFNI
        +GF +QWV +    C  ++ F+I
Subjt:  IGFHSQWVKVDYGMCSDSILFNI

A0A6J1DX30 uncharacterized protein LOC111024874 (e value: 1.5e-83, % identity: 30.27)
Query:  DLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLI-MEKLEANLFLFSLRLEVDQMRVLRQEPCLFD
        DLLE W+ F L++ EE T +DVD  A   T   L   L GKL     I   VM  T + AW + N    ++ L  NLFLFS    +D+ ++ +  P  FD
Subjt:  DLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLI-MEKLEANLFLFSLRLEVDQMRVLRQEPCLFD

Query:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGS-----------------------------------------------VWTPIK
        + L+ ++KP+ ++ P+ ++F     WV F +LP+    R MA RLG+                                                W PI+
Subjt:  KFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGS-----------------------------------------------VWTPIK

Query:  YEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLT-VETGTRLDNSRKSSPMTGS
        YE+LPD C  CG                 SS ++ +YG W+ +          P+   +   Q  +   + N S   S + V  G++     +S+P TG 
Subjt:  YEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMAFNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLT-VETGTRLDNSRKSSPMTGS

Query:  GSRPMDISPVME---EDGPVSAIGNIP-SINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLA------------DGPMKRAKE
         + PM+ SPV E   +    S  G  P  I+ GE  +    +S + P LK      + +   S++         G                   + R   
Subjt:  GSRPMDISPVME---EDGPVSAIGNIP-SINAGEATVTAMYLSKIGPKLKHWKRKARKNVGGSISSEASEKKCIGEVLA------------DGPMKRAKE

Query:  DGDAT---TNGSVQHLNWAQSNHRPILFNTCQVHG----------------NGYQNKRPRLFRFEEVWTQHPK--CKEIITH-----QGCWTGQGNSSSR
        + DA+     G +  + W         ++T Q+                   G        F  +++W +  +  C +   +      G W+   ++ S 
Subjt:  DGDAT---TNGSVQHLNWAQSNHRPILFNTCQVHG----------------NGYQNKRPRLFRFEEVWTQHPK--CKEIITH-----QGCWTGQGNSSSR

Query:  FESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIR
        F   +Q+  S L+HWGR     +++QI+  +  + D Y++P P DF  I  +E+ L   LE +EI+WKQRSRE+WL+WG         +A I      I 
Subjt:  FESCLQSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIR

Query:  EVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNV
           ++N  L+      +E  EL    MF       +    AL      + VG  T+  CL+ LN    I+ WN T+IALI K+K P+ ISDFRPISLCNV
Subjt:  EVQDINGNLIVDQRQMEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNV

Query:  SYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSD
        SYKII+K + NR+K  +  +IS+ QSAFVPSR+I  NVIIG+ECLHTI S ++   G  ALKLD+SKA+DRVEW +LE ++ K+GF+  W++     C  
Subjt:  SYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSD

Query:  SILFNI
        ++ F+I
Subjt:  SILFNI

A0A803PWX1 Uncharacterized protein (e value: 7.0e-65, % identity: 35.49)
Query:  VQHLNWAQSNHRPILFN-TCQVHGNGY-QNKRPRLFRFEEVWTQHPKCKEIITHQGCW---TGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETN
        VQ L+W +S+HR ++ N   +V G+   + KR   F FEE W Q  +C EII +   W    G+G   S F   +  C   L+ W +   + +  +I   
Subjt:  VQHLNWAQSNHRPILFN-TCQVHGNGY-QNKRPRLFRFEEVWTQHPKCKEIITHQGCW---TGQGNSSSRFESCLQSCRSRLKHWGRGTFSSIWRQIETN

Query:  QRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSL
        ++IL +L  +  P  +  I+ +ED+L+  LE+DE YW+QRSR  WLQWGD+NT++FH++A+ RRK+NEI+ +QD  G    D+  + +  E Y+  +F  
Subjt:  QRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSL

Query:  SNPNSEDIDIALQ----------------DIPVRNEVGYITVLN-----------------------------CLDMLNMVRSIRPWNDTFIALISKVKH
        S+ +   +   L+                D  V   V  +  +N                             CL++LN    +   NDT +ALI KV  
Subjt:  SNPNSEDIDIALQ----------------DIPVRNEVGYITVLN-----------------------------CLDMLNMVRSIRPWNDTFIALISKVKH

Query:  PKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIG
        P+ I +FRPISLCNV YKI++K L NR++ +L +++S++QSAF+  R IH N I+GYECLH ++  R  +G  +ALKLDM+KAYDRVEW FLE ++LK+G
Subjt:  PKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIG

Query:  FHSQWVKVDYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF
        +   WV      C  S+ F+    G  +   + +R    G  +S F F
Subjt:  FHSQWVKVDYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAF

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 5.8e-08, % identity: 28.85)
Query:  DFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECL-HTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQ
        +FRPISL N+  KI+ K+L NR++  ++++I  +Q  F+P      N+      + H  ++K  +H   + + +D  KA+D+++  F+ + L K+G    
Subjt:  DFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECL-HTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQ

Query:  WVKV
        ++K+
Subjt:  WVKV

P08548 LINE-1 reverse transcriptase homolog (e value: 2.4e-09, % identity: 21.86)
Query:  QRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSL
        +++ ++ +S P P    EI ++  +L++   +  I    +S+  + +  +K  +   N    +R ++ I  +++ N  +  D  ++++    Y+  ++S 
Subjt:  QRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSL

Query:  SNPNSEDID-----------------------------IALQDIPVRNEVG--------YITVLNCL--DMLNMVRSI-------RPWNDTFIALISKV-
           N ++ID                               +Q++P +   G        Y T    L   +LN+ ++I         + +  I LI K  
Subjt:  SNPNSEDID-----------------------------IALQDIPVRNEVG--------YITVLNCL--DMLNMVRSI-------RPWNDTFIALISKV-

Query:  KHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECL-HTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLL
        K P    ++RPISL N+  KI+ K+L NR++  +++II  +Q  F+P      N+      + H  K K   H   + L +D  KA+D ++  F+ R L 
Subjt:  KHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECL-HTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLL

Query:  KIGFHSQWVKV
        KIG    ++K+
Subjt:  KIGFHSQWVKV

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.5e-08, % identity: 32.77)
Query:  IALISK-VKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTI-KSKRTSHGGWIALKLDMSKAYDRVEW
        I LI K  K P  I +FRPISL N+  KI+ K+L NR++  ++ II  +Q  F+P      N+      +H I K K  +H   + + LD  KA+D+++ 
Subjt:  IALISK-VKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTI-KSKRTSHGGWIALKLDMSKAYDRVEW

Query:  CFLERLLLKIGFHSQWVKV
         F+ ++L + G    ++ +
Subjt:  CFLERLLLKIGFHSQWVKV

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 4.7e-10, % identity: 36.52)
Query:  IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCF
        ++L+ K    +LI ++RP+SL +  YKI+AK +  R+K  L E+I  +QS  VP R+I  NV +  + LH  +    S      L LD  KA+DRV+  +
Subjt:  IALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCF

Query:  LERLLLKIGFHSQWV
        L   L    F  Q+V
Subjt:  LERLLLKIGFHSQWV

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.1e-04, % identity: 23.68)
Query:  EIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSN------------------------------P
        E +++Q+SR  WLQ GD NTR+FH      + +N I+ ++  +   + +  Q++E    Y++++    +                              P
Subjt:  EIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQMEEAFELYFSNMFSLSN------------------------------P

Query:  NSEDIDIALQDIPVRNE------------------VGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKII
        + ++I  A+  +P RN+                  V   T+    +       ++ +N T I LI KV     +S FRP+S C V YKII
Subjt:  NSEDIDIALQDIPVRNE------------------VGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 2.0e-11, % identity: 36.14)
Query:  LVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWV
        +V R+K  +  +I   Q++F+P R    N++   E +H+++ K+    GW+ LKLD+ KAYDR+ W +LE  L+  GF   W+
Subjt:  LVNRMKWALQEIISENQSAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWV


Sequences
CDS sequence
ATGGATTCAACACTCAAATTCAAAAGACAATCATCCAAATTCCATACTTTTAGATGCTCTCTTTTGATGGTGTATGAAGACTTGCTCGAGAACTGGGAAAGATTCAGCCT
ATCAGCTGTAGAGGAGGCCACTGAAGTTGATGTCGATCGCCAAGCTGCAGTAGTCACGAGACAATCACTGGGTTTCAGTTTGACCGGGAAGCTTCTAGCTCCCTGTATCA
TTTATGGGGACGTTATGCATCGAACGTTTAAACCTGCTTGGAATATACCCAATGGTCTAATTATGGAAAAACTTGAAGCAAATCTATTCCTTTTCTCGCTGAGGTTAGAA
GTCGACCAAATGAGGGTGCTGAGGCAAGAACCGTGTCTGTTCGACAAATTCCTTCTTTTCCTTTCGAAGCCAATCCCTATGGTAAAACCTACGGCCATGGAATTTAAGTT
CGCAGCTTTCTGGGTCCACTTCTGCGAGCTCCCAATGGATCTCTACAACCGGTCAATGGCGGAACGACTTGGCAGCGTATGGACACCGATCAAGTATGAAAAGCTTCCGG
ACATTTGTGCATTTTGCGGCCGCATCGACCATGGAATGAGAGATTGTACTTTTAATTATCTTGAGTCTGGTTCATCTTCACGCCGACAAGAGTACGGTATGTGGATGGCC
TTCAACGACAGGACTTCCAGCGTGTTTCGTTCGCCTAGCAACAGTCCAATCGGTAATCAACAGTTGATGGTAGCTTCTCCAAATCGAAATCCTTCACTTCAGCCATCTTT
GACGGTCGAAACTGGTACTCGATTGGATAATTCAAGGAAATCCTCTCCGATGACTGGATCTGGTAGCCGACCAATGGATATCTCGCCGGTGATGGAGGAAGACGGTCCAG
TTTCGGCGATCGGTAACATACCGTCAATTAATGCAGGTGAAGCCACAGTTACAGCAATGTACTTATCTAAAATCGGGCCAAAATTGAAGCATTGGAAACGCAAAGCTCGT
AAGAATGTTGGAGGCTCTATTTCAAGTGAAGCATCTGAGAAAAAATGTATTGGTGAAGTGCTGGCAGATGGTCCTATGAAACGCGCTAAAGAGGATGGTGATGCTACTAC
CAATGGATCAGTGCAACATCTGAATTGGGCTCAATCGAATCACCGCCCAATTCTGTTCAATACATGTCAGGTGCATGGAAATGGTTATCAGAACAAAAGACCTCGTCTGT
TTCGTTTCGAAGAAGTATGGACTCAACATCCAAAATGCAAAGAAATCATTACTCATCAGGGTTGTTGGACAGGACAAGGCAATAGTAGCAGCCGATTTGAGAGTTGTCTC
CAAAGTTGCCGATCACGTTTAAAGCATTGGGGTAGAGGCACATTCTCATCCATTTGGAGACAGATTGAAACTAACCAGAGGATTCTTCAAGACCTCTATAGTAAGCCTCC
ACCTTGGGATTTTCATGAGATAAAACGTGTAGAAGACCAATTAGACCAGGCTCTAGAAGAGGATGAAATATACTGGAAACAACGGTCACGTGAGAACTGGCTTCAATGGG
GGGATAAGAACACACGTTGGTTCCATAACCAAGCCACTATAAGAAGGAAGAGAAATGAAATTCGTGAAGTGCAGGATATAAACGGTAACCTGATTGTTGACCAAAGACAA
ATGGAGGAGGCTTTTGAATTGTATTTTTCAAATATGTTTTCATTGTCTAATCCAAATTCGGAAGACATTGACATTGCATTGCAGGATATTCCAGTTAGGAACGAGGTTGG
TTACATTACTGTTCTTAATTGTCTGGATATGCTTAATATGGTTAGATCGATTAGACCATGGAATGATACATTCATTGCCTTAATTTCTAAAGTTAAGCATCCAAAGCTTA
TCTCTGATTTCAGACCTATAAGCCTCTGTAATGTCTCATACAAAATAATAGCTAAAGTCCTGGTAAATCGTATGAAATGGGCTCTTCAGGAGATAATATCTGAAAACCAA
TCTGCTTTTGTACCTAGTAGATCAATTCATGGTAATGTAATTATAGGATATGAATGCCTGCATACGATCAAGAGTAAAAGAACAAGTCATGGGGGATGGATAGCCTTGAA
ATTAGACATGAGTAAGGCATATGACAGAGTGGAATGGTGCTTTTTAGAAAGGCTTTTGTTAAAAATTGGGTTCCACTCCCAATGGGTTAAAGTTGATTATGGAATGTGTT
CGGACTCCATCCTTTTCAATATAACTGAATGGGGTCCCTTCGAGGCAAATCATTCCTCAAAGAGGACTTCGTCAGGGGGATCCCTTATCTCCCTATTTGCTTTTGCTTGT
ATCTGA
mRNA sequence
ATGGATTCAACACTCAAATTCAAAAGACAATCATCCAAATTCCATACTTTTAGATGCTCTCTTTTGATGGTGTATGAAGACTTGCTCGAGAACTGGGAAAGATTCAGCCT
ATCAGCTGTAGAGGAGGCCACTGAAGTTGATGTCGATCGCCAAGCTGCAGTAGTCACGAGACAATCACTGGGTTTCAGTTTGACCGGGAAGCTTCTAGCTCCCTGTATCA
TTTATGGGGACGTTATGCATCGAACGTTTAAACCTGCTTGGAATATACCCAATGGTCTAATTATGGAAAAACTTGAAGCAAATCTATTCCTTTTCTCGCTGAGGTTAGAA
GTCGACCAAATGAGGGTGCTGAGGCAAGAACCGTGTCTGTTCGACAAATTCCTTCTTTTCCTTTCGAAGCCAATCCCTATGGTAAAACCTACGGCCATGGAATTTAAGTT
CGCAGCTTTCTGGGTCCACTTCTGCGAGCTCCCAATGGATCTCTACAACCGGTCAATGGCGGAACGACTTGGCAGCGTATGGACACCGATCAAGTATGAAAAGCTTCCGG
ACATTTGTGCATTTTGCGGCCGCATCGACCATGGAATGAGAGATTGTACTTTTAATTATCTTGAGTCTGGTTCATCTTCACGCCGACAAGAGTACGGTATGTGGATGGCC
TTCAACGACAGGACTTCCAGCGTGTTTCGTTCGCCTAGCAACAGTCCAATCGGTAATCAACAGTTGATGGTAGCTTCTCCAAATCGAAATCCTTCACTTCAGCCATCTTT
GACGGTCGAAACTGGTACTCGATTGGATAATTCAAGGAAATCCTCTCCGATGACTGGATCTGGTAGCCGACCAATGGATATCTCGCCGGTGATGGAGGAAGACGGTCCAG
TTTCGGCGATCGGTAACATACCGTCAATTAATGCAGGTGAAGCCACAGTTACAGCAATGTACTTATCTAAAATCGGGCCAAAATTGAAGCATTGGAAACGCAAAGCTCGT
AAGAATGTTGGAGGCTCTATTTCAAGTGAAGCATCTGAGAAAAAATGTATTGGTGAAGTGCTGGCAGATGGTCCTATGAAACGCGCTAAAGAGGATGGTGATGCTACTAC
CAATGGATCAGTGCAACATCTGAATTGGGCTCAATCGAATCACCGCCCAATTCTGTTCAATACATGTCAGGTGCATGGAAATGGTTATCAGAACAAAAGACCTCGTCTGT
TTCGTTTCGAAGAAGTATGGACTCAACATCCAAAATGCAAAGAAATCATTACTCATCAGGGTTGTTGGACAGGACAAGGCAATAGTAGCAGCCGATTTGAGAGTTGTCTC
CAAAGTTGCCGATCACGTTTAAAGCATTGGGGTAGAGGCACATTCTCATCCATTTGGAGACAGATTGAAACTAACCAGAGGATTCTTCAAGACCTCTATAGTAAGCCTCC
ACCTTGGGATTTTCATGAGATAAAACGTGTAGAAGACCAATTAGACCAGGCTCTAGAAGAGGATGAAATATACTGGAAACAACGGTCACGTGAGAACTGGCTTCAATGGG
GGGATAAGAACACACGTTGGTTCCATAACCAAGCCACTATAAGAAGGAAGAGAAATGAAATTCGTGAAGTGCAGGATATAAACGGTAACCTGATTGTTGACCAAAGACAA
ATGGAGGAGGCTTTTGAATTGTATTTTTCAAATATGTTTTCATTGTCTAATCCAAATTCGGAAGACATTGACATTGCATTGCAGGATATTCCAGTTAGGAACGAGGTTGG
TTACATTACTGTTCTTAATTGTCTGGATATGCTTAATATGGTTAGATCGATTAGACCATGGAATGATACATTCATTGCCTTAATTTCTAAAGTTAAGCATCCAAAGCTTA
TCTCTGATTTCAGACCTATAAGCCTCTGTAATGTCTCATACAAAATAATAGCTAAAGTCCTGGTAAATCGTATGAAATGGGCTCTTCAGGAGATAATATCTGAAAACCAA
TCTGCTTTTGTACCTAGTAGATCAATTCATGGTAATGTAATTATAGGATATGAATGCCTGCATACGATCAAGAGTAAAAGAACAAGTCATGGGGGATGGATAGCCTTGAA
ATTAGACATGAGTAAGGCATATGACAGAGTGGAATGGTGCTTTTTAGAAAGGCTTTTGTTAAAAATTGGGTTCCACTCCCAATGGGTTAAAGTTGATTATGGAATGTGTT
CGGACTCCATCCTTTTCAATATAACTGAATGGGGTCCCTTCGAGGCAAATCATTCCTCAAAGAGGACTTCGTCAGGGGGATCCCTTATCTCCCTATTTGCTTTTGCTTGT
ATCTGA
Protein sequence
MDSTLKFKRQSSKFHTFRCSLLMVYEDLLENWERFSLSAVEEATEVDVDRQAAVVTRQSLGFSLTGKLLAPCIIYGDVMHRTFKPAWNIPNGLIMEKLEANLFLFSLRLE
VDQMRVLRQEPCLFDKFLLFLSKPIPMVKPTAMEFKFAAFWVHFCELPMDLYNRSMAERLGSVWTPIKYEKLPDICAFCGRIDHGMRDCTFNYLESGSSSRRQEYGMWMA
FNDRTSSVFRSPSNSPIGNQQLMVASPNRNPSLQPSLTVETGTRLDNSRKSSPMTGSGSRPMDISPVMEEDGPVSAIGNIPSINAGEATVTAMYLSKIGPKLKHWKRKAR
KNVGGSISSEASEKKCIGEVLADGPMKRAKEDGDATTNGSVQHLNWAQSNHRPILFNTCQVHGNGYQNKRPRLFRFEEVWTQHPKCKEIITHQGCWTGQGNSSSRFESCL
QSCRSRLKHWGRGTFSSIWRQIETNQRILQDLYSKPPPWDFHEIKRVEDQLDQALEEDEIYWKQRSRENWLQWGDKNTRWFHNQATIRRKRNEIREVQDINGNLIVDQRQ
MEEAFELYFSNMFSLSNPNSEDIDIALQDIPVRNEVGYITVLNCLDMLNMVRSIRPWNDTFIALISKVKHPKLISDFRPISLCNVSYKIIAKVLVNRMKWALQEIISENQ
SAFVPSRSIHGNVIIGYECLHTIKSKRTSHGGWIALKLDMSKAYDRVEWCFLERLLLKIGFHSQWVKVDYGMCSDSILFNITEWGPFEANHSSKRTSSGGSLISLFAFAC
I