; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011695 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011695
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr1:31069524..31073472
RNA-Seq ExpressionLag0011695
SyntenyLag0011695
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0005634 - nucleus (cellular component)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]4.2e-18137.39Show/hide
Query:  SFVRIWDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEE-----
        + +   DPAY CF F   DL+PTIEEY  +L++ +K  E++Y FNP+ T KRTLSKFL  VH  E+QK +K KG EE++  DYL ++ + +I+E+     
Subjt:  SFVRIWDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEE-----

Query:  -----------------------KLFFGIEQGVDPTIPLLAKRFGRSIINLARPHG----TQVFRGK-------------KHGIFLFEFQIRACDLEGTL
                               KLFF +E+GV+P IP+LA+ F RS +N  R  G        RGK              H  F  EF+    D     
Subjt:  -----------------------KLFFGIEQGVDPTIPLLAKRFGRSIINLARPHG----TQVFRGK-------------KHGIFLFEFQIRACDLEGTL

Query:  DAPKS------------------------LNLQMRELPHSSSSRPL----------------------GMRPLCSVVGLATK---------RINGNSHSE
        +  ++                          L    +   +   PL                         PL  +  +  K         +I    H E
Subjt:  DAPKS------------------------LNLQMRELPHSSSSRPL----------------------GMRPLCSVVGLATK---------RINGNSHSE

Query:  GVTPEYLQWRIKRSK-IPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDE
        GVT  Y  W+  R K I   +R+ V      + ++P+Q   +  EL  +N+ L+ ENEKL++E  + MD  T    +LE+ K   KN+ K E D + LD+
Subjt:  GVTPEYLQWRIKRSK-IPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDE

Query:  ENRRLIKENHALRNENTALRRETRSQEDRIKDLSRRKETLLELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAF
        E RR+ K N +++NE T L+                                                   L + +  +SEE +  K+Y+  L  QL A 
Subjt:  ENRRLIKENHALRNENTALRRETRSQEDRIKDLSRRKETLLELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAF

Query:  QMASEQLSLERGQLEEKYSSLRGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWA-----------------------------RDLRENTSPMA
        Q +S++++ E   L   Y  ++ DY     D Q+++ +V +T+  ++++++RA GFAEWA                               +R+  + + 
Subjt:  QMASEQLSLERGQLEEKYSSLRGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWA-----------------------------RDLRENTSPMA

Query:  SNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSFASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFK
            ++ E L M     GKG+   DTT    PI                             T +P+Y                                
Subjt:  SNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSFASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFK

Query:  PEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKML
        P F+     VP L       L   + +KL+VL+ERLRA+E T V+GN+DATQLCLVP ++IP KFK+PEF KYDG+TCP+SH++MYCRKMA H +NDK+L
Subjt:  PEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKML

Query:  IHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEH
        +HCFQDSL  PA+ WY+QLD++H+  WK+LAD+FLKQYK NID+APDRLDLQRMEKKS+E FK+YAQRWRD AA+VQPPLTDKE+++MF+NTLR+PFYE 
Subjt:  IHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEH

Query:  MVGRASTNFSDIITIGERIEYEIKHGRIAK-TVESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNAS-TQYSPFCGQ
        M+G ASTNFSDII IGERIEY IKHGR+A+ T E  G KKG  SKKKEGEV  IG  +    +    + +Y  N       +P  +NN S   Y+ +   
Subjt:  MVGRASTNFSDIITIGERIEYEIKHGRIAK-TVESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNAS-TQYSPFCGQ

Query:  NTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQEL
        +T  +  +   S   +P    Q  ++      +FDPIPMTYTELLPQLIQN QLA  P+ P+QPPYP+WYD NARCDYHAG VGHSTENC ALK  VQ L
Subjt:  NTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQEL

Query:  IKAGWLNFKKE-EGPSVDNNPLP---NHQVNAID
        I AGWL+FKK  E  +V+ NPLP   N +VN +D
Subjt:  IKAGWLNFKKE-EGPSVDNNPLP---NHQVNAID

XP_022147189.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]7.8e-16764.1Show/hide
Query:  MPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAVPNLDDPEVKE----LTAPTNEKLEVLKERLRAVEGTSVFGNMDATQL
        MPQ+TTYNPLYD+PVGQY  P  K  Q  QIP        + +PE    P  V NL D   K       AP+NEK EVLKERLRA+E T VFGN+DA+QL
Subjt:  MPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAVPNLDDPEVKE----LTAPTNEKLEVLKERLRAVEGTSVFGNMDATQL

Query:  CLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQR
        C V  +VIPPK K+PEFEKY+G++CPK+H+ MYCRKMAA+  NDK+LIHCFQDSL GPA+ WYMQLDSSHV +WKNLADSFLKQYKHNID+APDRLDLQR
Subjt:  CLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQR

Query:  MEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQM
        MEKKST+ FK+YAQRWRDTAAQVQPPL DKELSAMFINTL+ PFY+ M+G ASTNFSDI+TIGERIEY ++HGRI  T  E    KK + SKKKEGEVQM
Subjt:  MEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQM

Query:  IGKVDKS-HRQIHQPVAQYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQQP--------VTPSQQINRAPYKQAQFDPIPMT
        +G    S  +Q ++   QY   Y P  YGY    VNNA++ Y P+  QN RP  +QNF+    S+  QP         T  QQ NR   KQ QFDPIPMT
Subjt:  IGKVDKS-HRQIHQPVAQYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQQP--------VTPSQQINRAPYKQAQFDPIPMT

Query:  YTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID
        YTELLPQL QN+QLAP P+ P+QPPYPRWYD NARCDYHAGA+ HSTENCT LKYRVQ LIKAGW NFKKE G  V    L NH   Q+NAI+
Subjt:  YTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID

XP_022155098.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022231, partial [Momordica charantia]6.0e-16758.48Show/hide
Query:  DLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSF---ASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQ
        D +        + +EL E L  I   L KG+ IA+T+ P      PQ    F PSF           E  M Q+TTYNPLYDIP GQ+P P  +      
Subjt:  DLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSF---ASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQ

Query:  IPMASQAGASYFKPEFSKIPFAVPNLDDPEV----KELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHI
         P  + AG  + +PE    P  V NL  P+     +   AP++EKLEVL+ERLRAVEGT VFGN+DA+QLCL   +VIPPKFK+PEFEKY+G++CPK+H+
Subjt:  IPMASQAGASYFKPEFSKIPFAVPNLDDPEV----KELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHI

Query:  VMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDK
        +MYCRKMAA+  NDK+LIHCFQDSL GP + WYM LDS HV +WKNLADSFLKQYKHNID+  DRLDLQ MEKK+ E FK+Y QRWRDTAAQ QPP TDK
Subjt:  VMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDK

Query:  ELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQ-IHQPVAQYPSNYLPI-YGY
        ELS+MFINTL+ PFY+ M+G AST+FSDI+TIGERIEY + HGRI  TV ESS  K   SSKKKEGEVQM+G      RQ  +     Y   Y P  YGY
Subjt:  ELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQ-IHQPVAQYPSNYLPI-YGY

Query:  YPHQVNNASTQYSPFCGQNTRPQMNQNFKSRRQQP---VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYH
            VN A+ QY+    QN RP  +Q F+ R  Q     T  QQ NR+  KQ+QFDPIPMTYTELLPQL QN+QLAP P+ P+QPPYP WYD N RCDYH
Subjt:  YPHQVNNASTQYSPFCGQNTRPQMNQNFKSRRQQP---VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYH

Query:  AGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID
        AGA+GHSTENCTALKYRVQ LIKAG L FKKE  P V NNPLPNH   Q+NA++
Subjt:  AGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID

XP_022158986.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]1.9e-16570.42Show/hide
Query:  APTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSS
        AP+NEK EVL+ERLRA+EGT VFGN+DA+QLCLV  +VIPPKFK+PEFEKYDG++CPK+H++MYCRKMAA+  NDK+LIHCFQDSL GPA+ WYMQLDSS
Subjt:  APTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSS

Query:  HVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYE
        +V +WKNLADSFLKQYKHNID+APDRLDLQRMEKKSTE FK+YAQRWRDTAAQVQPPLTDKELSAMFINTL+ PFY+ M+G ASTNFSDI+TIGERIEY 
Subjt:  HVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYE

Query:  IKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVA---QYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQ
        ++HGRI  TV E    KK + SKKKEGEVQM+G    S +Q  QP +   +Y   Y P  YGY    VNNA++ YSP+  QN RP  +QNF+    S+  
Subjt:  IKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVA---QYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQ

Query:  QP--------VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLN
        QP         T  QQ NR   KQ QFDPIPMTYTELLPQL QN+QLAP P+ P+QPPYPRWYD NARCDYHAGA+GHSTENCTALKYRVQ LIKAGWLN
Subjt:  QP--------VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLN

Query:  FKKEEGPSVDNNPLPNH---QVNAID
        FKKE GP V  NPLPNH   Q+NAI+
Subjt:  FKKEEGPSVDNNPLPNH---QVNAID

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]2.0e-19447.6Show/hide
Query:  ATKRINGNSHSEGVTPEYLQWRIKRSK-IPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNR
        + ++I    H EGVT  Y  W+  R K +  T R+ V      + ++P Q   +  +L  +N+ L+ ENEKLQ+E  + +D  T    +LE+ K   KN+
Subjt:  ATKRINGNSHSEGVTPEYLQWRIKRSK-IPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNR

Query:  AKRERDYDILDEENRRLIKENHALRNENTALRRETRSQEDRIKDLSRRKETLLELVAE---TINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEA
         K E++ ++LDEE RR+ K N +L+NE T L+    SQ++ IKDL   KE  LELV +   +I K++ Q+++ E  N +LR+ +D+L V +   SE+ + 
Subjt:  AKRERDYDILDEENRRLIKENHALRNENTALRRETRSQEDRIKDLSRRKETLLELVAE---TINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEA

Query:  SKSYSSSLERQLQAFQMASEQLSLERGQLEEKYSSLRGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWARDLRENTSPMASNADELFEFLGMIR
         K+Y+ SL  QL AFQ +SE++  E   L+  Y  ++ DY   R D Q ++ +V +T+  ++I++RRA GFAEWA DLR N   +  ++D+L  FL MI 
Subjt:  SKSYSSSLERQLQAFQMASEQLSLERGQLEEKYSSLRGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWARDLRENTSPMASNADELFEFLGMIR

Query:  RDL---GKGQKIADTTTPDTPIGNPQVGLPFPPSFA-SHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAV
        R+L   GKG+ + +T     P+ +    + +PP F   H++ T   +   +   NPL+D+P    P P  ++       + +QA                
Subjt:  RDL---GKGQKIADTTTPDTPIGNPQVGLPFPPSFA-SHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAV

Query:  PNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKG
          + D    E T P  +KL+VL+ERLRA+EGT V+GN+DATQLCLVP ++IP KFK+P F+KYDG++CP+SH++MYCRKMAAH  NDK+LIHCFQDSL G
Subjt:  PNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKG

Query:  PAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFS
        PA  WY+QLD++H+  WK+LAD+FLKQYKHNID+APDRLDLQRMEKKS+E FK+YAQRWRD AA+VQPPLTDKE++ MF+NTLR+PFY+ M+G A+TNFS
Subjt:  PAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFS

Query:  DIITIGERIEYEIKHGRIAKT-VESSGTKKGNSSKKKEGEVQMIG-------KVDKSHRQIHQPVAQYPSN--YLPIYGYYPHQVNNASTQYSPFCGQNT
        DII IGERIEY IKHGR+ +T  E  G KKG + KKKEGEV  IG       K     R+  Q    Y SN  ++P   Y P         +SP      
Subjt:  DIITIGERIEYEIKHGRIAKT-VESSGTKKGNSSKKKEGEVQMIG-------KVDKSHRQIHQPVAQYPSN--YLPIYGYYPHQVNNASTQYSPFCGQNT

Query:  RPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIK
           +N NF     +P    Q  ++   +  +FDPIPMTYTELLPQL+ N QLAP PI P+QPPYP+WYDPNARCDYHAG VGHSTENC ALK +VQ LI 
Subjt:  RPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIK

Query:  AGWLNFKKE-EGPSVDNNPLPNHQ---VNAID
        AGWL+FKK  E P V+NNPLPNH+   VNAID
Subjt:  AGWLNFKKE-EGPSVDNNPLPNHQ---VNAID

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase7.6e-14434.88Show/hide
Query:  WDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFL-VVHLKEVQKNVKVKGVEESISADYLTELAR----------------
        WDP  +CF F  FDL PTIEEY  ++++    G  +Y ++ +LT +R+LSKF+  +H  E++K +K K     I  +YL  LAR                
Subjt:  WDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFL-VVHLKEVQKNVKVKGVEESISADYLTELAR----------------

Query:  ----------KHINEE--KLFFGIEQGVDPTIPLLAKRFGRSIINLARPHGTQVFRGKKHGIFLF----------------EFQIRACDLEGTLDAPKSL
                   ++ EE  K+F GIE+GV+P IP++A+ F RS +N  R  G   F      +F++                +F         T+    S 
Subjt:  ----------KHINEE--KLFFGIEQGVDPTIPLLAKRFGRSIINLARPHGTQVFRGKKHGIFLF----------------EFQIRACDLEGTLDAPKSL

Query:  NLQMRELPHSSSSRPLGMRPLC------------SVVGLA------------------------TKRINGNSHSEGVTPEYLQWRIKRSKIPITTRDNVG
        +   ++    +    +   PL              V GL                          K++    H EG T +Y  WR  R+ I   T + + 
Subjt:  NLQMRELPHSSSSRPLGMRPLC------------SVVGLA------------------------TKRINGNSHSEGVTPEYLQWRIKRSKIPITTRDNVG

Query:  ESSSRAVD--KPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDEENRRLIKENHALRNENTALRRETR
        ES    ++  K      E K L   N  L  ENEKL++EVK+ + Q       L+E KR                                         
Subjt:  ESSSRAVD--KPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDEENRRLIKENHALRNENTALRRETR

Query:  SQEDRIKDLSRRKETL--LELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAFQMASEQLSLERGQLEEKYSSL-
                  +R ET   +E++  +I + K+QLIE E  N  L++ +                     +S E QL   + A E ++ +  QL EKY  + 
Subjt:  SQEDRIKDLSRRKETL--LELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAFQMASEQLSLERGQLEEKYSSL-

Query:  ------RGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWARDLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPP
              + +Y T+R      +G++ R    ++ MAR A  F+  AR LR+   P      EL  FLG+I + LG  Q +         I    + L    
Subjt:  ------RGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWARDLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPP

Query:  SFASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYF----------KPEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKE
           S V T+++V +  +     L D+P   YP P F   + S   M  +   + F          +   +  P +   ++  +       +  +LE L+E
Subjt:  SFASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYF----------KPEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKE

Query:  RLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSF
        RLR +E                            +FEKY+GT+CPKSH+VMYCRKM+A+ H+DK+LIHCFQDSL  PA+ WYMQLD S V  WK+LADSF
Subjt:  RLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSF

Query:  LKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTVES
        LKQYK+NID+APDRLDLQRMEKK+ E FK+YAQRWR+ AAQVQPPLTDKEL+AMFINTLR+P+Y+ MVG ASTNFSD+ITIGERIE+ +K+GRI+    +
Subjt:  LKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTVES

Query:  SGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNASTQYSPFCGQNTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDP
        S T++  + KKKEGEV  +    +   ++  P+    +N+ P Y          +   SPF GQ+T+  +  N                   +KQ +FDP
Subjt:  SGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNASTQYSPFCGQNTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDP

Query:  IPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPSVDNNPLPNHQ---VNAID
        IPM+YTELLP LI++ Q+A  P  PLQPPYP+WYDPNA+C+YHAGAVGHSTENC  LK +VQ L+KAGWL FKK  E P V+ NPLPNH+   +NA+D
Subjt:  IPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKE-EGPSVDNNPLPNHQ---VNAID

A0A5A7T1W2 Retrotrans_gag domain-containing protein2.1e-18137.39Show/hide
Query:  SFVRIWDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEE-----
        + +   DPAY CF F   DL+PTIEEY  +L++ +K  E++Y FNP+ T KRTLSKFL  VH  E+QK +K KG EE++  DYL ++ + +I+E+     
Subjt:  SFVRIWDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFL-VVHLKEVQKNVKVKGVEESISADYLTELARKHINEE-----

Query:  -----------------------KLFFGIEQGVDPTIPLLAKRFGRSIINLARPHG----TQVFRGK-------------KHGIFLFEFQIRACDLEGTL
                               KLFF +E+GV+P IP+LA+ F RS +N  R  G        RGK              H  F  EF+    D     
Subjt:  -----------------------KLFFGIEQGVDPTIPLLAKRFGRSIINLARPHG----TQVFRGK-------------KHGIFLFEFQIRACDLEGTL

Query:  DAPKS------------------------LNLQMRELPHSSSSRPL----------------------GMRPLCSVVGLATK---------RINGNSHSE
        +  ++                          L    +   +   PL                         PL  +  +  K         +I    H E
Subjt:  DAPKS------------------------LNLQMRELPHSSSSRPL----------------------GMRPLCSVVGLATK---------RINGNSHSE

Query:  GVTPEYLQWRIKRSK-IPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDE
        GVT  Y  W+  R K I   +R+ V      + ++P+Q   +  EL  +N+ L+ ENEKL++E  + MD  T    +LE+ K   KN+ K E D + LD+
Subjt:  GVTPEYLQWRIKRSK-IPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDE

Query:  ENRRLIKENHALRNENTALRRETRSQEDRIKDLSRRKETLLELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAF
        E RR+ K N +++NE T L+                                                   L + +  +SEE +  K+Y+  L  QL A 
Subjt:  ENRRLIKENHALRNENTALRRETRSQEDRIKDLSRRKETLLELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAF

Query:  QMASEQLSLERGQLEEKYSSLRGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWA-----------------------------RDLRENTSPMA
        Q +S++++ E   L   Y  ++ DY     D Q+++ +V +T+  ++++++RA GFAEWA                               +R+  + + 
Subjt:  QMASEQLSLERGQLEEKYSSLRGDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWA-----------------------------RDLRENTSPMA

Query:  SNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSFASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFK
            ++ E L M     GKG+   DTT    PI                             T +P+Y                                
Subjt:  SNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSFASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFK

Query:  PEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKML
        P F+     VP L       L   + +KL+VL+ERLRA+E T V+GN+DATQLCLVP ++IP KFK+PEF KYDG+TCP+SH++MYCRKMA H +NDK+L
Subjt:  PEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKML

Query:  IHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEH
        +HCFQDSL  PA+ WY+QLD++H+  WK+LAD+FLKQYK NID+APDRLDLQRMEKKS+E FK+YAQRWRD AA+VQPPLTDKE+++MF+NTLR+PFYE 
Subjt:  IHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEH

Query:  MVGRASTNFSDIITIGERIEYEIKHGRIAK-TVESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNAS-TQYSPFCGQ
        M+G ASTNFSDII IGERIEY IKHGR+A+ T E  G KKG  SKKKEGEV  IG  +    +    + +Y  N       +P  +NN S   Y+ +   
Subjt:  MVGRASTNFSDIITIGERIEYEIKHGRIAK-TVESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNAS-TQYSPFCGQ

Query:  NTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQEL
        +T  +  +   S   +P    Q  ++      +FDPIPMTYTELLPQLIQN QLA  P+ P+QPPYP+WYD NARCDYHAG VGHSTENC ALK  VQ L
Subjt:  NTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQEL

Query:  IKAGWLNFKKE-EGPSVDNNPLP---NHQVNAID
        I AGWL+FKK  E  +V+ NPLP   N +VN +D
Subjt:  IKAGWLNFKKE-EGPSVDNNPLP---NHQVNAID

A0A6J1D099 Ribonuclease H3.8e-16764.1Show/hide
Query:  MPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAVPNLDDPEVKE----LTAPTNEKLEVLKERLRAVEGTSVFGNMDATQL
        MPQ+TTYNPLYD+PVGQY  P  K  Q  QIP        + +PE    P  V NL D   K       AP+NEK EVLKERLRA+E T VFGN+DA+QL
Subjt:  MPQHTTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAVPNLDDPEVKE----LTAPTNEKLEVLKERLRAVEGTSVFGNMDATQL

Query:  CLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQR
        C V  +VIPPK K+PEFEKY+G++CPK+H+ MYCRKMAA+  NDK+LIHCFQDSL GPA+ WYMQLDSSHV +WKNLADSFLKQYKHNID+APDRLDLQR
Subjt:  CLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQR

Query:  MEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQM
        MEKKST+ FK+YAQRWRDTAAQVQPPL DKELSAMFINTL+ PFY+ M+G ASTNFSDI+TIGERIEY ++HGRI  T  E    KK + SKKKEGEVQM
Subjt:  MEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQM

Query:  IGKVDKS-HRQIHQPVAQYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQQP--------VTPSQQINRAPYKQAQFDPIPMT
        +G    S  +Q ++   QY   Y P  YGY    VNNA++ Y P+  QN RP  +QNF+    S+  QP         T  QQ NR   KQ QFDPIPMT
Subjt:  IGKVDKS-HRQIHQPVAQYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQQP--------VTPSQQINRAPYKQAQFDPIPMT

Query:  YTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID
        YTELLPQL QN+QLAP P+ P+QPPYPRWYD NARCDYHAGA+ HSTENCT LKYRVQ LIKAGW NFKKE G  V    L NH   Q+NAI+
Subjt:  YTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID

A0A6J1DM29 LOW QUALITY PROTEIN: uncharacterized protein LOC1110222311.7e-16758.66Show/hide
Query:  DLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSF---ASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQ
        D +        + +EL E L  I   L KG+ IA+T+ P      PQ    F PSF           E  M Q+TTYNPLYDIP GQ+P P  +      
Subjt:  DLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSF---ASHVHTTAEVSMPQHTTYNPLYDIPVGQYPFPSFKEGQVSQ

Query:  IPMASQAGASYFKPEFSKIPFAVPNLDDPEV----KELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHI
         P  + AG  + +PE    P  V NL  P+     +   AP++EKLEVL+ERLRAVEGT VFGN+DA+QLCL   +VIPPKFK+PEFEKYDG++CPK+H+
Subjt:  IPMASQAGASYFKPEFSKIPFAVPNLDDPEV----KELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHI

Query:  VMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDK
        +MYCRKMAA+  NDK+LIHCFQDSL GP + WYM LDS HV +WKNLADSFLKQYKHNID+  DRLDLQ MEKK+ E FK+Y QRWRDTAAQ QPP TDK
Subjt:  VMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDK

Query:  ELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQ-IHQPVAQYPSNYLPI-YGY
        ELS+MFINTL+ PFY+ M+G AST+FSDI+TIGERIEY + HGRI  TV ESS  K   SSKKKEGEVQM+G      RQ  +     Y   Y P  YGY
Subjt:  ELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQ-IHQPVAQYPSNYLPI-YGY

Query:  YPHQVNNASTQYSPFCGQNTRPQMNQNFKSRRQQP---VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYH
            VN A+ QY+    QN RP  +Q F+ R  Q     T  QQ NR+  KQ+QFDPIPMTYTELLPQL QN+QLAP P+ P+QPPYP WYD N RCDYH
Subjt:  YPHQVNNASTQYSPFCGQNTRPQMNQNFKSRRQQP---VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYH

Query:  AGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID
        AGA+GHSTENCTALKYRVQ LIKAG L FKKE  P V NNPLPNH   Q+NA++
Subjt:  AGAVGHSTENCTALKYRVQELIKAGWLNFKKEEGPSVDNNPLPNH---QVNAID

A0A6J1E2J7 Ribonuclease H9.3e-16670.42Show/hide
Query:  APTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSS
        AP+NEK EVL+ERLRA+EGT VFGN+DA+QLCLV  +VIPPKFK+PEFEKYDG++CPK+H++MYCRKMAA+  NDK+LIHCFQDSL GPA+ WYMQLDSS
Subjt:  APTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFEKYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSS

Query:  HVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYE
        +V +WKNLADSFLKQYKHNID+APDRLDLQRMEKKSTE FK+YAQRWRDTAAQVQPPLTDKELSAMFINTL+ PFY+ M+G ASTNFSDI+TIGERIEY 
Subjt:  HVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLTDKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYE

Query:  IKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVA---QYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQ
        ++HGRI  TV E    KK + SKKKEGEVQM+G    S +Q  QP +   +Y   Y P  YGY    VNNA++ YSP+  QN RP  +QNF+    S+  
Subjt:  IKHGRIAKTV-ESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVA---QYPSNYLPI-YGYYPHQVNNASTQYSPFCGQNTRPQMNQNFK----SRRQ

Query:  QP--------VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLN
        QP         T  QQ NR   KQ QFDPIPMTYTELLPQL QN+QLAP P+ P+QPPYPRWYD NARCDYHAGA+GHSTENCTALKYRVQ LIKAGWLN
Subjt:  QP--------VTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKAGWLN

Query:  FKKEEGPSVDNNPLPNH---QVNAID
        FKKE GP V  NPLPNH   Q+NAI+
Subjt:  FKKEEGPSVDNNPLPNH---QVNAID

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGCCAACGTACCCATAGAGTCGATACGGTCATCTTCCTCTGAGTATGATGAGTTGAGCTTTGTTAGAATTTGGGACCCGGCCTACAGGTGTTTCGTATTTCAAGA
TTTCGACTTAGTCCCAACCATTGAGGAGTACCACACGATACTGAACATTGAAGAAAAAGGTGGAGAAATGATCTATTGTTTCAACCCACAACTAACTGCAAAGAGAACTT
TATCCAAATTTCTAGTTGTACACCTGAAGGAAGTGCAAAAAAATGTGAAAGTGAAAGGGGTTGAAGAGTCTATATCTGCAGATTATCTAACAGAACTCGCTCGCAAACAC
ATAAATGAAGAAAAGCTATTCTTCGGCATTGAGCAAGGGGTGGATCCAACCATACCTCTTCTAGCGAAACGGTTCGGGCGCTCAATTATTAATTTGGCACGGCCCCATGG
AACTCAAGTTTTCCGAGGAAAGAAGCATGGGATCTTTCTTTTCGAATTCCAAATTAGAGCATGTGATTTGGAAGGCACATTGGATGCCCCTAAAAGCCTTAATCTACAGA
TGCGGGAGCTTCCACACAGTTCCTCTAGTAGGCCCTTGGGGATGCGTCCATTATGCTCCGTTGTTGGCCTTGCGACAAAAAGGATAAATGGCAATAGTCACTCTGAGGGA
GTTACTCCAGAATATCTGCAATGGCGCATTAAGAGGAGCAAGATCCCAATTACGACTCGTGACAATGTGGGAGAATCATCTAGTAGAGCAGTAGATAAGCCTAGCCAGCT
AGCGACAGAGCGGAAAGAGCTAGTGGGAAGAAATCAAACATTGAAACTAGAAAATGAAAAATTACAGCAAGAGGTCAAACGTTTGATGGACCAAGTAACTCAAGCTAGTG
GCAAACTCGAAGAAGTAAAGAGAGTTTCAAAAAACCGAGCCAAGCGAGAAAGAGATTATGATATATTGGACGAGGAGAATAGAAGATTGATTAAGGAGAATCACGCTTTG
AGGAATGAAAATACTGCATTGCGAAGGGAAACTCGTTCACAAGAGGATAGGATCAAAGACCTTTCAAGACGCAAAGAGACTCTCTTAGAGTTAGTTGCGGAAACCATCAA
CAAGCAGAAAACACAACTTATCGAGTTTGAAGAAGCTAATACTACTCTAAGGCGAAGGCTGGACAATCTACGCGTGAATATACAGGCTCAATCAGAAGAGTCTGAAGCTT
CTAAAAGTTATTCAAGCTCGTTGGAACGTCAGCTCCAAGCATTTCAAATGGCAAGCGAACAGTTGTCACTAGAGAGAGGACAATTAGAGGAGAAATACTCCTCATTAAGG
GGAGATTATGCCACTATGAGAGATGACATGCAAATAATTCTTGGGAAAGTAAGTCGAACCATGAACACTATCAAGATCATGGCTAGGAGAGCCCGAGGATTTGCAGAATG
GGCAAGGGATCTGCGAGAGAATACATCACCTATGGCCTCTAATGCGGATGAGTTGTTTGAGTTTTTAGGGATGATTCGTAGAGACCTTGGAAAAGGGCAAAAGATTGCAG
ATACGACTACACCAGACACTCCGATTGGAAACCCTCAAGTTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCATACGACAGCAGAAGTGTCCATGCCACAACAC
ACTACCTATAACCCCTTATATGACATACCTGTTGGGCAGTACCCTTTTCCATCATTTAAAGAAGGCCAAGTCTCCCAAATCCCCATGGCTAGCCAAGCTGGTGCTTCTTA
TTTCAAACCAGAATTTTCAAAAATACCTTTTGCGGTCCCTAACTTGGATGACCCAGAAGTAAAGGAGTTAACCGCGCCCACTAATGAAAAACTTGAAGTTTTAAAGGAAA
GATTGAGGGCAGTCGAGGGCACTAGTGTTTTTGGAAACATGGATGCCACTCAATTATGTTTAGTTCCAGACGTTGTGATCCCACCGAAATTCAAACTTCCTGAATTTGAG
AAATACGATGGGACAACATGCCCTAAAAGTCACATTGTTATGTACTGTAGGAAGATGGCAGCGCACGATCATAATGATAAGATGCTCATACATTGCTTTCAGGATAGCTT
AAAGGGTCCAGCTGCCCATTGGTATATGCAGTTGGATAGTTCACATGTTGTCACATGGAAGAACTTGGCTGATTCCTTCCTCAAACAGTACAAGCACAATATTGATTTGG
CTCCCGATCGACTGGACTTACAGAGGATGGAAAAGAAGAGTACTGAAGACTTCAAAGATTATGCTCAAAGGTGGCGGGATACGGCAGCTCAAGTTCAACCCCCTTTAACG
GATAAGGAGCTTTCTGCCATGTTCATCAACACCTTAAGGTCCCCATTCTATGAACATATGGTTGGAAGGGCGTCAACTAATTTCTCCGATATCATAACAATTGGAGAAAG
AATTGAGTACGAGATCAAACATGGTAGAATAGCCAAAACCGTTGAATCCTCAGGCACAAAGAAGGGCAACAGTTCAAAGAAGAAGGAGGGAGAAGTACAGATGATAGGGA
AGGTGGATAAGTCCCATCGACAAATCCACCAACCCGTCGCACAATATCCTTCGAATTATTTACCTATATACGGATATTATCCACACCAGGTGAATAATGCATCAACGCAG
TACTCCCCATTTTGTGGCCAAAATACCAGACCCCAAATGAATCAGAATTTTAAGTCTCGTAGACAACAACCAGTCACTCCAAGTCAGCAAATTAATCGAGCACCTTACAA
GCAAGCCCAATTTGATCCGATTCCGATGACATATACTGAGCTCTTGCCTCAGTTAATTCAGAACGATCAGTTGGCACCCAAGCCTATTGCTCCACTGCAACCCCCTTATC
CAAGATGGTATGATCCCAATGCAAGGTGTGATTACCATGCAGGTGCTGTTGGGCATTCTACAGAAAATTGCACTGCTTTGAAGTATAGGGTGCAAGAGCTCATTAAGGCC
GGATGGTTAAATTTCAAAAAAGAGGAGGGACCCAGTGTTGATAATAATCCTTTACCAAATCATCAGGTGAATGCAATAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGCCAACGTACCCATAGAGTCGATACGGTCATCTTCCTCTGAGTATGATGAGTTGAGCTTTGTTAGAATTTGGGACCCGGCCTACAGGTGTTTCGTATTTCAAGA
TTTCGACTTAGTCCCAACCATTGAGGAGTACCACACGATACTGAACATTGAAGAAAAAGGTGGAGAAATGATCTATTGTTTCAACCCACAACTAACTGCAAAGAGAACTT
TATCCAAATTTCTAGTTGTACACCTGAAGGAAGTGCAAAAAAATGTGAAAGTGAAAGGGGTTGAAGAGTCTATATCTGCAGATTATCTAACAGAACTCGCTCGCAAACAC
ATAAATGAAGAAAAGCTATTCTTCGGCATTGAGCAAGGGGTGGATCCAACCATACCTCTTCTAGCGAAACGGTTCGGGCGCTCAATTATTAATTTGGCACGGCCCCATGG
AACTCAAGTTTTCCGAGGAAAGAAGCATGGGATCTTTCTTTTCGAATTCCAAATTAGAGCATGTGATTTGGAAGGCACATTGGATGCCCCTAAAAGCCTTAATCTACAGA
TGCGGGAGCTTCCACACAGTTCCTCTAGTAGGCCCTTGGGGATGCGTCCATTATGCTCCGTTGTTGGCCTTGCGACAAAAAGGATAAATGGCAATAGTCACTCTGAGGGA
GTTACTCCAGAATATCTGCAATGGCGCATTAAGAGGAGCAAGATCCCAATTACGACTCGTGACAATGTGGGAGAATCATCTAGTAGAGCAGTAGATAAGCCTAGCCAGCT
AGCGACAGAGCGGAAAGAGCTAGTGGGAAGAAATCAAACATTGAAACTAGAAAATGAAAAATTACAGCAAGAGGTCAAACGTTTGATGGACCAAGTAACTCAAGCTAGTG
GCAAACTCGAAGAAGTAAAGAGAGTTTCAAAAAACCGAGCCAAGCGAGAAAGAGATTATGATATATTGGACGAGGAGAATAGAAGATTGATTAAGGAGAATCACGCTTTG
AGGAATGAAAATACTGCATTGCGAAGGGAAACTCGTTCACAAGAGGATAGGATCAAAGACCTTTCAAGACGCAAAGAGACTCTCTTAGAGTTAGTTGCGGAAACCATCAA
CAAGCAGAAAACACAACTTATCGAGTTTGAAGAAGCTAATACTACTCTAAGGCGAAGGCTGGACAATCTACGCGTGAATATACAGGCTCAATCAGAAGAGTCTGAAGCTT
CTAAAAGTTATTCAAGCTCGTTGGAACGTCAGCTCCAAGCATTTCAAATGGCAAGCGAACAGTTGTCACTAGAGAGAGGACAATTAGAGGAGAAATACTCCTCATTAAGG
GGAGATTATGCCACTATGAGAGATGACATGCAAATAATTCTTGGGAAAGTAAGTCGAACCATGAACACTATCAAGATCATGGCTAGGAGAGCCCGAGGATTTGCAGAATG
GGCAAGGGATCTGCGAGAGAATACATCACCTATGGCCTCTAATGCGGATGAGTTGTTTGAGTTTTTAGGGATGATTCGTAGAGACCTTGGAAAAGGGCAAAAGATTGCAG
ATACGACTACACCAGACACTCCGATTGGAAACCCTCAAGTTGGCCTACCATTTCCACCCAGTTTCGCTTCACATGTTCATACGACAGCAGAAGTGTCCATGCCACAACAC
ACTACCTATAACCCCTTATATGACATACCTGTTGGGCAGTACCCTTTTCCATCATTTAAAGAAGGCCAAGTCTCCCAAATCCCCATGGCTAGCCAAGCTGGTGCTTCTTA
TTTCAAACCAGAATTTTCAAAAATACCTTTTGCGGTCCCTAACTTGGATGACCCAGAAGTAAAGGAGTTAACCGCGCCCACTAATGAAAAACTTGAAGTTTTAAAGGAAA
GATTGAGGGCAGTCGAGGGCACTAGTGTTTTTGGAAACATGGATGCCACTCAATTATGTTTAGTTCCAGACGTTGTGATCCCACCGAAATTCAAACTTCCTGAATTTGAG
AAATACGATGGGACAACATGCCCTAAAAGTCACATTGTTATGTACTGTAGGAAGATGGCAGCGCACGATCATAATGATAAGATGCTCATACATTGCTTTCAGGATAGCTT
AAAGGGTCCAGCTGCCCATTGGTATATGCAGTTGGATAGTTCACATGTTGTCACATGGAAGAACTTGGCTGATTCCTTCCTCAAACAGTACAAGCACAATATTGATTTGG
CTCCCGATCGACTGGACTTACAGAGGATGGAAAAGAAGAGTACTGAAGACTTCAAAGATTATGCTCAAAGGTGGCGGGATACGGCAGCTCAAGTTCAACCCCCTTTAACG
GATAAGGAGCTTTCTGCCATGTTCATCAACACCTTAAGGTCCCCATTCTATGAACATATGGTTGGAAGGGCGTCAACTAATTTCTCCGATATCATAACAATTGGAGAAAG
AATTGAGTACGAGATCAAACATGGTAGAATAGCCAAAACCGTTGAATCCTCAGGCACAAAGAAGGGCAACAGTTCAAAGAAGAAGGAGGGAGAAGTACAGATGATAGGGA
AGGTGGATAAGTCCCATCGACAAATCCACCAACCCGTCGCACAATATCCTTCGAATTATTTACCTATATACGGATATTATCCACACCAGGTGAATAATGCATCAACGCAG
TACTCCCCATTTTGTGGCCAAAATACCAGACCCCAAATGAATCAGAATTTTAAGTCTCGTAGACAACAACCAGTCACTCCAAGTCAGCAAATTAATCGAGCACCTTACAA
GCAAGCCCAATTTGATCCGATTCCGATGACATATACTGAGCTCTTGCCTCAGTTAATTCAGAACGATCAGTTGGCACCCAAGCCTATTGCTCCACTGCAACCCCCTTATC
CAAGATGGTATGATCCCAATGCAAGGTGTGATTACCATGCAGGTGCTGTTGGGCATTCTACAGAAAATTGCACTGCTTTGAAGTATAGGGTGCAAGAGCTCATTAAGGCC
GGATGGTTAAATTTCAAAAAAGAGGAGGGACCCAGTGTTGATAATAATCCTTTACCAAATCATCAGGTGAATGCAATAGATTAA
Protein sequenceShow/hide protein sequence
MNANVPIESIRSSSSEYDELSFVRIWDPAYRCFVFQDFDLVPTIEEYHTILNIEEKGGEMIYCFNPQLTAKRTLSKFLVVHLKEVQKNVKVKGVEESISADYLTELARKH
INEEKLFFGIEQGVDPTIPLLAKRFGRSIINLARPHGTQVFRGKKHGIFLFEFQIRACDLEGTLDAPKSLNLQMRELPHSSSSRPLGMRPLCSVVGLATKRINGNSHSEG
VTPEYLQWRIKRSKIPITTRDNVGESSSRAVDKPSQLATERKELVGRNQTLKLENEKLQQEVKRLMDQVTQASGKLEEVKRVSKNRAKRERDYDILDEENRRLIKENHAL
RNENTALRRETRSQEDRIKDLSRRKETLLELVAETINKQKTQLIEFEEANTTLRRRLDNLRVNIQAQSEESEASKSYSSSLERQLQAFQMASEQLSLERGQLEEKYSSLR
GDYATMRDDMQIILGKVSRTMNTIKIMARRARGFAEWARDLRENTSPMASNADELFEFLGMIRRDLGKGQKIADTTTPDTPIGNPQVGLPFPPSFASHVHTTAEVSMPQH
TTYNPLYDIPVGQYPFPSFKEGQVSQIPMASQAGASYFKPEFSKIPFAVPNLDDPEVKELTAPTNEKLEVLKERLRAVEGTSVFGNMDATQLCLVPDVVIPPKFKLPEFE
KYDGTTCPKSHIVMYCRKMAAHDHNDKMLIHCFQDSLKGPAAHWYMQLDSSHVVTWKNLADSFLKQYKHNIDLAPDRLDLQRMEKKSTEDFKDYAQRWRDTAAQVQPPLT
DKELSAMFINTLRSPFYEHMVGRASTNFSDIITIGERIEYEIKHGRIAKTVESSGTKKGNSSKKKEGEVQMIGKVDKSHRQIHQPVAQYPSNYLPIYGYYPHQVNNASTQ
YSPFCGQNTRPQMNQNFKSRRQQPVTPSQQINRAPYKQAQFDPIPMTYTELLPQLIQNDQLAPKPIAPLQPPYPRWYDPNARCDYHAGAVGHSTENCTALKYRVQELIKA
GWLNFKKEEGPSVDNNPLPNHQVNAID