; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0009438 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0009438
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr03:10933785..10936690
RNA-Seq ExpressionPI0009438
SyntenyPI0009438
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]2.2e-15158.16Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P+IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        GAGYLSPRFN++  EK GCT+ +Q L HP+ +DHSIEDC EFK EVQKLMD+K+L++GQMSMQ++EV+MIT+  S+E+ S++TT MWKPLVIHYEEK SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWSSYIDLVIIRVFRTHRKVLLDILNQAHVGHDIS--VNALSEIVKNITATNCISFTD--EEIPFEGTGHTKAL
         SYIQ+PK M V+IP     +  R     +  D+      R  +   +++  +  + +D+S      + +V+  T    +S  +  EEIP EGTGHTKAL
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWSSYIDLVIIRVFRTHRKVLLDILNQAHVGHDIS--VNALSEIVKNITATNCISFTD--EEIPFEGTGHTKAL

Query:  HISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-----------------------
        HISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFD A REVIGDI++PLKIGPSTFNV FQ                       
Subjt:  HISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-----------------------

Query:  ---------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPE
                 FS+EGGQAIVYGEEDMFVTKTS LPYVEA EEALECSYRSFE  NATI P EGL MDRYVSK SLMI +TMIKS FQMHKGLGKDNQGD E
Subjt:  ---------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPE

Query:  VISLPKAKERFGLGYKPTTSE
        VISLPKAKE+FGLGYKP TSE
Subjt:  VISLPKAKERFGLGYKPTTSE

KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]2.9e-1051.19Show/hide
Query:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHE----SPTINVVDTFTEKY
        MPAYPPGFTPQRSSSPRM  RTYPTSFP  NPN T+QQ  H ++P+ST + E    +++   S          + V++ F EKY
Subjt:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHE----SPTINVVDTFTEKY

KAA0033221.1 uncharacterized protein E6C27_scaffold845G00100 [Cucumis melo var. makuwa]1.1e-14255.66Show/hide
Query:  EPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILYGAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKV
        EPDVNQN L  H+ P IN +D F EK+K++V D+TTSMN LFQIL+  GYLSPRFN++  EK GCT+ +Q LFHP+ +DH IEDC EFK EVQKLMDAK+
Subjt:  EPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILYGAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKV

Query:  LMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASITSYIQEPKTMIVKIPSPFAYKDNRAMPW--------SSYIDLVIIRVFRTHRKV
        L+VGQMSMQE+EV+MITD  S+++ S++TTS+WKPLVIHYEEK SI SYIQ+PK M ++IP PFAYKDN  +P          + +   I  + R  R  
Subjt:  LMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASITSYIQEPKTMIVKIPSPFAYKDNRAMPW--------SSYIDLVIIRVFRTHRKV

Query:  LLDILNQA--------------HVGHDISVNALSEIVKNIT----------------------------------------ATNCISFTDEEIPFEGTGH
         LD L                  +  +  +N LS++    T                                          + IS   EEI  EG GH
Subjt:  LLDILNQA--------------HVGHDISVNALSEIVKNIT----------------------------------------ATNCISFTDEEIPFEGTGH

Query:  TKALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-------------------
        TK LHISVKC+DHYVARVLVDN SSLNIM RSTL KL IDPSYLRPSTMVVRAFD A REVI DI++PLKIGPSTFN+ FQ                   
Subjt:  TKALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-------------------

Query:  -------------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQ
                     FS+EGGQAIVYGEEDMFVTKTS LPYVEA EEALE SYRSFE+ NATI P +GL MDRYVSK SLMI +TMIKS F MHKGL KDNQ
Subjt:  -------------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQ

Query:  GDPEVISLPKAKERFGLGYKP
         D E+ISLPKAKE FGLGYKP
Subjt:  GDPEVISLPKAKERFGLGYKP

KAA0061241.1 uncharacterized protein E6C27_scaffold455G00760 [Cucumis melo var. makuwa]1.0e-17256.02Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P+IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        GAGYLSPRFN++  EK GCT+ +Q LFHP+ +DHSIEDC EFK EVQKLMD+K+L++GQMSMQE+EV+MIT+  S+E+ S++TT MWKP VIHYEEK SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------
         SYIQ+PK M V+IP PFAYKDN A+PW                                                                        
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------

Query:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK
                      ++ L       VI ++ RT               HRKVLLDILN+AHVGHDISVNALSEIV+NI ATNCISFTDEEIP EGTGHTK
Subjt:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK

Query:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------
        ALHISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFDGARREVIGDI++PLKIGPSTFNV FQ                     
Subjt:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------

Query:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGD
                   FS+E GQAIVYGEEDMFVTKTS LPYVEA EEALECSYRSFE+ NATI P EGL MDRYVSK SLMI +TMIKS FQMHKGLGKDNQGD
Subjt:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGD

Query:  PEVISLPKAKERFGLGYKPTTSE
         EVISLPKAKE FGLGYKP TSE
Subjt:  PEVISLPKAKERFGLGYKPTTSE

KAA0061241.1 uncharacterized protein E6C27_scaffold455G00760 [Cucumis melo var. makuwa]2.9e-1028.51Show/hide
Query:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY
        MPAYPPGFTPQRSSSPRM  RTYPTSFP  NPN T+QQ  H ++P+ST + E    +++   S         ++   E+ +   G D+  S++     L 
Subjt:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS
            + P+F +   EK+             C     Y    K+  H  +D L         + +  ++   K L    +   +  +DM  D L  +    
Subjt:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS

Query:  KTTSMWKPLVIHYEEKASITSYIQEPKT
        K    +K     + E   + + +Q P+T
Subjt:  KTTSMWKPLVIHYEEKASITSYIQEPKT

KAA0061241.1 uncharacterized protein E6C27_scaffold455G00760 [Cucumis melo var. makuwa]7.0e-15854.22Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        GAGYLSPRFN++  EK GCT+ +Q LFHP+ +DHSIEDC EFK EVQKLMD+K+L++GQMSMQE+EV+MIT+  S+E+ S++TT MWKPLVIHYEEK SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------
         SYIQ+PK M V+IP PFAYKDN A+PW                                                                        
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------

Query:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK
                      ++ L       VI ++ RT               HRKVLLDILN+AHVGHDISVNALSEIV+NITATNCISFTDEEIP EGTGHTK
Subjt:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK

Query:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-FSLEGGQAIVYGEEDMFVTK
        ALHISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFDGARREVIGDI++PLKIGPSTFNV FQ   +    + + G   +    
Subjt:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-FSLEGGQAIVYGEEDMFVTK

Query:  TSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPEVISLPKAKERFGLGYKPTTSE
               ++   +  CSYRSFE+ NATI P EGL MDRYVSK SLMI +TMIKS FQMHKGLGKDNQGD EVISLPKAKE+FGLGYKP TSE
Subjt:  TSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPEVISLPKAKERFGLGYKPTTSE

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa]1.4e-0928.07Show/hide
Query:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY
        MP YPPGFTPQRSSSPRM  RTYPTSFP  NPN T+QQ  H ++P+ST + E    +++   S         ++   E+ +   G D+  S++     L 
Subjt:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS
            + P+F +   EK+             C     Y    K+  H  +D L         + +  ++   K L    +   +  +DM  D L  +    
Subjt:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS

Query:  KTTSMWKPLVIHYEEKASITSYIQEPKT
        K    +K     + E   + + +Q P T
Subjt:  KTTSMWKPLVIHYEEKASITSYIQEPKT

KAA0065293.1 uncharacterized protein E6C27_scaffold1023G00060 [Cucumis melo var. makuwa]1.7e-15153.32Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P+IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        G+GYLSPRFN++  +K GCT+ +Q LFHP+ +DHSIEDC EFK EVQKLM++K+L++GQMSMQE+EV+MIT+  S+E+ S++TT MWKPLVIHYEEK+SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------
         SYIQ+PK M V+IP PFAYKDN A+PW                                                                        
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------

Query:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK
                      ++ L       VI ++ RT               HRKVLLDILN+AHVGHDISVNALSEIV+NITATNCISFTDEEIP EGTGHTK
Subjt:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK

Query:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------
        ALHISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFDGARREVI DI++PLKIGPSTFN+ FQ                     
Subjt:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------

Query:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSF
                   FS+EGGQAIVYGEEDMFVTKTS LPYVEA EEALECSYRSFE+ NATI P E L MDRYVSK SLMI +TMIKS F
Subjt:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSF

TrEMBL top hitse value%identityAlignment
A0A5A7SUT0 Reverse transcriptase1.1e-15158.16Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P+IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        GAGYLSPRFN++  EK GCT+ +Q L HP+ +DHSIEDC EFK EVQKLMD+K+L++GQMSMQ++EV+MIT+  S+E+ S++TT MWKPLVIHYEEK SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWSSYIDLVIIRVFRTHRKVLLDILNQAHVGHDIS--VNALSEIVKNITATNCISFTD--EEIPFEGTGHTKAL
         SYIQ+PK M V+IP     +  R     +  D+      R  +   +++  +  + +D+S      + +V+  T    +S  +  EEIP EGTGHTKAL
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWSSYIDLVIIRVFRTHRKVLLDILNQAHVGHDIS--VNALSEIVKNITATNCISFTD--EEIPFEGTGHTKAL

Query:  HISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-----------------------
        HISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFD A REVIGDI++PLKIGPSTFNV FQ                       
Subjt:  HISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-----------------------

Query:  ---------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPE
                 FS+EGGQAIVYGEEDMFVTKTS LPYVEA EEALECSYRSFE  NATI P EGL MDRYVSK SLMI +TMIKS FQMHKGLGKDNQGD E
Subjt:  ---------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPE

Query:  VISLPKAKERFGLGYKPTTSE
        VISLPKAKE+FGLGYKP TSE
Subjt:  VISLPKAKERFGLGYKPTTSE

A0A5A7SUT0 Reverse transcriptase1.4e-1051.19Show/hide
Query:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHE----SPTINVVDTFTEKY
        MPAYPPGFTPQRSSSPRM  RTYPTSFP  NPN T+QQ  H ++P+ST + E    +++   S          + V++ F EKY
Subjt:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHE----SPTINVVDTFTEKY

A0A5A7SUT0 Reverse transcriptase5.2e-14355.66Show/hide
Query:  EPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILYGAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKV
        EPDVNQN L  H+ P IN +D F EK+K++V D+TTSMN LFQIL+  GYLSPRFN++  EK GCT+ +Q LFHP+ +DH IEDC EFK EVQKLMDAK+
Subjt:  EPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILYGAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKV

Query:  LMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASITSYIQEPKTMIVKIPSPFAYKDNRAMPW--------SSYIDLVIIRVFRTHRKV
        L+VGQMSMQE+EV+MITD  S+++ S++TTS+WKPLVIHYEEK SI SYIQ+PK M ++IP PFAYKDN  +P          + +   I  + R  R  
Subjt:  LMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASITSYIQEPKTMIVKIPSPFAYKDNRAMPW--------SSYIDLVIIRVFRTHRKV

Query:  LLDILNQA--------------HVGHDISVNALSEIVKNIT----------------------------------------ATNCISFTDEEIPFEGTGH
         LD L                  +  +  +N LS++    T                                          + IS   EEI  EG GH
Subjt:  LLDILNQA--------------HVGHDISVNALSEIVKNIT----------------------------------------ATNCISFTDEEIPFEGTGH

Query:  TKALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-------------------
        TK LHISVKC+DHYVARVLVDN SSLNIM RSTL KL IDPSYLRPSTMVVRAFD A REVI DI++PLKIGPSTFN+ FQ                   
Subjt:  TKALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-------------------

Query:  -------------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQ
                     FS+EGGQAIVYGEEDMFVTKTS LPYVEA EEALE SYRSFE+ NATI P +GL MDRYVSK SLMI +TMIKS F MHKGL KDNQ
Subjt:  -------------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQ

Query:  GDPEVISLPKAKERFGLGYKP
         D E+ISLPKAKE FGLGYKP
Subjt:  GDPEVISLPKAKERFGLGYKP

A0A5A7V681 Retrotrans_gag domain-containing protein4.9e-17356.02Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P+IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        GAGYLSPRFN++  EK GCT+ +Q LFHP+ +DHSIEDC EFK EVQKLMD+K+L++GQMSMQE+EV+MIT+  S+E+ S++TT MWKP VIHYEEK SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------
         SYIQ+PK M V+IP PFAYKDN A+PW                                                                        
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------

Query:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK
                      ++ L       VI ++ RT               HRKVLLDILN+AHVGHDISVNALSEIV+NI ATNCISFTDEEIP EGTGHTK
Subjt:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK

Query:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------
        ALHISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFDGARREVIGDI++PLKIGPSTFNV FQ                     
Subjt:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------

Query:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGD
                   FS+E GQAIVYGEEDMFVTKTS LPYVEA EEALECSYRSFE+ NATI P EGL MDRYVSK SLMI +TMIKS FQMHKGLGKDNQGD
Subjt:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGD

Query:  PEVISLPKAKERFGLGYKPTTSE
         EVISLPKAKE FGLGYKP TSE
Subjt:  PEVISLPKAKERFGLGYKPTTSE

A0A5A7V681 Retrotrans_gag domain-containing protein1.4e-1028.51Show/hide
Query:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY
        MPAYPPGFTPQRSSSPRM  RTYPTSFP  NPN T+QQ  H ++P+ST + E    +++   S         ++   E+ +   G D+  S++     L 
Subjt:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS
            + P+F +   EK+             C     Y    K+  H  +D L         + +  ++   K L    +   +  +DM  D L  +    
Subjt:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS

Query:  KTTSMWKPLVIHYEEKASITSYIQEPKT
        K    +K     + E   + + +Q P+T
Subjt:  KTTSMWKPLVIHYEEKASITSYIQEPKT

A0A5A7V681 Retrotrans_gag domain-containing protein3.4e-15854.22Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        GAGYLSPRFN++  EK GCT+ +Q LFHP+ +DHSIEDC EFK EVQKLMD+K+L++GQMSMQE+EV+MIT+  S+E+ S++TT MWKPLVIHYEEK SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------
         SYIQ+PK M V+IP PFAYKDN A+PW                                                                        
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------

Query:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK
                      ++ L       VI ++ RT               HRKVLLDILN+AHVGHDISVNALSEIV+NITATNCISFTDEEIP EGTGHTK
Subjt:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK

Query:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-FSLEGGQAIVYGEEDMFVTK
        ALHISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFDGARREVIGDI++PLKIGPSTFNV FQ   +    + + G   +    
Subjt:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ-FSLEGGQAIVYGEEDMFVTK

Query:  TSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPEVISLPKAKERFGLGYKPTTSE
               ++   +  CSYRSFE+ NATI P EGL MDRYVSK SLMI +TMIKS FQMHKGLGKDNQGD EVISLPKAKE+FGLGYKP TSE
Subjt:  TSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSFQMHKGLGKDNQGDPEVISLPKAKERFGLGYKPTTSE

A0A5A7VAU5 Uncharacterized protein6.9e-1028.07Show/hide
Query:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY
        MP YPPGFTPQRSSSPRM  RTYPTSFP  NPN T+QQ  H ++P+ST + E    +++   S         ++   E+ +   G D+  S++     L 
Subjt:  MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVG-DLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS
            + P+F +   EK+             C     Y    K+  H  +D L         + +  ++   K L    +   +  +DM  D L  +    
Subjt:  GAGYLSPRFNSNGGEKFG------------CTSGKQYLFHPKVDDHSIEDCL-------EFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASS

Query:  KTTSMWKPLVIHYEEKASITSYIQEPKT
        K    +K     + E   + + +Q P T
Subjt:  KTTSMWKPLVIHYEEKASITSYIQEPKT

A0A5A7VAU5 Uncharacterized protein8.1e-15253.32Show/hide
Query:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY
        P YP  + P           G +    FP++       Q++  +  +    T  EPDVNQNPL NHE P+IN VDTF +++K+KV D+ TSM  LFQIL+
Subjt:  PAYPPGFTPQRSSS--PRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILY

Query:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI
        G+GYLSPRFN++  +K GCT+ +Q LFHP+ +DHSIEDC EFK EVQKLM++K+L++GQMSMQE+EV+MIT+  S+E+ S++TT MWKPLVIHYEEK+SI
Subjt:  GAGYLSPRFNSNGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASI

Query:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------
         SYIQ+PK M V+IP PFAYKDN A+PW                                                                        
Subjt:  TSYIQEPKTMIVKIPSPFAYKDNRAMPWS-----------------------------------------------------------------------

Query:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK
                      ++ L       VI ++ RT               HRKVLLDILN+AHVGHDISVNALSEIV+NITATNCISFTDEEIP EGTGHTK
Subjt:  -------------SYIDL-------VIIRVFRT---------------HRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTK

Query:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------
        ALHISVKC+DH+VARVLVDN SSLNIM RSTLMKL IDPSYLRPSTMVVRAFDGARREVI DI++PLKIGPSTFN+ FQ                     
Subjt:  ALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLIDPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQ---------------------

Query:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSF
                   FS+EGGQAIVYGEEDMFVTKTS LPYVEA EEALECSYRSFE+ NATI P E L MDRYVSK SLMI +TMIKS F
Subjt:  -----------FSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIVRTMIKSSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGCATACCCTCCGGGTTTTACTCCGCAAAGATCGTCTAGTCCACGCATGGCGGGTAGGACATATCCTACATCATTTCCCGTAGAAAATCCTAACCCGACCTCACA
GCAGACAGTCCACGTGAGTGATCCGATGTCTACCCCAGTTACGGAAAATGAGCCAGACGTCAACCAGAACCCATTATCGAATCATGAGAGTCCCACCATAAATGTTGTTG
ACACATTCACGGAAAAGTACAAGGATAAGGTGGGTGATCTAACTACTTCAATGAATATGCTTTTCCAAATCCTTTATGGAGCTGGATATTTGTCGCCAAGGTTTAACAGT
AATGGTGGGGAGAAGTTTGGATGCACCAGTGGGAAGCAATATTTATTTCACCCTAAGGTAGATGACCATTCCATTGAGGATTGTTTGGAGTTTAAGGGAGAGGTACAAAA
GCTGATGGATGCAAAGGTTCTAATGGTAGGACAAATGAGCATGCAAGAAGTCGAGGTCGACATGATTACTGATACTTTATCTAGTGAGGAAGCTTCAAGTAAGACAACAT
CTATGTGGAAACCATTAGTCATTCATTACGAAGAGAAGGCAAGTATCACGTCTTATATCCAAGAGCCTAAAACAATGATAGTCAAAATACCGAGTCCTTTTGCTTACAAG
GATAATCGTGCCATGCCTTGGAGCTCGTACATCGATCTTGTCATTATTCGTGTATTCCGAACGCACCGTAAGGTTCTGTTGGATATTTTAAATCAAGCCCACGTAGGACA
TGACATTTCAGTGAATGCACTCAGTGAAATTGTGAAGAACATAACTGCTACAAATTGCATTTCCTTTACGGATGAAGAGATCCCTTTTGAAGGTACTGGACATACGAAGG
CATTGCACATATCTGTGAAGTGTAGGGATCACTATGTGGCAAGGGTCCTAGTGGATAATAGATCATCTTTAAATATAATGTTGAGATCCACTTTGATGAAACTCCTTATA
GATCCATCGTACCTAAGACCAAGTACCATGGTTGTTAGAGCTTTTGACGGTGCTCGTAGAGAAGTAATCGGGGATATAGAGGTTCCGTTGAAAATTGGGCCTTCCACTTT
CAATGTTCTATTCCAGTTTAGTTTGGAAGGTGGTCAAGCTATTGTCTACGGAGAGGAGGACATGTTTGTGACAAAAACATCAACTCTTCCTTATGTTGAAGCAGCAGAAG
AAGCCCTAGAATGCTCTTATCGATCGTTTGAAGTTGTTAATGCTACAATCTCTCCACCTGAAGGTTTAGATATGGATCGTTATGTGTCTAAAAAATCTCTGATGATTGTG
AGAACAATGATCAAAAGCAGTTTCCAAATGCACAAAGGATTGGGAAAGGATAATCAAGGAGATCCAGAGGTGATTTCACTTCCTAAAGCCAAAGAAAGGTTTGGATTGGG
CTATAAGCCAACGACTTCTGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGCATACCCTCCGGGTTTTACTCCGCAAAGATCGTCTAGTCCACGCATGGCGGGTAGGACATATCCTACATCATTTCCCGTAGAAAATCCTAACCCGACCTCACA
GCAGACAGTCCACGTGAGTGATCCGATGTCTACCCCAGTTACGGAAAATGAGCCAGACGTCAACCAGAACCCATTATCGAATCATGAGAGTCCCACCATAAATGTTGTTG
ACACATTCACGGAAAAGTACAAGGATAAGGTGGGTGATCTAACTACTTCAATGAATATGCTTTTCCAAATCCTTTATGGAGCTGGATATTTGTCGCCAAGGTTTAACAGT
AATGGTGGGGAGAAGTTTGGATGCACCAGTGGGAAGCAATATTTATTTCACCCTAAGGTAGATGACCATTCCATTGAGGATTGTTTGGAGTTTAAGGGAGAGGTACAAAA
GCTGATGGATGCAAAGGTTCTAATGGTAGGACAAATGAGCATGCAAGAAGTCGAGGTCGACATGATTACTGATACTTTATCTAGTGAGGAAGCTTCAAGTAAGACAACAT
CTATGTGGAAACCATTAGTCATTCATTACGAAGAGAAGGCAAGTATCACGTCTTATATCCAAGAGCCTAAAACAATGATAGTCAAAATACCGAGTCCTTTTGCTTACAAG
GATAATCGTGCCATGCCTTGGAGCTCGTACATCGATCTTGTCATTATTCGTGTATTCCGAACGCACCGTAAGGTTCTGTTGGATATTTTAAATCAAGCCCACGTAGGACA
TGACATTTCAGTGAATGCACTCAGTGAAATTGTGAAGAACATAACTGCTACAAATTGCATTTCCTTTACGGATGAAGAGATCCCTTTTGAAGGTACTGGACATACGAAGG
CATTGCACATATCTGTGAAGTGTAGGGATCACTATGTGGCAAGGGTCCTAGTGGATAATAGATCATCTTTAAATATAATGTTGAGATCCACTTTGATGAAACTCCTTATA
GATCCATCGTACCTAAGACCAAGTACCATGGTTGTTAGAGCTTTTGACGGTGCTCGTAGAGAAGTAATCGGGGATATAGAGGTTCCGTTGAAAATTGGGCCTTCCACTTT
CAATGTTCTATTCCAGTTTAGTTTGGAAGGTGGTCAAGCTATTGTCTACGGAGAGGAGGACATGTTTGTGACAAAAACATCAACTCTTCCTTATGTTGAAGCAGCAGAAG
AAGCCCTAGAATGCTCTTATCGATCGTTTGAAGTTGTTAATGCTACAATCTCTCCACCTGAAGGTTTAGATATGGATCGTTATGTGTCTAAAAAATCTCTGATGATTGTG
AGAACAATGATCAAAAGCAGTTTCCAAATGCACAAAGGATTGGGAAAGGATAATCAAGGAGATCCAGAGGTGATTTCACTTCCTAAAGCCAAAGAAAGGTTTGGATTGGG
CTATAAGCCAACGACTTCTGAATAA
Protein sequenceShow/hide protein sequence
MPAYPPGFTPQRSSSPRMAGRTYPTSFPVENPNPTSQQTVHVSDPMSTPVTENEPDVNQNPLSNHESPTINVVDTFTEKYKDKVGDLTTSMNMLFQILYGAGYLSPRFNS
NGGEKFGCTSGKQYLFHPKVDDHSIEDCLEFKGEVQKLMDAKVLMVGQMSMQEVEVDMITDTLSSEEASSKTTSMWKPLVIHYEEKASITSYIQEPKTMIVKIPSPFAYK
DNRAMPWSSYIDLVIIRVFRTHRKVLLDILNQAHVGHDISVNALSEIVKNITATNCISFTDEEIPFEGTGHTKALHISVKCRDHYVARVLVDNRSSLNIMLRSTLMKLLI
DPSYLRPSTMVVRAFDGARREVIGDIEVPLKIGPSTFNVLFQFSLEGGQAIVYGEEDMFVTKTSTLPYVEAAEEALECSYRSFEVVNATISPPEGLDMDRYVSKKSLMIV
RTMIKSSFQMHKGLGKDNQGDPEVISLPKAKERFGLGYKPTTSE