; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039541 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039541
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr2:45957752..45962675
RNA-Seq ExpressionLag0039541
SyntenyLag0039541
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036933.1 uncharacterized protein E6C27_scaffold86G00060 [Cucumis melo var. makuwa]0.0e+0047.41Show/hide
Query:  EGSWMPLKAMTYRCGDFYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMN
        +  WMPLKA+ YRCGDF+SVPLLGPWGG+NY PLLVLRQ+W KQFIP THNL                           K+K KGHYEGVT  YEAWQ N
Subjt:  EGSWMPLKAMTYRCGDFYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMN

Query:  RRKSITKFIPKEKNVKPDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKG----------------------------
        RRK+I   I +E   +  E + +Q +QW E+   +E++NRLLEQENEKLRKET+QWM+HAT +Q EL+K K                             
Subjt:  RRKSITKFIPKEKNVKPDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKG----------------------------

Query:  -------------------------------------------------------------------------------RVDQTIEILRFVARRANGFAE
                                                                                       RVDQTIE LR V++RANGFAE
Subjt:  -------------------------------------------------------------------------------RVDQTIEILRFVARRANGFAE

Query:  WAAELRANISPMQPNADDLDRNKNSSQHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPG
        WA                    K SS     HPYNTRYK +IMEE+ KDM+KM+Q+INNL EQV+KILELLS GKGK  V+  Q+SNPIQ+ DDP YPPG
Subjt:  WAAELRANISPMQPNADDLDRNKNSSQHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPG

Query:  FTPQHVRQRMHFPQPQTPQQYVATNPLYTPVPDMEQLEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLT
        FTP H+                        VP ++ L        N M +            +++K+KL+VLEERLRAIE TDVYGNIDATQLCLVPGL 
Subjt:  FTPQHVRQRMHFPQPQTPQQYVATNPLYTPVPDMEQLEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLT

Query:  IPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSE
        IPAKFKVPEF+KYDGS+CPRSHLIMYCRKM  HINNDKLL+HCFQDSLT PASRWYIQLDN+HIHVWKDLADAFLKQYK NIDMAPDRLDLQRMEK++SE
Subjt:  IPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSE

Query:  SFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSW
        SFKEYAQRWRDMAAEVQPPLTDKEMT+MFM+TLR+PFYE MIGNA+T+FSDIIVIGERIEYGIKHGR+A  T E    KKGT SKKKEGEV AI F NS 
Subjt:  SFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSW

Query:  KNKPKF--------------------------------------------------------------------ELLPQLIENRQLALIPMDP-------
        K+K  F                                                                    ELLPQLI+NRQLA IPM P       
Subjt:  KNKPKF--------------------------------------------------------------------ELLPQLIENRQLALIPMDP-------

Query:  -YSLLIQNDY---------------------------------------------------------LSEVCKEKVYEITTPMEVLFETLSDAGYVKQEH
         Y    + DY                                                         L E CK +V+EI  PME +F  L +AGYV  E+
Subjt:  -YSLLIQNDY---------------------------------------------------------LSEVCKEKVYEITTPMEVLFETLSDAGYVKQEH

Query:  LDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQ
        LDP +RYEG   +           R   E  +D+                                                                  
Subjt:  LDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQ

Query:  TLTSPSIDSITGVSGITRSGRDIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDDEIPPE
                             D+E PIIAKD+EY K VT EEA EFLKI                              I+GNITSSNSIVF+DDEIPPE
Subjt:  TLTSPSIDSITGVSGITRSGRDIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDDEIPPE

Query:  GLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRP
        GLGHTKALHIQ+KC++YVIARVLVDNGSALN+MPKSTLL LPV+MSHI+SS+MVV+AFDGSRREV+G+IELP+KIGPC FNI FQVMEITP YSFLLGRP
Subjt:  GLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRP

Query:  WIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNL
        WIHSAGVVPSTLHQKLKF+VG+KL+ + GEEDFLITKP+STPYVEA EEALECSF SFEIAHAT++     E+ + HKSK + M  +++  GGY LN+NL
Subjt:  WIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNL

Query:  ERLLSIPSNVGTFGLGYKPP--NVIKLDFRKKRK--------EIDWPALD--------------SYSTDH------LLMEVRNLSIAAVVNELPHEDGII
        E LL IPSN G FGLGYKP   + I+L  +KK+K        E D P++               SYS+D+      LL ++ +LS+AAV  E   E   +
Subjt:  ERLLSIPSNVGTFGLGYKPP--NVIKLDFRKKRK--------EIDWPALD--------------SYSTDH------LLMEVRNLSIAAVVNELPHEDGII

Query:  YACSPDFELNNWDVVDLPIFSQDLQE
        YAC PDFELNNWD VDLP FS+D Q+
Subjt:  YACSPDFELNNWDVVDLPIFSQDLQE

KAA0036942.1 uncharacterized protein E6C27_scaffold86G00180 [Cucumis melo var. makuwa]0.0e+0054.62Show/hide
Query:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA
        MEE+ KDM+KM+Q+INNL EQV+KILELLS GKGK VV+ AQ+SNPIQ+ DDP YPPGFTP H+    + PQ QT Q YV TNPLY     +P +E LEA
Subjt:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA

Query:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL
         A IQ           +M Q+E   AK+KL+VLEERLRAIE TDVYGNIDATQLCLVPGL IPAKFKVPEFDKYDGS+CPRSHLIMYCRKM  HINNDKL
Subjt:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL

Query:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE
        L+HCFQDSLT PASRWYIQLDN+HIHVWKDLADAFLKQYKHNIDMAPDRLDLQR+EK++SESFKEYAQRWRDM AEVQPPLTDKEMT+MFM+TLR+PFYE
Subjt:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE

Query:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------
         MIGNA+T+FSDIIVIGERIEYGIKHGR+A  T E    KKGT SKKKEGEV AI F NS K+K  F                                 
Subjt:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------

Query:  -----------------------------------ELLPQLIENRQLALIPMDPY---------------------------------------------
                                           ELLPQLI+NRQLA IPM P                                              
Subjt:  -----------------------------------ELLPQLIENRQLALIPMDPY---------------------------------------------

Query:  -------SLLIQN-------------DYLSEVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ
               S + +N             D L E C+ +V+EI  PME LFE L +AGYV  E+LDP +RYEG                         SKVQQ
Subjt:  -------SLLIQN-------------DYLSEVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ

Query:  LMDSKILMICR--RVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSIT
        LMDSKIL + R     EM+  ++  L  E S    E  +SF P+PLT+ Y E+ N S S  NP+ +TIQVP PFKFKD KAVPWRY+CQ +T P +D+IT
Subjt:  LMDSKILMICR--RVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSIT

Query:  GVSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI----------------------------
         +SGITRSGR                                  D+E PI+AKD+EY K VT+EEA EFLKI                            
Subjt:  GVSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI----------------------------

Query:  -WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGP
          I+GNITSSNSIVF+DDEIPPEGLGHTKALHIQ+KC++YVIARVLVDNGSALN+MPKSTLL LPV+MSHI+SS+MVV+AFDGSRREV+G+IELP+KIGP
Subjt:  -WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGP

Query:  CTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSH
        C FNI FQVMEITP YSFLLGRPWIHSAGVVPSTLHQKLKF+VG+KL+ + GEEDFLITKP+STPYVEA EEALECSF SFEIAHAT++     E+ + H
Subjt:  CTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSH

Query:  KSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALD-----------------------SYSTDH------LL
        KSK + M  +++  GGY LN+NLE LL IPSN G FGLGYKP    K+  +K++K+     L+                       SYS+D+      LL
Subjt:  KSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALD-----------------------SYSTDH------LL

Query:  MEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPIFSQDLQE
         ++ +LS+AAV  E   E   +YAC   FELNNWD VDLP FS+D Q+
Subjt:  MEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPIFSQDLQE

XP_031738551.1 LOW QUALITY PROTEIN: uncharacterized protein LOC101203611 [Cucumis sativus]0.0e+0046.75Show/hide
Query:  FYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMNRRKSITKFIPKEKNVK
        F+SVPLLGPWGG+NY PLLVLRQ+W KQFIP THNLQ+ +F+YD ED   KK +AV AWKSVRK+K KGHYEGVT  YEAWQ NRRK++  +  +E   +
Subjt:  FYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMNRRKSITKFIPKEKNVK

Query:  PDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKG--------------------------------------------
         +EP+ +Q  QW E+  ++EE+NRLLEQENEKL+KET+QW++HAT +Q+EL+K K                                             
Subjt:  PDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKG--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------RVDQTIEILRFVARRANGFAEWAAELRANISPMQPNADDLDRNKNSSQHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAK
                     RVDQTI  LR V+RRANGFAEWAA+LR N   ++P++DDL+                R+ + I  E G                   
Subjt:  -------------RVDQTIEILRFVARRANGFAEWAAELRANISPMQPNADDLDRNKNSSQHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAK

Query:  ILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLY---TPVPDMEQLEAHAGIQINNMTMSGQQSEMRQHEGV
               GKGK VVE AQ+SNP+Q+ DDP YPPGFTP+H    M+  Q QT Q YVA NPL+    PVPD+EQLEA A IQ           +M Q+E  
Subjt:  ILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLY---TPVPDMEQLEAHAGIQINNMTMSGQQSEMRQHEGV

Query:  QAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSH
         AK+KL+VLEERLRAIEGTDVYGNIDATQLCLVPGL IP KFKVP FDKYDGSSCPRSHLIMYCRKM AHI NDKLLIHCFQDSLTGPA+RWYIQLDN+H
Subjt:  QAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSH

Query:  IHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGI
        IHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEK++SESFKEYAQRWRD+AAEVQPPLTDKEMT MFM+TLR+PFY+ MIGNA T+FSDIIVIGERIEYGI
Subjt:  IHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGI

Query:  KHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------------------------------
        KHGR+  T+AE    KKG T KKKEGEV AI F N   +K  F                                                         
Subjt:  KHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------------------------------

Query:  -----------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS------------------------
                   ELLPQL+ NRQLA IP++P                                      LI   +LS                        
Subjt:  -----------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS------------------------

Query:  ------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-----------SKVQQLMDSKILMIC--RRVSEMREDEVSALTGESSSHEREVI
                CK +V+EI  PME LFE L +AGYV  E+LDP +RYEG                L +  IL +   +   EM+++++  L GE +  E    
Subjt:  ------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-----------SKVQQLMDSKILMIC--RRVSEMREDEVSALTGESSSHEREVI

Query:  ESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITGVSGITRSGR-----------------------------
          F P+PLT+ Y EN N SS  NP+ + ++VP PFKFKD KAVPWRY+CQ +T PS+D+ITG+SGITRSGR                             
Subjt:  ESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITGVSGITRSGR-----------------------------

Query:  -DIEEPI---IAKDMEYNKPVTEEEAEEFLKI------------------------------------------------------WIVGNITSSNSIVF
         D E+ +   + KD+E  KPVT+E A EFLKI                                                       I+G+ITSSNSIVF
Subjt:  -DIEEPI---IAKDMEYNKPVTEEEAEEFLKI------------------------------------------------------WIVGNITSSNSIVF

Query:  SDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPA
        +DDEI PEGLGH KALHIQVK ++YVIARVLVDNGSALN+MPKSTLLKLPV+MS+I+SS+MVVRAFDGSRREVIG+IELPIKIGPCTFNI FQVMEITP 
Subjt:  SDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPA

Query:  YSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNG
        YSFLLGRPWIHSAGVVPSTLHQKLKF+VG+K++ L GEEDFLITKPISTPYVEA EEALECSFRSFEIAHAT++     E+ +SH  K      +++ +G
Subjt:  YSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNG

Query:  GYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKE-----------------------------IDWPALDSYSTDHLLMEVRNLSIAAVVNEL
        GY LNQ+LE LL+ PSN G FGLGY P    K+  +K++K+                             I + + DS S   LL ++ +LSIAAV  E 
Subjt:  GYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKE-----------------------------IDWPALDSYSTDHLLMEVRNLSIAAVVNEL

Query:  PHEDGIIYACSPDFELNNWDV
          ED  +YAC P+FELNNWDV
Subjt:  PHEDGIIYACSPDFELNNWDV

XP_031738857.1 LOW QUALITY PROTEIN: uncharacterized protein LOC116402780 [Cucumis sativus]8.6e-31053.19Show/hide
Query:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLY---TPVPDMEQLEA
        MEEQ KDM+KM+QDINNL EQV+KILELLS GKGK VVE AQ+SNP+Q+ DDP YPPGFTP+H    M+  Q QT Q YVA NPL+    PVPD+EQLE 
Subjt:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLY---TPVPDMEQLEA

Query:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL
         A IQ           +M Q+E   AK+KL+VLEERLRAIEGTDVYGNIDATQLCLVPGL IP KFKVPEFDKYDGSSCPRSHLIMYCRKM AHI NDKL
Subjt:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL

Query:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE
        LIHCFQDSLTGPA+RWYIQLDN+HIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEK++SESFKEYAQRWRD+AAEVQPPLTDKEMT MFM+TLR+PFY+
Subjt:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE

Query:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------
         MIGNA T+FSDIIVIGERIEYGIKHGR+  T+AE    KKG T KKKEGEV AI F N   +K  F                                 
Subjt:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------

Query:  -----------------------------------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS
                                           ELLPQL+ NRQLA IP++P                                      LI   +LS
Subjt:  -----------------------------------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS

Query:  ------------------------------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ
                                        CK +V+EI  PME LFE L +AGYV  E+LDP +RYEG                         SKVQQ
Subjt:  ------------------------------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ

Query:  LMDSKILMIC--RRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITG
         MDSKIL +   +   EM+++++  L GE +  E      F P+PLT+ Y EN N SS  NP+ + ++VP PFKFKD KAVPWRY+CQ +T PS+D+ITG
Subjt:  LMDSKILMIC--RRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITG

Query:  VSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------
        +SGITRSGR                                  D+E  + AKD+E  KPVT+E A EFLKI                             
Subjt:  VSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------

Query:  -------------------------WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSM
                                  I+G+ITSSNSIVF+DDEIPPEGLGH KALHIQVK ++YVIARVLVDNGSALN+MPKSTLLKLPV+MS+I+SS+M
Subjt:  -------------------------WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSM

Query:  VVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALEC
        VVRAFDGSRREVIG+IELPIKIGPCTFNI FQVMEITP YSFLLGRPWIHSAGVVPSTLHQKLKF+VG+K++ L GEEDFLITKPISTPYVEA EEALEC
Subjt:  VVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALEC

Query:  SFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKE--------------------
        SFRSFEIAHAT++     E+ +SH  K      +++ +GGY LNQ+LE LL+ PSN G FGLGY P    K+  +K++K+                    
Subjt:  SFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKE--------------------

Query:  ---------IDWPALDSYSTDHLLMEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDV
                 I + + DS S   LL ++ +LSIAAV  E   ED  +YAC P+FELNNWDV
Subjt:  ---------IDWPALDSYSTDHLLMEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDV

XP_031742360.1 uncharacterized protein LOC116404324 [Cucumis sativus]0.0e+0053.28Show/hide
Query:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLY---TPVPDMEQLEA
        MEEQ KDM+KM+QDINNL EQV+KILELLS GKGK VVE AQ+SNP+Q+ DDP YPPGFTP+H    M+  Q QT Q YVA NPL+    PVPD+EQLEA
Subjt:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLY---TPVPDMEQLEA

Query:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL
         A IQ           +M Q+E   AK+KL+VLEERLRAIEGTDVYGNIDATQLCLVPGL IP KFKVPEFDKYDGSSCPRSHLIMYCRKM AHI NDKL
Subjt:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL

Query:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE
        LIHCFQDSLTGPA+RWYIQLDN+HIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEK++SESFKEYAQRWRD+AAEVQPPLTDKEMT MFM+TLR+PFY+
Subjt:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE

Query:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------
         MIGNA T+FSDIIVIGERIEYGIKHGR+  T+AE    KKG T KKKEGEV AI F N   +K  F                                 
Subjt:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------

Query:  -----------------------------------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS
                                           ELLPQL+ NRQLA IP++P                                      LI   +LS
Subjt:  -----------------------------------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS

Query:  ------------------------------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ
                                        CK +V+EI  PME LFE L +AGYV  E+LDP +RYEG                         SKVQQ
Subjt:  ------------------------------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ

Query:  LMDSKILMIC--RRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITG
         MDSKIL +   +   EM+++++  L GE +  E      F P+PLT+ Y EN N SS  NP+ + ++VP PFKFKD KAVPWRY+CQ +T PS+D+ITG
Subjt:  LMDSKILMIC--RRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITG

Query:  VSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------
        +SGITRSGR                                  D+E  + AKD+E  KPVT+E A EFLKI                             
Subjt:  VSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------

Query:  -------------------------WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSM
                                  I+G+ITSSNSIVF+DDEIPPEGLGH KALHIQVK ++YVIARVLVDNGSALN+MPKSTLLKLPV+MS+I+SS+M
Subjt:  -------------------------WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSM

Query:  VVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALEC
        VVRAFDGSRREVIG+IELPIKIGPCTFNI FQVMEITP YSFLLGRPWIHSAGVVPSTLHQKLKF+VG+K++ L GEEDFLITKPISTPYVEA EEALEC
Subjt:  VVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALEC

Query:  SFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKE--------------------
        SFRSFEIAHAT++     E+ +SH  K      +++ +GGY LNQ+LE LL+ PSN G FGLGY P    K+  +K++K+                    
Subjt:  SFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKE--------------------

Query:  ---------IDWPALDSYSTDHLLMEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDV
                 I + + DS S   LL ++ +LSIAAV  E   ED  +YAC P+FELNNWDV
Subjt:  ---------IDWPALDSYSTDHLLMEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDV

TrEMBL top hitse value%identityAlignment
A0A5A7T0H8 Uncharacterized protein0.0e+0054.62Show/hide
Query:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA
        MEE+ KDM+KM+Q+INNL EQV+KILELLS GKGK VV+ AQ+SNPIQ+ DDP YPPGFTP H+    + PQ QT Q YV TNPLY     +P +E LEA
Subjt:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA

Query:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL
         A IQ           +M Q+E   AK+KL+VLEERLRAIE TDVYGNIDATQLCLVPGL IPAKFKVPEFDKYDGS+CPRSHLIMYCRKM  HINNDKL
Subjt:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL

Query:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE
        L+HCFQDSLT PASRWYIQLDN+HIHVWKDLADAFLKQYKHNIDMAPDRLDLQR+EK++SESFKEYAQRWRDM AEVQPPLTDKEMT+MFM+TLR+PFYE
Subjt:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE

Query:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------
         MIGNA+T+FSDIIVIGERIEYGIKHGR+A  T E    KKGT SKKKEGEV AI F NS K+K  F                                 
Subjt:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------

Query:  -----------------------------------ELLPQLIENRQLALIPMDPY---------------------------------------------
                                           ELLPQLI+NRQLA IPM P                                              
Subjt:  -----------------------------------ELLPQLIENRQLALIPMDPY---------------------------------------------

Query:  -------SLLIQN-------------DYLSEVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ
               S + +N             D L E C+ +V+EI  PME LFE L +AGYV  E+LDP +RYEG                         SKVQQ
Subjt:  -------SLLIQN-------------DYLSEVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ

Query:  LMDSKILMICR--RVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSIT
        LMDSKIL + R     EM+  ++  L  E S    E  +SF P+PLT+ Y E+ N S S  NP+ +TIQVP PFKFKD KAVPWRY+CQ +T P +D+IT
Subjt:  LMDSKILMICR--RVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSIT

Query:  GVSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI----------------------------
         +SGITRSGR                                  D+E PI+AKD+EY K VT+EEA EFLKI                            
Subjt:  GVSGITRSGR----------------------------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI----------------------------

Query:  -WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGP
          I+GNITSSNSIVF+DDEIPPEGLGHTKALHIQ+KC++YVIARVLVDNGSALN+MPKSTLL LPV+MSHI+SS+MVV+AFDGSRREV+G+IELP+KIGP
Subjt:  -WIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGP

Query:  CTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSH
        C FNI FQVMEITP YSFLLGRPWIHSAGVVPSTLHQKLKF+VG+KL+ + GEEDFLITKP+STPYVEA EEALECSF SFEIAHAT++     E+ + H
Subjt:  CTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSH

Query:  KSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALD-----------------------SYSTDH------LL
        KSK + M  +++  GGY LN+NLE LL IPSN G FGLGYKP    K+  +K++K+     L+                       SYS+D+      LL
Subjt:  KSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALD-----------------------SYSTDH------LL

Query:  MEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPIFSQDLQE
         ++ +LS+AAV  E   E   +YAC   FELNNWD VDLP FS+D Q+
Subjt:  MEVRNLSIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPIFSQDLQE

A0A5A7T1W2 Retrotrans_gag domain-containing protein0.0e+0047.41Show/hide
Query:  EGSWMPLKAMTYRCGDFYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMN
        +  WMPLKA+ YRCGDF+SVPLLGPWGG+NY PLLVLRQ+W KQFIP THNL                           K+K KGHYEGVT  YEAWQ N
Subjt:  EGSWMPLKAMTYRCGDFYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMN

Query:  RRKSITKFIPKEKNVKPDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKG----------------------------
        RRK+I   I +E   +  E + +Q +QW E+   +E++NRLLEQENEKLRKET+QWM+HAT +Q EL+K K                             
Subjt:  RRKSITKFIPKEKNVKPDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKG----------------------------

Query:  -------------------------------------------------------------------------------RVDQTIEILRFVARRANGFAE
                                                                                       RVDQTIE LR V++RANGFAE
Subjt:  -------------------------------------------------------------------------------RVDQTIEILRFVARRANGFAE

Query:  WAAELRANISPMQPNADDLDRNKNSSQHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPG
        WA                    K SS     HPYNTRYK +IMEE+ KDM+KM+Q+INNL EQV+KILELLS GKGK  V+  Q+SNPIQ+ DDP YPPG
Subjt:  WAAELRANISPMQPNADDLDRNKNSSQHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPG

Query:  FTPQHVRQRMHFPQPQTPQQYVATNPLYTPVPDMEQLEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLT
        FTP H+                        VP ++ L        N M +            +++K+KL+VLEERLRAIE TDVYGNIDATQLCLVPGL 
Subjt:  FTPQHVRQRMHFPQPQTPQQYVATNPLYTPVPDMEQLEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLT

Query:  IPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSE
        IPAKFKVPEF+KYDGS+CPRSHLIMYCRKM  HINNDKLL+HCFQDSLT PASRWYIQLDN+HIHVWKDLADAFLKQYK NIDMAPDRLDLQRMEK++SE
Subjt:  IPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSE

Query:  SFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSW
        SFKEYAQRWRDMAAEVQPPLTDKEMT+MFM+TLR+PFYE MIGNA+T+FSDIIVIGERIEYGIKHGR+A  T E    KKGT SKKKEGEV AI F NS 
Subjt:  SFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSW

Query:  KNKPKF--------------------------------------------------------------------ELLPQLIENRQLALIPMDP-------
        K+K  F                                                                    ELLPQLI+NRQLA IPM P       
Subjt:  KNKPKF--------------------------------------------------------------------ELLPQLIENRQLALIPMDP-------

Query:  -YSLLIQNDY---------------------------------------------------------LSEVCKEKVYEITTPMEVLFETLSDAGYVKQEH
         Y    + DY                                                         L E CK +V+EI  PME +F  L +AGYV  E+
Subjt:  -YSLLIQNDY---------------------------------------------------------LSEVCKEKVYEITTPMEVLFETLSDAGYVKQEH

Query:  LDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQ
        LDP +RYEG   +           R   E  +D+                                                                  
Subjt:  LDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQ

Query:  TLTSPSIDSITGVSGITRSGRDIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDDEIPPE
                             D+E PIIAKD+EY K VT EEA EFLKI                              I+GNITSSNSIVF+DDEIPPE
Subjt:  TLTSPSIDSITGVSGITRSGRDIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDDEIPPE

Query:  GLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRP
        GLGHTKALHIQ+KC++YVIARVLVDNGSALN+MPKSTLL LPV+MSHI+SS+MVV+AFDGSRREV+G+IELP+KIGPC FNI FQVMEITP YSFLLGRP
Subjt:  GLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRP

Query:  WIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNL
        WIHSAGVVPSTLHQKLKF+VG+KL+ + GEEDFLITKP+STPYVEA EEALECSF SFEIAHAT++     E+ + HKSK + M  +++  GGY LN+NL
Subjt:  WIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNL

Query:  ERLLSIPSNVGTFGLGYKPP--NVIKLDFRKKRK--------EIDWPALD--------------SYSTDH------LLMEVRNLSIAAVVNELPHEDGII
        E LL IPSN G FGLGYKP   + I+L  +KK+K        E D P++               SYS+D+      LL ++ +LS+AAV  E   E   +
Subjt:  ERLLSIPSNVGTFGLGYKPP--NVIKLDFRKKRK--------EIDWPALD--------------SYSTDH------LLMEVRNLSIAAVVNELPHEDGII

Query:  YACSPDFELNNWDVVDLPIFSQDLQE
        YAC PDFELNNWD VDLP FS+D Q+
Subjt:  YACSPDFELNNWDVVDLPIFSQDLQE

A0A5A7T5W6 RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein1.1e-29252.81Show/hide
Query:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA
        MEE+ KDM+KM+Q+INNL EQV+KILELLS GKGK VV+ AQ+SNPIQ+ DDP YPPGFTP H+    + PQ QT Q YV TNPLY     +P +E LEA
Subjt:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA

Query:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL
         A IQ           +M Q+E   AK+KL+VLEERLRAIE TDVYGNIDATQLCLVPGL IPAKFKVPEFDKYDGS+CPRSHLIMY RKM  HINNDKL
Subjt:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL

Query:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE
        L+HCFQDSLT PASRWYIQLDN+HIHVWKDLADAFLKQYKHNIDMAPD LDLQRMEK++SESFKEYAQRWRDMAAEVQPPLTDKEMT+MFM+TLR+PFYE
Subjt:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE

Query:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------
         MIGNA+T+FSDIIVIGERIEYGIKHGR+A  T E    KKGT SKKKEGEV AI F NS K+K  F                                 
Subjt:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------

Query:  -----------------------------------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS
                                           E+LPQLI+NRQLA IPM P                                      LI   +LS
Subjt:  -----------------------------------ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS

Query:  ------------------------------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALT
                                      E CK +++EI  PME LFE L +AGYV  E+LDP +RYEG       +S+  +  R V            
Subjt:  ------------------------------EVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALT

Query:  GESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITGVSGITRSGR------------------
           + H  +  + F  K           V  L + + +T+                      +T P +D+ITG+SGITRSGR                  
Subjt:  GESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITGVSGITRSGR------------------

Query:  ----------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDDEIPPEGLGHT
                        D+E PIIAKD+EY K VT+EEA EFLKI                              I+GNITSSNSIVF+DDEIPPEGLGHT
Subjt:  ----------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDDEIPPEGLGHT

Query:  KALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSA
        KALHIQ+KC++YVIARVLVDNGSALN+MPKSTLLKLPV+MSHI+SS+MVV+AFDGSRREV+G+IELP+KIGPC FNI FQVMEITP YSFLLGRPWIHSA
Subjt:  KALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSA

Query:  GVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLS
        GVVPSTLHQKLKF+VG+KL+ + GEEDFLITKP+STPYVEA EEALECSFRSFEIAHAT++     E+ + HKSK + M  +++  GGY LN+NLE LL 
Subjt:  GVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLS

Query:  IPSNVGTFGLGYKPP--NVIKLDFRKKRK--------EIDWPALD--------------SYSTDH------LLMEVRNLSIAAVVNELPHEDGIIYACSP
        IPSN G FGLGYKP   + I+L   KK+K        E D P++               SYS+D+      LL ++ +LS+AAV  E   E   +YAC P
Subjt:  IPSNVGTFGLGYKPP--NVIKLDFRKKRK--------EIDWPALD--------------SYSTDH------LLMEVRNLSIAAVVNELPHEDGIIYACSP

Query:  DFELNNWDVVDLPIFSQDLQE
        DFELNNWD VDLP FS+D  +
Subjt:  DFELNNWDVVDLPIFSQDLQE

A0A5A7V3W7 Retrotrans_gag domain-containing protein7.1e-28452.03Show/hide
Query:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA
        MEE+ KDM+KM+Q+INN+ EQV+KILELLS GKGK  V+  Q+SNPIQ+ DDP YPP FTP H    M+ PQ QT Q Y+ T+PLY     VP +E LEA
Subjt:  MEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTP---VPDMEQLEA

Query:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL
         A IQ           +M Q+E   AK+KL+VLEERLRAIEGTDVYGNIDATQLCLVPGL IPAKFKVPEFDKYDGS+CPRSHLIMYCRKM  HINNDKL
Subjt:  HAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKL

Query:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE
        L+HCFQDSLTGPASRWYIQLDN+HIHVWKDLADAFLKQYKHNIDMAP+RLDLQRMEK++SESFKEYAQRW+DM AEVQPPLTDKEMT+MFM+TLR+PFYE
Subjt:  LIHCFQDSLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYE

Query:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------
         MIGNA+T+FSDIIVIGERIEYGIKHGR+A  T E    KKGT SKKKEGE+ AI F NS K+K  F                                 
Subjt:  HMIGNAATSFSDIIVIGERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF---------------------------------

Query:  -----------------------------------ELLPQLIENRQLALIPMDP--------YSLLIQNDY-----------------------------
                                           ELLPQLI+NRQLA IPM P        Y    + DY                             
Subjt:  -----------------------------------ELLPQLIENRQLALIPMDP--------YSLLIQNDY-----------------------------

Query:  ----------------------------LSEVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ
                                    L E CK +V+EI  PME LFE L +AGYV  E+LDP +RYEG                         SKVQQ
Subjt:  ----------------------------LSEVCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQ

Query:  LMDSKILMICR--RVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSIT
        LMDSKIL + R     EM++ ++ AL  E S    E  +SF P+PLT+ Y E+HN S S  NP+ +TIQVP PFKFKD KAVPWRY+C  +T    D++T
Subjt:  LMDSKILMICR--RVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSIT

Query:  GVSG--ITRSGR--------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDD
          S   I   GR              D+E PIIAKD++Y K VT+EEA EFLKI                              I+GNITSSNSIVF DD
Subjt:  GVSG--ITRSGR--------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI-----------------------------WIVGNITSSNSIVFSDD

Query:  EIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSF
        EI PEGLGHTKALHIQVKC++YVIARVLVDNGS LN+MPKSTLLKLPV+MSHI+SS+MVV+AFDGSRREV+                             
Subjt:  EIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSF

Query:  LLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYR
                  GVVPS LHQKLKF+VG+KL+ + GE+DFLITKP+STPYVEA EEALECSFRSFEIAHAT++     E+ + HKSK + M  +++  GGY 
Subjt:  LLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKEMAAKVVVNGGYR

Query:  LNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALDSYSTDHLLMEVRNL------SIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPI
        LN+NLE LL  PSN G FGLGYKP    K+  ++++K+     L+    D  +  +  L      ++A V  E   E   +YAC P+FELNNWD VDLP 
Subjt:  LNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALDSYSTDHLLMEVRNL------SIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPI

Query:  FSQDLQE
        FS+D Q+
Subjt:  FSQDLQE

A0A5D3DEB3 Retrotrans_gag domain-containing protein1.7e-28551.31Show/hide
Query:  GKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTPVP---DMEQLEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLE
        GKGK +V+ AQ+SNPIQ+ DDP YPP F P H    M+ PQ QT Q YV TNPLY   P    +E LEA A IQ           +M Q+E   AK+KL+
Subjt:  GKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTPVP---DMEQLEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLE

Query:  VLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSHIHVWKDL
        VLEERL+ IEGTDVYGNIDATQLCLVPGL IPAKFKVP+F+KYDGS+C RSHLIMYCRKM   INNDKLL+HCFQDSLTGPASRWYIQLDN+HIHVWKDL
Subjt:  VLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQDSLTGPASRWYIQLDNSHIHVWKDL

Query:  ADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGIKHGRIAG
        ADAFLKQYKHNIDMAPDRLDLQRMEK++SESFKEYAQRWRDMAAEVQPPLTDKEMT+MFM+TLR+PFYE MI  A T                ++GRI  
Subjt:  ADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIGERIEYGIKHGRIAG

Query:  TTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF----------------------------------------------------------------
                KKGT SKKKEGEV  I F NS K+K  F                                                                
Subjt:  TTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKF----------------------------------------------------------------

Query:  ----ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS------------------------------E
            ELLPQLI+NRQLA IPM P                                      LI   +LS                              E
Subjt:  ----ELLPQLIENRQLALIPMDP-----------------------------------YSLLIQNDYLS------------------------------E

Query:  VCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQLMDSKILMICR--RVSEMREDEVSALTGESS
         CK +V+EI  PME LFE    AGYV  E+LDP +RYEG                         SKVQ+LMDSKIL + R     EM++ ++ AL  E S
Subjt:  VCKEKVYEITTPMEVLFETLSDAGYVKQEHLDPTVRYEG-------------------------SKVQQLMDSKILMICR--RVSEMREDEVSALTGESS

Query:  SHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITGVSGITRSGR---------------------
            E  +SF P+PLT+ Y E+HN S S  NP+ +TIQVP PFKFKD KAVPWRY+CQ +T PS+D+ITG+SGITRSGR                     
Subjt:  SHEREVIESFPPKPLTIIYHENHNVS-SLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDSITGVSGITRSGR---------------------

Query:  -------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI------------------------------------------------------WIVGN
                     D+E PIIAKD+EY K VT+EEA EFLKI                                                       I+GN
Subjt:  -------------DIEEPIIAKDMEYNKPVTEEEAEEFLKI------------------------------------------------------WIVGN

Query:  ITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNIT
        ITSSNSIVF+DDEIPPEGLGHTKALHIQVKC++YVIARVLVDNGSALN+MPKSTLLKLPV+MSHI+SS+MVV+AFDGSRREV+G+IELP+KIGPC FNI 
Subjt:  ITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHIRSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNIT

Query:  FQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKE
        FQVMEITP YSFLLGRPWIHSAGVVPSTLHQKLKF+VG+KL+ + GEEDFLITK +STPYVEA EEALECSFRSFEIAHAT++     E+ + HKSK + 
Subjt:  FQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSFEIAHATIVGGF--ELTRSHKSKAKE

Query:  MAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPP--NVIKLDFRKKRKE---------------------------IDWPALDSYSTDHLLMEVRNL
        M  +++  GGY LN+NLE LL IPSN G FGLGYKP   + I+L  +KK+K                            I + + +S   D LL ++ +L
Subjt:  MAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPP--NVIKLDFRKKRKE---------------------------IDWPALDSYSTDHLLMEVRNL

Query:  SIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPIFSQDLQE
        S+AAV  E   E   +YAC PDFELNNWD VDLP FS+D Q+
Subjt:  SIAAVVNELPHEDGIIYACSPDFELNNWDVVDLPIFSQDLQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGCTCATGGATGCCCTTAAAGGCCATGACATACAGATGTGGAGATTTCTATAGTGTGCCTTTGTTAGGGCCATGGGGAGGCATTAATTATGCACCTCTGTTAGT
TTTACGGCAAATGTGGTTCAAGCAGTTTATACCGGTCACTCACAATCTACAAGAATTAGAGTTTTCTTATGATCTCGAGGACTTCAACGAGAAAAAACGTCAGGCGGTCA
ATGCATGGAAATCTGTGAGGAAGGTGAAAAGTAAAGGGCATTACGAAGGGGTTACTGGTGAATATGAAGCTTGGCAAATGAATAGAAGGAAGAGCATAACAAAGTTCATC
CCAAAGGAGAAAAATGTGAAGCCAGATGAGCCAACACAAGATCAGTTGAGTCAATGGAGTGAAGAATGCAGGAGAATGGAAGAAAGAAATCGGTTATTGGAACAAGAGAA
TGAGAAGCTACGTAAAGAGACAACTCAATGGATGAATCATGCAACCCAAATACAAAAAGAACTTGATAAACTAAAGGGGCGTGTGGATCAAACCATTGAGATTCTTAGAT
TCGTTGCTAGAAGGGCTAATGGCTTTGCAGAATGGGCAGCTGAACTAAGAGCCAATATTTCTCCGATGCAACCTAACGCAGATGACTTAGATAGGAATAAAAATAGCTCA
CAACATTCCAATTCCCATCCCTACAATACAAGATATAAAAGACGAATCATGGAAGAGCAAGGAAAGGATATGGAGAAAATGAAACAAGATATCAATAATCTTAGTGAACA
AGTAGCGAAGATTCTAGAATTACTATCAACTGGCAAAGGGAAAGGGGTTGTAGAAGCAGCGCAGACAAGCAACCCAATCCAGGAAATCGATGATCCTGCTTATCCACCAG
GATTTACTCCTCAACATGTACGACAACGCATGCATTTTCCACAGCCACAAACCCCCCAACAATATGTTGCTACGAATCCACTCTACACTCCTGTTCCAGATATGGAACAA
TTGGAAGCTCATGCTGGAATTCAAATCAATAATATGACAATGTCTGGTCAACAATCGGAGATGAGACAACATGAGGGCGTTCAAGCTAAGGAAAAGTTGGAGGTTCTAGA
AGAAAGATTACGAGCAATTGAAGGGACAGATGTTTATGGAAATATTGATGCCACGCAATTGTGTTTAGTGCCAGGTTTGACGATTCCAGCAAAATTTAAAGTCCCAGAGT
TTGATAAGTATGATGGATCATCATGTCCTAGGAGTCATCTCATAATGTACTGCAGAAAAATGGTCGCACACATCAATAATGACAAGTTATTGATCCATTGCTTTCAAGAT
AGTTTAACTGGGCCAGCCTCTCGATGGTATATACAGTTAGACAATAGTCACATTCATGTTTGGAAGGACCTAGCGGACGCATTTCTAAAGCAGTATAAGCATAATATTGA
CATGGCGCCAGATCGTCTTGACTTGCAGAGGATGGAGAAAAGAACTTCAGAAAGTTTCAAGGAATACGCCCAGAGGTGGAGGGATATGGCTGCGGAGGTTCAACCACCGC
TAACAGATAAGGAAATGACGGCTATGTTTATGCACACTTTGCGATCTCCCTTTTATGAGCACATGATTGGTAACGCAGCAACTAGTTTCTCTGACATCATCGTTATTGGG
GAAAGAATTGAATATGGAATTAAGCATGGAAGGATAGCTGGCACTACAGCTGAATCGACTATGGGAAAGAAAGGAACAACATCAAAGAAGAAGGAAGGGGAGGTTCAAGC
AATTAACTTTACAAACTCTTGGAAGAATAAGCCAAAGTTTGAGCTTTTACCTCAGCTAATTGAGAATCGACAGTTGGCTCTCATCCCAATGGACCCGTACAGCCTCCTTA
TCCAAAATGATTATCTCTCTGAAGTGTGTAAAGAGAAGGTTTATGAGATTACAACGCCTATGGAGGTACTTTTCGAGACTCTTTCTGATGCGGGGTATGTCAAACAAGAG
CACTTGGACCCTACAGTAAGATATGAAGGATCCAAAGTACAACAACTTATGGATTCAAAGATACTCATGATATGTAGAAGAGTAAGCGAAATGAGGGAAGATGAAGTCAG
CGCATTAACAGGAGAATCTTCTTCCCACGAAAGGGAAGTAATTGAGTCCTTCCCTCCTAAGCCTTTGACTATCATATACCATGAGAATCATAATGTTTCATCTCTTCAGA
ATCCTCAACCTATCACAATTCAAGTGCCTGGTCCTTTCAAGTTTAAGGACACAAAAGCAGTACCATGGAGATATGAGTGTCAAACTCTTACAAGTCCATCAATTGACAGC
ATTACAGGAGTTAGTGGAATAACTCGGAGTGGGCGAGATATAGAAGAACCTATCATAGCTAAAGATATGGAATACAATAAGCCTGTCACTGAAGAGGAAGCAGAAGAGTT
CTTAAAGATATGGATTGTTGGAAACATCACATCTTCCAATTCCATTGTCTTTTCTGATGATGAAATTCCTCCTGAGGGTTTAGGTCACACAAAAGCACTACATATTCAAG
TAAAGTGCAGAAACTATGTCATAGCAAGAGTGCTAGTTGATAATGGATCGGCTCTCAATGTGATGCCTAAATCTACATTGTTAAAGCTTCCAGTAGAGATGTCACACATA
AGATCAAGCAGTATGGTTGTAAGAGCATTTGACGGATCGCGTAGAGAAGTTATTGGTAATATTGAATTGCCAATCAAAATTGGCCCGTGTACTTTCAACATAACTTTCCA
AGTCATGGAAATTACACCAGCGTATAGTTTTCTGCTAGGGCGACCTTGGATCCACTCTGCAGGAGTAGTACCATCCACGTTGCACCAAAAATTGAAATTTGTTGTTGGAA
ATAAACTGGTTAGTTTATCGGGAGAGGAGGATTTCTTGATAACCAAGCCCATCTCAACCCCATATGTGGAAGCAGCAGAGGAAGCATTAGAATGTTCTTTTCGTTCTTTT
GAAATTGCCCATGCCACTATAGTGGGAGGGTTTGAACTGACAAGATCTCACAAGTCTAAGGCAAAAGAAATGGCAGCTAAAGTAGTGGTAAATGGAGGATATCGTTTGAA
TCAAAACTTAGAAAGGTTGTTGAGTATACCAAGCAATGTTGGGACATTTGGGTTGGGCTATAAACCACCAAATGTGATAAAATTAGACTTCAGGAAGAAAAGAAAGGAAA
TCGATTGGCCAGCATTGGACTCTTACTCAACAGATCACTTATTAATGGAAGTACGAAACTTGTCAATCGCAGCAGTTGTTAATGAGCTACCACACGAAGATGGCATAATC
TACGCATGTTCACCTGATTTCGAGCTTAACAATTGGGATGTTGTAGATCTACCTATATTTTCACAAGATTTGCAAGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGCTCATGGATGCCCTTAAAGGCCATGACATACAGATGTGGAGATTTCTATAGTGTGCCTTTGTTAGGGCCATGGGGAGGCATTAATTATGCACCTCTGTTAGT
TTTACGGCAAATGTGGTTCAAGCAGTTTATACCGGTCACTCACAATCTACAAGAATTAGAGTTTTCTTATGATCTCGAGGACTTCAACGAGAAAAAACGTCAGGCGGTCA
ATGCATGGAAATCTGTGAGGAAGGTGAAAAGTAAAGGGCATTACGAAGGGGTTACTGGTGAATATGAAGCTTGGCAAATGAATAGAAGGAAGAGCATAACAAAGTTCATC
CCAAAGGAGAAAAATGTGAAGCCAGATGAGCCAACACAAGATCAGTTGAGTCAATGGAGTGAAGAATGCAGGAGAATGGAAGAAAGAAATCGGTTATTGGAACAAGAGAA
TGAGAAGCTACGTAAAGAGACAACTCAATGGATGAATCATGCAACCCAAATACAAAAAGAACTTGATAAACTAAAGGGGCGTGTGGATCAAACCATTGAGATTCTTAGAT
TCGTTGCTAGAAGGGCTAATGGCTTTGCAGAATGGGCAGCTGAACTAAGAGCCAATATTTCTCCGATGCAACCTAACGCAGATGACTTAGATAGGAATAAAAATAGCTCA
CAACATTCCAATTCCCATCCCTACAATACAAGATATAAAAGACGAATCATGGAAGAGCAAGGAAAGGATATGGAGAAAATGAAACAAGATATCAATAATCTTAGTGAACA
AGTAGCGAAGATTCTAGAATTACTATCAACTGGCAAAGGGAAAGGGGTTGTAGAAGCAGCGCAGACAAGCAACCCAATCCAGGAAATCGATGATCCTGCTTATCCACCAG
GATTTACTCCTCAACATGTACGACAACGCATGCATTTTCCACAGCCACAAACCCCCCAACAATATGTTGCTACGAATCCACTCTACACTCCTGTTCCAGATATGGAACAA
TTGGAAGCTCATGCTGGAATTCAAATCAATAATATGACAATGTCTGGTCAACAATCGGAGATGAGACAACATGAGGGCGTTCAAGCTAAGGAAAAGTTGGAGGTTCTAGA
AGAAAGATTACGAGCAATTGAAGGGACAGATGTTTATGGAAATATTGATGCCACGCAATTGTGTTTAGTGCCAGGTTTGACGATTCCAGCAAAATTTAAAGTCCCAGAGT
TTGATAAGTATGATGGATCATCATGTCCTAGGAGTCATCTCATAATGTACTGCAGAAAAATGGTCGCACACATCAATAATGACAAGTTATTGATCCATTGCTTTCAAGAT
AGTTTAACTGGGCCAGCCTCTCGATGGTATATACAGTTAGACAATAGTCACATTCATGTTTGGAAGGACCTAGCGGACGCATTTCTAAAGCAGTATAAGCATAATATTGA
CATGGCGCCAGATCGTCTTGACTTGCAGAGGATGGAGAAAAGAACTTCAGAAAGTTTCAAGGAATACGCCCAGAGGTGGAGGGATATGGCTGCGGAGGTTCAACCACCGC
TAACAGATAAGGAAATGACGGCTATGTTTATGCACACTTTGCGATCTCCCTTTTATGAGCACATGATTGGTAACGCAGCAACTAGTTTCTCTGACATCATCGTTATTGGG
GAAAGAATTGAATATGGAATTAAGCATGGAAGGATAGCTGGCACTACAGCTGAATCGACTATGGGAAAGAAAGGAACAACATCAAAGAAGAAGGAAGGGGAGGTTCAAGC
AATTAACTTTACAAACTCTTGGAAGAATAAGCCAAAGTTTGAGCTTTTACCTCAGCTAATTGAGAATCGACAGTTGGCTCTCATCCCAATGGACCCGTACAGCCTCCTTA
TCCAAAATGATTATCTCTCTGAAGTGTGTAAAGAGAAGGTTTATGAGATTACAACGCCTATGGAGGTACTTTTCGAGACTCTTTCTGATGCGGGGTATGTCAAACAAGAG
CACTTGGACCCTACAGTAAGATATGAAGGATCCAAAGTACAACAACTTATGGATTCAAAGATACTCATGATATGTAGAAGAGTAAGCGAAATGAGGGAAGATGAAGTCAG
CGCATTAACAGGAGAATCTTCTTCCCACGAAAGGGAAGTAATTGAGTCCTTCCCTCCTAAGCCTTTGACTATCATATACCATGAGAATCATAATGTTTCATCTCTTCAGA
ATCCTCAACCTATCACAATTCAAGTGCCTGGTCCTTTCAAGTTTAAGGACACAAAAGCAGTACCATGGAGATATGAGTGTCAAACTCTTACAAGTCCATCAATTGACAGC
ATTACAGGAGTTAGTGGAATAACTCGGAGTGGGCGAGATATAGAAGAACCTATCATAGCTAAAGATATGGAATACAATAAGCCTGTCACTGAAGAGGAAGCAGAAGAGTT
CTTAAAGATATGGATTGTTGGAAACATCACATCTTCCAATTCCATTGTCTTTTCTGATGATGAAATTCCTCCTGAGGGTTTAGGTCACACAAAAGCACTACATATTCAAG
TAAAGTGCAGAAACTATGTCATAGCAAGAGTGCTAGTTGATAATGGATCGGCTCTCAATGTGATGCCTAAATCTACATTGTTAAAGCTTCCAGTAGAGATGTCACACATA
AGATCAAGCAGTATGGTTGTAAGAGCATTTGACGGATCGCGTAGAGAAGTTATTGGTAATATTGAATTGCCAATCAAAATTGGCCCGTGTACTTTCAACATAACTTTCCA
AGTCATGGAAATTACACCAGCGTATAGTTTTCTGCTAGGGCGACCTTGGATCCACTCTGCAGGAGTAGTACCATCCACGTTGCACCAAAAATTGAAATTTGTTGTTGGAA
ATAAACTGGTTAGTTTATCGGGAGAGGAGGATTTCTTGATAACCAAGCCCATCTCAACCCCATATGTGGAAGCAGCAGAGGAAGCATTAGAATGTTCTTTTCGTTCTTTT
GAAATTGCCCATGCCACTATAGTGGGAGGGTTTGAACTGACAAGATCTCACAAGTCTAAGGCAAAAGAAATGGCAGCTAAAGTAGTGGTAAATGGAGGATATCGTTTGAA
TCAAAACTTAGAAAGGTTGTTGAGTATACCAAGCAATGTTGGGACATTTGGGTTGGGCTATAAACCACCAAATGTGATAAAATTAGACTTCAGGAAGAAAAGAAAGGAAA
TCGATTGGCCAGCATTGGACTCTTACTCAACAGATCACTTATTAATGGAAGTACGAAACTTGTCAATCGCAGCAGTTGTTAATGAGCTACCACACGAAGATGGCATAATC
TACGCATGTTCACCTGATTTCGAGCTTAACAATTGGGATGTTGTAGATCTACCTATATTTTCACAAGATTTGCAAGAGTAA
Protein sequenceShow/hide protein sequence
MEGSWMPLKAMTYRCGDFYSVPLLGPWGGINYAPLLVLRQMWFKQFIPVTHNLQELEFSYDLEDFNEKKRQAVNAWKSVRKVKSKGHYEGVTGEYEAWQMNRRKSITKFI
PKEKNVKPDEPTQDQLSQWSEECRRMEERNRLLEQENEKLRKETTQWMNHATQIQKELDKLKGRVDQTIEILRFVARRANGFAEWAAELRANISPMQPNADDLDRNKNSS
QHSNSHPYNTRYKRRIMEEQGKDMEKMKQDINNLSEQVAKILELLSTGKGKGVVEAAQTSNPIQEIDDPAYPPGFTPQHVRQRMHFPQPQTPQQYVATNPLYTPVPDMEQ
LEAHAGIQINNMTMSGQQSEMRQHEGVQAKEKLEVLEERLRAIEGTDVYGNIDATQLCLVPGLTIPAKFKVPEFDKYDGSSCPRSHLIMYCRKMVAHINNDKLLIHCFQD
SLTGPASRWYIQLDNSHIHVWKDLADAFLKQYKHNIDMAPDRLDLQRMEKRTSESFKEYAQRWRDMAAEVQPPLTDKEMTAMFMHTLRSPFYEHMIGNAATSFSDIIVIG
ERIEYGIKHGRIAGTTAESTMGKKGTTSKKKEGEVQAINFTNSWKNKPKFELLPQLIENRQLALIPMDPYSLLIQNDYLSEVCKEKVYEITTPMEVLFETLSDAGYVKQE
HLDPTVRYEGSKVQQLMDSKILMICRRVSEMREDEVSALTGESSSHEREVIESFPPKPLTIIYHENHNVSSLQNPQPITIQVPGPFKFKDTKAVPWRYECQTLTSPSIDS
ITGVSGITRSGRDIEEPIIAKDMEYNKPVTEEEAEEFLKIWIVGNITSSNSIVFSDDEIPPEGLGHTKALHIQVKCRNYVIARVLVDNGSALNVMPKSTLLKLPVEMSHI
RSSSMVVRAFDGSRREVIGNIELPIKIGPCTFNITFQVMEITPAYSFLLGRPWIHSAGVVPSTLHQKLKFVVGNKLVSLSGEEDFLITKPISTPYVEAAEEALECSFRSF
EIAHATIVGGFELTRSHKSKAKEMAAKVVVNGGYRLNQNLERLLSIPSNVGTFGLGYKPPNVIKLDFRKKRKEIDWPALDSYSTDHLLMEVRNLSIAAVVNELPHEDGII
YACSPDFELNNWDVVDLPIFSQDLQE