; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g13480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g13480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr7:10134119..10144846
RNA-Seq ExpressionMoc07g13480
SyntenyMoc07g13480
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.2e-12549.14Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCKPVD-LHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR
        R E D LR Q   ++E LK +C+  +   +  +  E  F+  +L+APIPP+FKAP +  YDGS DP  YVEVFE  MDF AASDA+KCRAF+IAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCKPVD-LHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML
        LWYR+L   SI +Y QLR  F+  F +R   K + +HL  ++Q++ E+L EY+ RF +E +KV  C+D+ AM YF TGL D  LT++LG   PA+  E+L
Subjt:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML

Query:  ARARQYIDGFELWKANGA------WRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA
         +A++ IDG EL +           R   GKD +   P  K + S S G        +++ RR E  P+  R   +++FTP    I+EI    E++ +E 
Subjt:  ARARQYIDGFELWKANGA------WRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA

Query:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG
        L    EKL   P +R K  YCRFH++HGH+TS  + LK Q+E+LI+ GY KK+VG   K     + ++EER++S  P R+ DRPA INTI GGPS GQSG
Subjt:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG

Query:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS
        +KRK LAR    EVC    + P  PI FD  D E V +PHNDALVIAPLIDHV V RVLVDGG S NILS  TY  LGW R  LK SPTPLVGF  ESV 
Subjt:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS

Query:  AEGCISLPVTIGEGGQQVTKVAEFV
         EG I LPVT+G+   QVT++AEFV
Subjt:  AEGCISLPVTIGEGGQQVTKVAEFV

XP_022149029.1 uncharacterized protein LOC111017548 [Momordica charantia]2.4e-13779.56Show/hide
Query:  MDFLAASDAMKCRAFQIALEGSARLWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFT
        MDFLAASDA+KCRAFQIALEGS RLWY+QLKPRSIDSY+QLR LFINQF ARQLLKL PSHL  VKQRDNESLTEYIAR MDEHVKVVSCTD+IAMMYFT
Subjt:  MDFLAASDAMKCRAFQIALEGSARLWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFT

Query:  TGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPL
        TGLNDRNLTIE G+RPPASLN+MLARARQYIDG ELWKA GA RS+ GKDRDQ+S PPKK+ SD   SSR+A D++S+G+ +E+  SDR GPKFDKFTPL
Subjt:  TGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPL

Query:  NASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKED
        NAS+AEIYA  E+TD++ALF A +KLHRP GKRDKRLYCRFHKDHGH++SRCFHLKEQV+DLIRRGYLKKYVG RE+ +PEGS REE+RE+S  P RKED
Subjt:  NASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKED

Query:  RPAFINTIHGGPSRGQSG
        RPA INTIHGGPS  +SG
Subjt:  RPAFINTIHGGPSRGQSG

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]7.5e-12345.03Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCKPVDLHHVAEQEEPSFSPAILDAPIPPR-FKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR
        R E D LR +   ++E LK +C+        ++E P     + ++P      +AP + SYDGS DP  YVEVFEG MDF AASDA+KCRAFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCKPVDLHHVAEQEEPSFSPAILDAPIPPR-FKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML
        LW                                                     F ++ +KV   +D+ AM YF TGL D  LT++LG   PA+  E+L
Subjt:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML

Query:  ARARQYIDGFELWKA------NGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA
         +A++ IDG EL +        G  R   GKD   +    K +   S+ S R      ++ RR    P+  R   +++FTP    I+EI    E++ +E 
Subjt:  ARARQYIDGFELWKA------NGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA

Query:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG
        L    EKL   P +R+K  YCRFH++H H+TS  + LK Q+EDLI+  Y KK+VG   K     + ++EER+ S  P R+ DRPA INTI GGPS GQSG
Subjt:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG

Query:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS
         KRK LAR    EVC    + P  PI FD  D E V +PHNDALVIAPLIDHV V RVLVD G S NI+S  TY  LGW R  LK S TPLVGF  ESV 
Subjt:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS

Query:  AEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCAAVT-----DEP
         EGCI LPVT+G    QVT++AEFVVID  SAYNAI GRP+IH  RA+PST HQVLKY TP G+  +RGEQ  SRECYA+A+KG+  CA  T        
Subjt:  AEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCAAVT-----DEP

Query:  EPYADKSEPSRGTPAEKLELVPLL
        E  A+        P E+LELVPLL
Subjt:  EPYADKSEPSRGTPAEKLELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.6e-13949.23Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQC-KPVDLHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR
        R E D L+ +F  ++E LK +C K        +  E SFS  IL+A IPP+FK P +  YDGS DP  YVEVFE  MDF AA+DA+KC AFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQC-KPVDLHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML
        LWYR+L  R I +Y QLR  FI+QF +R   + +P+HL  ++Q++ E+L EY+ RF +E +KV  C+D+ AM YF TGL D  LT++L    PA+  E+L
Subjt:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML

Query:  ARARQYIDGFELWKA-NGAWRSNCGKDR---DQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALF
         + ++ IDG EL +   G    N  + R   D+     K +      SS R D  +S    N+  P       ++ +TP    I EI    E+T +E L 
Subjt:  ARARQYIDGFELWKA-NGAWRSNCGKDR---DQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALF

Query:  AALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQK
           EKL   P KR+   YCRFH+DHGH+TS  + LK Q+EDLI+ GY KK+VG+      E   ++EER++   P R++DRPA IN             K
Subjt:  AALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQK

Query:  RKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAE
        +K LARE   EVC    + P   I F+  D EGV +PHNDALVIAPLID V V R+LVDGGAS NILS STY  LGW R  LK SPTPLVGF  ES+S E
Subjt:  RKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAE

Query:  GCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCA
        GCI LPV+I +   QVT++AEFVVID  SAYNAI GRP+IH  RAVPST HQVLKY T  G+ T+RGE KTSRECYA+  K +  CA
Subjt:  GCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCA

XP_022158844.1 uncharacterized protein LOC111025310 [Momordica charantia]3.3e-16380.32Show/hide
Query:  MDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGR
        MDEHVKVVSCTD+IAMMYFTTGLNDRNLTIE  +RPPASLNEM ARARQYIDG ELWKANGA RS+ G+DRD KSPP KK+C D   SSRRADD++S+ R
Subjt:  MDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGR

Query:  RNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEP
        R+E+  S+RRGPKFDKFTPLNASIAEIYA  EDTD+E LFA+ EKL RP GKR+KRLYCRFHKDHGHDTSRCFHLKEQVEDLIR GYLKKYVG RE+ E 
Subjt:  RNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEP

Query:  EGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDG
        EGSAREE+RE+S  PR KEDRPA INTIHGGPS  +SGQKRKALAREVAHEVCTSY K PV+PILFD+QD E V MPHNDALVIAPLIDHVKV RV VDG
Subjt:  EGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDG

Query:  GASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAY
        GAS NI SFSTYT LGWER+HLK   T LVGF  ESVS EGCISLPVTI EG  QVT+VAEFVVIDRSSAY
Subjt:  GASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAY

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.0e-12549.14Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCKPVD-LHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR
        R E D LR Q   ++E LK +C+  +   +  +  E  F+  +L+APIPP+FKAP +  YDGS DP  YVEVFE  MDF AASDA+KCRAF+IAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCKPVD-LHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML
        LWYR+L   SI +Y QLR  F+  F +R   K + +HL  ++Q++ E+L EY+ RF +E +KV  C+D+ AM YF TGL D  LT++LG   PA+  E+L
Subjt:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML

Query:  ARARQYIDGFELWKANGA------WRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA
         +A++ IDG EL +           R   GKD +   P  K + S S G        +++ RR E  P+  R   +++FTP    I+EI    E++ +E 
Subjt:  ARARQYIDGFELWKANGA------WRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA

Query:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG
        L    EKL   P +R K  YCRFH++HGH+TS  + LK Q+E+LI+ GY KK+VG   K     + ++EER++S  P R+ DRPA INTI GGPS GQSG
Subjt:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG

Query:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS
        +KRK LAR    EVC    + P  PI FD  D E V +PHNDALVIAPLIDHV V RVLVDGG S NILS  TY  LGW R  LK SPTPLVGF  ESV 
Subjt:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS

Query:  AEGCISLPVTIGEGGQQVTKVAEFV
         EG I LPVT+G+   QVT++AEFV
Subjt:  AEGCISLPVTIGEGGQQVTKVAEFV

A0A6J1D5T3 uncharacterized protein LOC1110175481.2e-13779.56Show/hide
Query:  MDFLAASDAMKCRAFQIALEGSARLWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFT
        MDFLAASDA+KCRAFQIALEGS RLWY+QLKPRSIDSY+QLR LFINQF ARQLLKL PSHL  VKQRDNESLTEYIAR MDEHVKVVSCTD+IAMMYFT
Subjt:  MDFLAASDAMKCRAFQIALEGSARLWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFT

Query:  TGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPL
        TGLNDRNLTIE G+RPPASLN+MLARARQYIDG ELWKA GA RS+ GKDRDQ+S PPKK+ SD   SSR+A D++S+G+ +E+  SDR GPKFDKFTPL
Subjt:  TGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPL

Query:  NASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKED
        NAS+AEIYA  E+TD++ALF A +KLHRP GKRDKRLYCRFHKDHGH++SRCFHLKEQV+DLIRRGYLKKYVG RE+ +PEGS REE+RE+S  P RKED
Subjt:  NASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKED

Query:  RPAFINTIHGGPSRGQSG
        RPA INTIHGGPS  +SG
Subjt:  RPAFINTIHGGPSRGQSG

A0A6J1D9E1 uncharacterized protein LOC1110188233.6e-12345.03Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQCKPVDLHHVAEQEEPSFSPAILDAPIPPR-FKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR
        R E D LR +   ++E LK +C+        ++E P     + ++P      +AP + SYDGS DP  YVEVFEG MDF AASDA+KCRAFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQCKPVDLHHVAEQEEPSFSPAILDAPIPPR-FKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML
        LW                                                     F ++ +KV   +D+ AM YF TGL D  LT++LG   PA+  E+L
Subjt:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML

Query:  ARARQYIDGFELWKA------NGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA
         +A++ IDG EL +        G  R   GKD   +    K +   S+ S R      ++ RR    P+  R   +++FTP    I+EI    E++ +E 
Subjt:  ARARQYIDGFELWKA------NGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEA

Query:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG
        L    EKL   P +R+K  YCRFH++H H+TS  + LK Q+EDLI+  Y KK+VG   K     + ++EER+ S  P R+ DRPA INTI GGPS GQSG
Subjt:  LFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSG

Query:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS
         KRK LAR    EVC    + P  PI FD  D E V +PHNDALVIAPLIDHV V RVLVD G S NI+S  TY  LGW R  LK S TPLVGF  ESV 
Subjt:  QKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVS

Query:  AEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCAAVT-----DEP
         EGCI LPVT+G    QVT++AEFVVID  SAYNAI GRP+IH  RA+PST HQVLKY TP G+  +RGEQ  SRECYA+A+KG+  CA  T        
Subjt:  AEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCAAVT-----DEP

Query:  EPYADKSEPSRGTPAEKLELVPLL
        E  A+        P E+LELVPLL
Subjt:  EPYADKSEPSRGTPAEKLELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204791.2e-13949.23Show/hide
Query:  RSEVDLLRDQFQREIEDLKRQC-KPVDLHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR
        R E D L+ +F  ++E LK +C K        +  E SFS  IL+A IPP+FK P +  YDGS DP  YVEVFE  MDF AA+DA+KC AFQIAL GSAR
Subjt:  RSEVDLLRDQFQREIEDLKRQC-KPVDLHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSAR

Query:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML
        LWYR+L  R I +Y QLR  FI+QF +R   + +P+HL  ++Q++ E+L EY+ RF +E +KV  C+D+ AM YF TGL D  LT++L    PA+  E+L
Subjt:  LWYRQLKPRSIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEML

Query:  ARARQYIDGFELWKA-NGAWRSNCGKDR---DQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALF
         + ++ IDG EL +   G    N  + R   D+     K +      SS R D  +S    N+  P       ++ +TP    I EI    E+T +E L 
Subjt:  ARARQYIDGFELWKA-NGAWRSNCGKDR---DQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALF

Query:  AALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQK
           EKL   P KR+   YCRFH+DHGH+TS  + LK Q+EDLI+ GY KK+VG+      E   ++EER++   P R++DRPA IN             K
Subjt:  AALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQK

Query:  RKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAE
        +K LARE   EVC    + P   I F+  D EGV +PHNDALVIAPLID V V R+LVDGGAS NILS STY  LGW R  LK SPTPLVGF  ES+S E
Subjt:  RKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAE

Query:  GCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCA
        GCI LPV+I +   QVT++AEFVVID  SAYNAI GRP+IH  RAVPST HQVLKY T  G+ T+RGE KTSRECYA+  K +  CA
Subjt:  GCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNAIIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCA

A0A6J1E0L8 uncharacterized protein LOC1110253101.6e-16380.32Show/hide
Query:  MDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGR
        MDEHVKVVSCTD+IAMMYFTTGLNDRNLTIE  +RPPASLNEM ARARQYIDG ELWKANGA RS+ G+DRD KSPP KK+C D   SSRRADD++S+ R
Subjt:  MDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWKANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGR

Query:  RNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEP
        R+E+  S+RRGPKFDKFTPLNASIAEIYA  EDTD+E LFA+ EKL RP GKR+KRLYCRFHKDHGHDTSRCFHLKEQVEDLIR GYLKKYVG RE+ E 
Subjt:  RNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHKDHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEP

Query:  EGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDG
        EGSAREE+RE+S  PR KEDRPA INTIHGGPS  +SGQKRKALAREVAHEVCTSY K PV+PILFD+QD E V MPHNDALVIAPLIDHVKV RV VDG
Subjt:  EGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQKRKALAREVAHEVCTSYLKEPVIPILFDDQDSEGVRMPHNDALVIAPLIDHVKVIRVLVDG

Query:  GASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAY
        GAS NI SFSTYT LGWER+HLK   T LVGF  ESVS EGCISLPVTI EG  QVT+VAEFVVIDRSSAY
Subjt:  GASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAY

SwissProt top hitse value%identityAlignment
Q5RBK0 Gypsy retrotransposon integrase-like protein 12.0e-0933.04Show/hide
Query:  PSHARKLRRQAAHYLIQESKLFKRGYSLPLLRW--LDPEEARYVMREIHEGICGSHEGAQSMAAKVVRQGYYWPTIERDTREFARACDKCQHFAN-FLRQ
        PS  R +RR A  ++ +E KLF  G      R   +  EE + V+RE HE   G+H G  S    +V   YYW ++  D +++  AC  CQ   N  +  
Subjt:  PSHARKLRRQAAHYLIQESKLFKRGYSLPLLRW--LDPEEARYVMREIHEGICGSHEGAQSMAAKVVRQGYYWPTIERDTREFARACDKCQHFAN-FLRQ

Query:  PPEMLTPISSPW
        P + L  + +PW
Subjt:  PPEMLTPISSPW

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGACCACCAAGAGATCTTCCTAGCTATTCTCAAGGCTTGGAATAGAGTGAAATGGGCAGCGACGGTGCAGCGGGCGGCTACAGCGGGAGGAGATGGTGCA
GCGGGCGATGGCGACGAGAGGAGATGGTACAGTGAGCGGCGGTGGCGGCGGGAGGAGATAGTGCAGTGGGCGATAGCGGCGGCGAGAGGAGATCGTGAGAGCTTC
AAGCTTGGTTGGGTGACAAAAATGGAGGAGAGGGTCTCGAGAGTGTTCGGAGGGGGTGTTGGGCAGTTGGCTGTAGTTCGTTCTAACCGTGGCTGGGACTCCCGT
CATCGAATGACGACCCCAGGCGGGGGTGGTCAAGGAAAATGGAGGTCAGCCGGCCACATCCGATCCAGTAGCAGTCCGGTACTTCCACCTCGCCTCAGATCAGTT
TCCGCCACTATAGCCTCAAGGGAACGGGTTGCCGCCCCGTGCACTGACTGGGTGTTTATGCTGAACGCACGCACTAAGCGTGGTGCGCGCTCCGGGGCAAGTGCC
GACGCAGGCGTGGACCCCGTCATGGTAGCCAACGTGATCGTCGAGCTTACGGAAGTCAAGGCCAGGCTCGAAGCAGTTGAGAGAGGCAACAAGGTGTCCGGCTCT
TCTGTCTCCAGGTATCCCAATCGAGGGAAGGGGCTGATGCATCCGACCCAGAGAATGGAGTATCAGTTCCGACCTCGCAGGGAGGCCCGAGCTGGAGCACCCTTG
CGAAGGCCACAACCGGTGGGGACAGGCAATGCCCTGGGAGCACAGGTATGTGACCACCCTCAGCAGGACAACCGGGTCGAGGACCGACGCCCGAGGGTCCGACCA
ATTCGGACTCCCCTGGCATCTTTCGATAGCTACAACGCTCATCAGGGCCGAGAGGTCGACGATCGAGAGAGGTCCTCCCACGGTGACCATTTGTTTCGGTCTGAA
GTGGACCTCCTCCGGGATCAGTTTCAAAGGGAGATAGAAGATCTCAAGCGGCAGTGCAAGCCTGTAGATCTGCATCACGTGGCCGAGCAAGAGGAACCGTCTTTC
TCCCCAGCGATCCTAGACGCACCTATCCCACCGAGGTTCAAGGCCCCGGTCATAAGTTCCTACGACGGGTCTGGAGATCCGATCTCCTACGTGGAGGTGTTCGAG
GGGAAGATGGATTTCCTGGCCGCGAGCGACGCCATGAAGTGCCGAGCATTTCAAATAGCCTTGGAGGGCTCGGCAAGATTGTGGTACCGACAGTTGAAGCCCCGA
TCCATCGACAGTTATCGGCAGTTGAGAGGGTTGTTCATCAACCAATTCTTAGCTCGGCAGTTGTTGAAGTTGTCGCCCTCTCACCTCAGAATAGTGAAGCAACGG
GACAATGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGACGAGCATGTCAAAGTGGTAAGTTGCACCGATGAGATTGCCATGATGTACTTCACTACGGGTTTG
AATGACAGGAACTTGACGATAGAGTTAGGAAACCGACCGCCGGCCTCCCTAAACGAGATGCTCGCCAGAGCTCGCCAGTACATCGACGGCTTTGAGTTGTGGAAA
GCCAATGGAGCCTGGCGAAGCAATTGTGGCAAAGATCGGGACCAAAAGTCCCCTCCTCCCAAGAAGCAATGCAGCGACAGTTGGGGATCGTCTCGGCGGGCCGAC
GACAATCAGAGTAAAGGTCGTCGCAATGAGAAAGCCCCTTCAGACCGTCGAGGGCCAAAGTTCGACAAGTTCACTCCGCTGAACGCCTCAATCGCGGAGATCTAC
GCAGCGGCCGAAGATACCGACCTGGAGGCGCTTTTCGCAGCCCTAGAAAAGCTCCACCGACCTCCAGGGAAACGAGACAAGCGACTCTACTGCCGATTCCACAAG
GATCATGGCCACGACACTTCACGCTGTTTCCACTTGAAGGAGCAGGTCGAAGATCTGATCCGGAGGGGTTATTTGAAAAAATACGTCGGCAGGCGTGAAAAGATA
GAGCCAGAGGGGTCGGCTCGAGAGGAAGAGCGAGAAAAGTCGCCACTGCCGAGACGGAAGGAAGATCGTCCCGCCTTTATAAATACCATCCATGGGGGCCCGAGT
AGGGGACAGTCAGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGTAGCACACGAGGTTTGTACCTCGTACCTCAAGGAGCCTGTGATACCGATCCTGTTCGACGAC
CAAGACAGCGAAGGAGTGCGCATGCCTCATAATGACGCCCTAGTAATCGCCCCACTCATAGATCACGTGAAGGTGATAAGAGTTCTTGTCGACGGTGGAGCGTCG
GTCAATATTTTATCATTCTCGACCTACACGACCCTAGGGTGGGAGAGGAAACACCTAAAGCTCAGCCCGACGCCTTTGGTCGGTTTTGTAGAGGAGTCAGTCAGC
GCGGAAGGATGCATCTCGCTCCCCGTTACCATCGGCGAGGGAGGTCAACAAGTAACTAAGGTTGCGGAGTTTGTTGTGATTGATCGGAGCTCTGCGTACAACGCC
ATAATTGGTCGGCCATTGATTCATGATCTCCGGGCAGTTCCGTCCACTTACCACCAGGTCTTGAAGTACCCCACGCCGGCCGGAATTGCGACAATCCGAGGTGAG
CAAAAAACGTCCAGAGAATGCTACGCTGCCGCGATGAAGGGAACAGTCACTTGTGCAGCGGTCACGGACGAGCCAGAGCCATATGCCGACAAATCAGAGCCGAGC
CGCGGTACCCCAGCTGAAAAGCTAGAACTTGTCCCCCTGCTGGGGCCAGAAAAGCAGGTGCCGAGGTCCGAAAACTCCAATGCCAACGCATTGGCTCGCCTAGCC
TCGGCATATGAGACCGACCTACCAAGAACGGTTCCAGTTGAAATACTCGTCGAGTCGTCCATCGACCAGCCTGAGGTAATGGAGATCCAGTCAGCTCAGCCTACA
TGGATGGACCCGATTAAGGAGTTCCTGATCAGTGGCTCCGTCCCTGCTGATCCGAGCCATGCCAGAAAACTCCGACGCCAAGCTGCTCACTACTTGATACAGGAA
AGCAAGCTCTTCAAGAGGGGATATTCCCTACCATTGCTGCGGTGGCTCGACCCAGAGGAGGCGAGGTATGTCATGCGCGAGATTCACGAAGGGATTTGCGGAAGC
CACGAAGGAGCTCAGTCCATGGCCGCCAAGGTCGTTCGCCAGGGCTACTATTGGCCGACGATTGAACGCGACACCAGGGAGTTTGCTAGAGCATGCGACAAATGC
CAACATTTCGCGAACTTTTTGAGACAACCTCCAGAGATGCTGACACCGATCTCAAGCCCTTGGCCTTTCGCTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGACCACCAAGAGATCTTCCTAGCTATTCTCAAGGCTTGGAATAGAGTGAAATGGGCAGCGACGGTGCAGCGGGCGGCTACAGCGGGAGGAGATGGTGCA
GCGGGCGATGGCGACGAGAGGAGATGGTACAGTGAGCGGCGGTGGCGGCGGGAGGAGATAGTGCAGTGGGCGATAGCGGCGGCGAGAGGAGATCGTGAGAGCTTC
AAGCTTGGTTGGGTGACAAAAATGGAGGAGAGGGTCTCGAGAGTGTTCGGAGGGGGTGTTGGGCAGTTGGCTGTAGTTCGTTCTAACCGTGGCTGGGACTCCCGT
CATCGAATGACGACCCCAGGCGGGGGTGGTCAAGGAAAATGGAGGTCAGCCGGCCACATCCGATCCAGTAGCAGTCCGGTACTTCCACCTCGCCTCAGATCAGTT
TCCGCCACTATAGCCTCAAGGGAACGGGTTGCCGCCCCGTGCACTGACTGGGTGTTTATGCTGAACGCACGCACTAAGCGTGGTGCGCGCTCCGGGGCAAGTGCC
GACGCAGGCGTGGACCCCGTCATGGTAGCCAACGTGATCGTCGAGCTTACGGAAGTCAAGGCCAGGCTCGAAGCAGTTGAGAGAGGCAACAAGGTGTCCGGCTCT
TCTGTCTCCAGGTATCCCAATCGAGGGAAGGGGCTGATGCATCCGACCCAGAGAATGGAGTATCAGTTCCGACCTCGCAGGGAGGCCCGAGCTGGAGCACCCTTG
CGAAGGCCACAACCGGTGGGGACAGGCAATGCCCTGGGAGCACAGGTATGTGACCACCCTCAGCAGGACAACCGGGTCGAGGACCGACGCCCGAGGGTCCGACCA
ATTCGGACTCCCCTGGCATCTTTCGATAGCTACAACGCTCATCAGGGCCGAGAGGTCGACGATCGAGAGAGGTCCTCCCACGGTGACCATTTGTTTCGGTCTGAA
GTGGACCTCCTCCGGGATCAGTTTCAAAGGGAGATAGAAGATCTCAAGCGGCAGTGCAAGCCTGTAGATCTGCATCACGTGGCCGAGCAAGAGGAACCGTCTTTC
TCCCCAGCGATCCTAGACGCACCTATCCCACCGAGGTTCAAGGCCCCGGTCATAAGTTCCTACGACGGGTCTGGAGATCCGATCTCCTACGTGGAGGTGTTCGAG
GGGAAGATGGATTTCCTGGCCGCGAGCGACGCCATGAAGTGCCGAGCATTTCAAATAGCCTTGGAGGGCTCGGCAAGATTGTGGTACCGACAGTTGAAGCCCCGA
TCCATCGACAGTTATCGGCAGTTGAGAGGGTTGTTCATCAACCAATTCTTAGCTCGGCAGTTGTTGAAGTTGTCGCCCTCTCACCTCAGAATAGTGAAGCAACGG
GACAATGAGTCCCTGACGGAGTACATCGCTCGGTTCATGGACGAGCATGTCAAAGTGGTAAGTTGCACCGATGAGATTGCCATGATGTACTTCACTACGGGTTTG
AATGACAGGAACTTGACGATAGAGTTAGGAAACCGACCGCCGGCCTCCCTAAACGAGATGCTCGCCAGAGCTCGCCAGTACATCGACGGCTTTGAGTTGTGGAAA
GCCAATGGAGCCTGGCGAAGCAATTGTGGCAAAGATCGGGACCAAAAGTCCCCTCCTCCCAAGAAGCAATGCAGCGACAGTTGGGGATCGTCTCGGCGGGCCGAC
GACAATCAGAGTAAAGGTCGTCGCAATGAGAAAGCCCCTTCAGACCGTCGAGGGCCAAAGTTCGACAAGTTCACTCCGCTGAACGCCTCAATCGCGGAGATCTAC
GCAGCGGCCGAAGATACCGACCTGGAGGCGCTTTTCGCAGCCCTAGAAAAGCTCCACCGACCTCCAGGGAAACGAGACAAGCGACTCTACTGCCGATTCCACAAG
GATCATGGCCACGACACTTCACGCTGTTTCCACTTGAAGGAGCAGGTCGAAGATCTGATCCGGAGGGGTTATTTGAAAAAATACGTCGGCAGGCGTGAAAAGATA
GAGCCAGAGGGGTCGGCTCGAGAGGAAGAGCGAGAAAAGTCGCCACTGCCGAGACGGAAGGAAGATCGTCCCGCCTTTATAAATACCATCCATGGGGGCCCGAGT
AGGGGACAGTCAGGGCAGAAGAGAAAAGCTCTGGCTCGGGAGGTAGCACACGAGGTTTGTACCTCGTACCTCAAGGAGCCTGTGATACCGATCCTGTTCGACGAC
CAAGACAGCGAAGGAGTGCGCATGCCTCATAATGACGCCCTAGTAATCGCCCCACTCATAGATCACGTGAAGGTGATAAGAGTTCTTGTCGACGGTGGAGCGTCG
GTCAATATTTTATCATTCTCGACCTACACGACCCTAGGGTGGGAGAGGAAACACCTAAAGCTCAGCCCGACGCCTTTGGTCGGTTTTGTAGAGGAGTCAGTCAGC
GCGGAAGGATGCATCTCGCTCCCCGTTACCATCGGCGAGGGAGGTCAACAAGTAACTAAGGTTGCGGAGTTTGTTGTGATTGATCGGAGCTCTGCGTACAACGCC
ATAATTGGTCGGCCATTGATTCATGATCTCCGGGCAGTTCCGTCCACTTACCACCAGGTCTTGAAGTACCCCACGCCGGCCGGAATTGCGACAATCCGAGGTGAG
CAAAAAACGTCCAGAGAATGCTACGCTGCCGCGATGAAGGGAACAGTCACTTGTGCAGCGGTCACGGACGAGCCAGAGCCATATGCCGACAAATCAGAGCCGAGC
CGCGGTACCCCAGCTGAAAAGCTAGAACTTGTCCCCCTGCTGGGGCCAGAAAAGCAGGTGCCGAGGTCCGAAAACTCCAATGCCAACGCATTGGCTCGCCTAGCC
TCGGCATATGAGACCGACCTACCAAGAACGGTTCCAGTTGAAATACTCGTCGAGTCGTCCATCGACCAGCCTGAGGTAATGGAGATCCAGTCAGCTCAGCCTACA
TGGATGGACCCGATTAAGGAGTTCCTGATCAGTGGCTCCGTCCCTGCTGATCCGAGCCATGCCAGAAAACTCCGACGCCAAGCTGCTCACTACTTGATACAGGAA
AGCAAGCTCTTCAAGAGGGGATATTCCCTACCATTGCTGCGGTGGCTCGACCCAGAGGAGGCGAGGTATGTCATGCGCGAGATTCACGAAGGGATTTGCGGAAGC
CACGAAGGAGCTCAGTCCATGGCCGCCAAGGTCGTTCGCCAGGGCTACTATTGGCCGACGATTGAACGCGACACCAGGGAGTTTGCTAGAGCATGCGACAAATGC
CAACATTTCGCGAACTTTTTGAGACAACCTCCAGAGATGCTGACACCGATCTCAAGCCCTTGGCCTTTCGCTCAGTAG
Protein sequenceShow/hide protein sequence
MKDHQEIFLAILKAWNRVKWAATVQRAATAGGDGAAGDGDERRWYSERRWRREEIVQWAIAAARGDRESFKLGWVTKMEERVSRVFGGGVGQLAVVRSNRGWDSR
HRMTTPGGGGQGKWRSAGHIRSSSSPVLPPRLRSVSATIASRERVAAPCTDWVFMLNARTKRGARSGASADAGVDPVMVANVIVELTEVKARLEAVERGNKVSGS
SVSRYPNRGKGLMHPTQRMEYQFRPRREARAGAPLRRPQPVGTGNALGAQVCDHPQQDNRVEDRRPRVRPIRTPLASFDSYNAHQGREVDDRERSSHGDHLFRSE
VDLLRDQFQREIEDLKRQCKPVDLHHVAEQEEPSFSPAILDAPIPPRFKAPVISSYDGSGDPISYVEVFEGKMDFLAASDAMKCRAFQIALEGSARLWYRQLKPR
SIDSYRQLRGLFINQFLARQLLKLSPSHLRIVKQRDNESLTEYIARFMDEHVKVVSCTDEIAMMYFTTGLNDRNLTIELGNRPPASLNEMLARARQYIDGFELWK
ANGAWRSNCGKDRDQKSPPPKKQCSDSWGSSRRADDNQSKGRRNEKAPSDRRGPKFDKFTPLNASIAEIYAAAEDTDLEALFAALEKLHRPPGKRDKRLYCRFHK
DHGHDTSRCFHLKEQVEDLIRRGYLKKYVGRREKIEPEGSAREEEREKSPLPRRKEDRPAFINTIHGGPSRGQSGQKRKALAREVAHEVCTSYLKEPVIPILFDD
QDSEGVRMPHNDALVIAPLIDHVKVIRVLVDGGASVNILSFSTYTTLGWERKHLKLSPTPLVGFVEESVSAEGCISLPVTIGEGGQQVTKVAEFVVIDRSSAYNA
IIGRPLIHDLRAVPSTYHQVLKYPTPAGIATIRGEQKTSRECYAAAMKGTVTCAAVTDEPEPYADKSEPSRGTPAEKLELVPLLGPEKQVPRSENSNANALARLA
SAYETDLPRTVPVEILVESSIDQPEVMEIQSAQPTWMDPIKEFLISGSVPADPSHARKLRRQAAHYLIQESKLFKRGYSLPLLRWLDPEEARYVMREIHEGICGS
HEGAQSMAAKVVRQGYYWPTIERDTREFARACDKCQHFANFLRQPPEMLTPISSPWPFAQ