; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g26860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g26860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr10:19749161..19756360
RNA-Seq ExpressionMoc10g26860
SyntenyMoc10g26860
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]1.2e-19369.62Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPF +D+LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA
        DAIKCRAF+IALTGSARLWY+RLPA SISTYSQLR+EF++ FSSRHYD+KT THLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA
Subjt:  DAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG S SSGR EYRR+E G  RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPS
        IEESGMEKLLKRPEKLRG PE+R+KDKYCRFHR++GHNT+  WELKRQI++LIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFS
        GGQSG KRKELAR ARR                                                                        KKSPTPLVGFS
Subjt:  GGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFS

Query:  EESVSLEGCINLLVTIGQDATQVTQMVEFV
         ESV  EG I+L VT+GQD TQVTQM EFV
Subjt:  EESVSLEGCINLLVTIGQDATQVTQMVEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]9.9e-19161.1Show/hide
Query:  NSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ
        +S+ +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +DGDLGESPF +D+LEA        P +K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  + D+KSKDKG S SSGR E+RR+  G  RSRPYER+TPTTIPISEI
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFG
        LTNIEESGMEKLLKRPEKLRG PE+RNKDKYCRFHR++ HNT+  WELKRQI+DLIQD YFKKFVGKPR++S EKKEERK SRTP RR DRPAVINTIFG
Subjt:  LTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFG

Query:  GPSGGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLV
        GPSGGQSG+KRKELAR ARR                                                                        KKS TPLV
Subjt:  GPSGGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLV

Query:  GFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEE
        GFS ESV  EGCI+L VT+G D TQVTQM EFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVCALE 
Subjt:  GFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEE

Query:  QINHDKQQESGTDLPKEGKRQFSPPTEELELVPLL
         ++ D   E   +LP   +R+F+ PTEELELVPLL
Subjt:  QINHDKQQESGTDLPKEGKRQFSPPTEELELVPLL

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]1.3e-22772.26Show/hide
Query:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITREEFD +K KFD QVEALKARCEKK+ SFDDGDLGE  F +DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWY+RLPAR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  +++ + D KS+DKGPSSSS R +YRRS     +SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSG
        EE+GMEKLLKRPEKLRGDPEKRN DKYCRFHRD+GHNT++ WELKRQI+DLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRR+DRPAVI         
Subjt:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSG

Query:  GQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFSE
            NK+KELAREARR                                                                        KKSPTPLVGFS 
Subjt:  GQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFSE

Query:  ESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINH
        ES+SLEGCI+L V+I QD TQVTQM EFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVCALEEQ   
Subjt:  ESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINH

Query:  DK
        D+
Subjt:  DK

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]6.3e-22264.56Show/hide
Query:  PGAPGEKGVPSIQPGYREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKG PSIQPG REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGVPSIQPGYREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRK
        ARCEKK+  FDD DLGESPF +DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW +RLPARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRK

Query:  EFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKT THLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNG
        PEKQIDQK+LSQ++R+ D KSKDKG SSS  RTEYRRSE G  RSRPYER                                                  
Subjt:  PEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNG

Query:  HNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSGGQSGNKRKELAREARR-------------------
             CWELKRQI+DLIQD YFKKFVGKPRSNSVEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQ  NKRKELA EARR                   
Subjt:  HNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSGGQSGNKRKELAREARR-------------------

Query:  -----------------------------------------------------KKSPTPLVGFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKS
                                                             KKSPTPLVGFS ESVS EGCI+L VTIGQD+TQVTQM EFVV+DG+ 
Subjt:  -----------------------------------------------------KKSPTPLVGFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKS

Query:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS
        AYNAIF RPIIHSF+AVPS LHQVLKYSTPNGVGTVRGEQKTSRECYASALK SSVCALEEQ + D       DLP+E K           L P L+
Subjt:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]6.3e-22282.05Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWY+RLPARSISTYSQLRKEFISQFSS HYDRKT THLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+R+ D KS+DKG SSS+ RTEYRR E G  RSRPYERYT +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRGD EKRNK+KYCRFHRD+GHNTTSCWELKRQI+DLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVIN

Query:  TIFGGPSGGQSGNKRKELAREAR------RKKSPTPLVGFSE---ESVSLE-----------------------GCINLLVTIGQDATQVTQMVEFVVVD
        TIFGGP+GGQSGNKRKELAREAR      R+  PT  + F +   E V L                        GCI+L VTIGQDATQVTQM EFVV+D
Subjt:  TIFGGPSGGQSGNKRKELAREAR------RKKSPTPLVGFSE---ESVSLE-----------------------GCINLLVTIGQDATQVTQMVEFVVVD

Query:  GKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS
        G+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VCALEEQ N  K QES  DLPKEGKRQF PPTEELELVPLLS
Subjt:  GKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS

Query:  PEKQVGP
        PE+Q  P
Subjt:  PEKQVGP

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088136.0e-19469.62Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +DGDLGESPF +D+LEAPIPPKFK P +KPYDGSKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA
        DAIKCRAF+IALTGSARLWY+RLPA SISTYSQLR+EF++ FSSRHYD+KT THLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA
Subjt:  DAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA

Query:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN
        LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG S SSGR EYRR+E G  RSRPYER+TPTTIPISEILTN
Subjt:  LTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTN

Query:  IEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPS
        IEESGMEKLLKRPEKLRG PE+R+KDKYCRFHR++GHNT+  WELKRQI++LIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVINTIFGGPS
Subjt:  IEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPS

Query:  GGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFS
        GGQSG KRKELAR ARR                                                                        KKSPTPLVGFS
Subjt:  GGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFS

Query:  EESVSLEGCINLLVTIGQDATQVTQMVEFV
         ESV  EG I+L VT+GQD TQVTQM EFV
Subjt:  EESVSLEGCINLLVTIGQDATQVTQMVEFV

A0A6J1D9E1 uncharacterized protein LOC1110188234.8e-19161.1Show/hide
Query:  NSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ
        +S+ +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +DGDLGESPF +D+LEA        P +K YDGSKDPKDYVEVFEGLMDFQ
Subjt:  NSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI
        DEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  + D+KSKDKG S SSGR E+RR+  G  RSRPYER+TPTTIPISEI
Subjt:  DEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEI

Query:  LTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFG
        LTNIEESGMEKLLKRPEKLRG PE+RNKDKYCRFHR++ HNT+  WELKRQI+DLIQD YFKKFVGKPR++S EKKEERK SRTP RR DRPAVINTIFG
Subjt:  LTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFG

Query:  GPSGGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLV
        GPSGGQSG+KRKELAR ARR                                                                        KKS TPLV
Subjt:  GPSGGQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLV

Query:  GFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEE
        GFS ESV  EGCI+L VT+G D TQVTQM EFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASALKGSSVCALE 
Subjt:  GFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEE

Query:  QINHDKQQESGTDLPKEGKRQFSPPTEELELVPLL
         ++ D   E   +LP   +R+F+ PTEELELVPLL
Subjt:  QINHDKQQESGTDLPKEGKRQFSPPTEELELVPLL

A0A6J1DHB3 uncharacterized protein LOC1110204796.4e-22872.26Show/hide
Query:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITREEFD +K KFD QVEALKARCEKK+ SFDDGDLGE  F +DILEA IPPKFKTP MKPYDGSKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWY+RLPAR ISTYSQLRKEFISQFSSRHYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI
        TVKL EEAPATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  +++ + D KS+DKGPSSSS R +YRRS     +SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSG
        EE+GMEKLLKRPEKLRGDPEKRN DKYCRFHRD+GHNT++ WELKRQI+DLIQDGYFKKFVGKPRSNSVEKKEERKR RTPPRR+DRPAVI         
Subjt:  EESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSG

Query:  GQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFSE
            NK+KELAREARR                                                                        KKSPTPLVGFS 
Subjt:  GQSGNKRKELAREARR------------------------------------------------------------------------KKSPTPLVGFSE

Query:  ESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINH
        ES+SLEGCI+L V+I QD TQVTQM EFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  K SSVCALEEQ   
Subjt:  ESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINH

Query:  DK
        D+
Subjt:  DK

A0A6J1DPC9 uncharacterized protein LOC1110222803.1e-22264.56Show/hide
Query:  PGAPGEKGVPSIQPGYREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKG PSIQPG REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNS+LKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGVPSIQPGYREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRK
        ARCEKK+  FDD DLGESPF +DI+EAPIPPKFKTP MKPYDGSKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW +RLPARSISTYSQLRK
Subjt:  ARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRK

Query:  EFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS RHYDRKT THLATIRQKE                                   DE LTVKLGEEAPATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNG
        PEKQIDQK+LSQ++R+ D KSKDKG SSS  RTEYRRSE G  RSRPYER                                                  
Subjt:  PEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNG

Query:  HNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSGGQSGNKRKELAREARR-------------------
             CWELKRQI+DLIQD YFKKFVGKPRSNSVEKKEERKRSRTPPRR DRPAVINTIFGGPSGGQ  NKRKELA EARR                   
Subjt:  HNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSGGQSGNKRKELAREARR-------------------

Query:  -----------------------------------------------------KKSPTPLVGFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKS
                                                             KKSPTPLVGFS ESVS EGCI+L VTIGQD+TQVTQM EFVV+DG+ 
Subjt:  -----------------------------------------------------KKSPTPLVGFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKS

Query:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS
        AYNAIF RPIIHSF+AVPS LHQVLKYSTPNGVGTVRGEQKTSRECYASALK SSVCALEEQ + D       DLP+E K           L P L+
Subjt:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS

A0A6J1DZB9 uncharacterized protein LOC1110249043.1e-22282.05Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWY+RLPARSISTYSQLRKEFISQFSS HYDRKT THLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP
        T LADE LTVKLGEEAP TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+R+ D KS+DKG SSS+ RTEYRR E G  RSRPYERYT +TIP
Subjt:  TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVIN
        ISEILTNIEESGMEKLLKRPEKLRGD EKRNK+KYCRFHRD+GHNTTSCWELKRQI+DLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRR DRPAVIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVIN

Query:  TIFGGPSGGQSGNKRKELAREAR------RKKSPTPLVGFSE---ESVSLE-----------------------GCINLLVTIGQDATQVTQMVEFVVVD
        TIFGGP+GGQSGNKRKELAREAR      R+  PT  + F +   E V L                        GCI+L VTIGQDATQVTQM EFVV+D
Subjt:  TIFGGPSGGQSGNKRKELAREAR------RKKSPTPLVGFSE---ESVSLE-----------------------GCINLLVTIGQDATQVTQMVEFVVVD

Query:  GKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS
        G+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASALKGS+VCALEEQ N  K QES  DLPKEGKRQF PPTEELELVPLLS
Subjt:  GKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQESGTDLPKEGKRQFSPPTEELELVPLLS

Query:  PEKQVGP
        PE+Q  P
Subjt:  PEKQVGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAAGAGAAGGAAGCAAGGCTTGATGATTTTGGTGGAGAAGTTGCTGGAATTTATAGCTGTGCAGAATTGGTTAGGCGGGAATGAGCTGAGGTTGAGCTCTTCGTC
TTGCCGGGATGAGCTGAGATTGAGCTCTTCCTCTTGGGGAGCTATCTTGATAGCTCTTCCTCTAGTCGGGGATGAGCTGAGATTGAGCTCTTCCTCTTGCCGGGATGACC
TCGGTGCAAGAATAGTCGATGACCAGGTCCGAGCAGGGCAAGGGGGAGATCTGCTGCGCAGATCTGCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCC
TTAAAAGCCAACAGAGGCCGAGGAGGGACATCGAGAAAAACCTCCCAAAGGGCCAACCAGGCAGCAGACCCTGAAGCTCTGTCTGCTCTTCAGCGCGAGTTGGATGGTAT
GCGCCATCGGTTGCGCACAATGGAAGAGATGTACGCCGAGGCAACGCGCGCTAACCGAACTGCATCTCCCTCTATGGCTCCGGGCGCACCCGGTGAAAAGGGAGTTCCAT
CTATCCAACCTGGCTATCGCGAGCCCATTCCTAACGATGGGGGAGTTGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCT
CGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGGACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAAGA
GTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCA
TCGCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGACGGGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCCTC
ATGGACTTTCAAGCGGCGACGGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCAAAGACTGCCGGCCAGGTCGATCTCGAC
CTACTCCCAGTTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGACATTATGATAGAAAGACAACGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACGCTGA
GAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTCCTCACCGGCCTGGCCGATGAGGCCTTAACCGTA
AAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAACA
GATCGATCAGAAGAAACTTAGCCAGGAGAGGAGGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCCTCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCG
GCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGACCT
GAGAAGCTCCGAGGAGACCCAGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGACAAGCTGCTGGGAACTGAAGCGCCAGATTAA
AGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGAATG
ACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAAGCGAGGCGCAAGAAGAGTCCAACACCC
CTGGTTGGATTCTCTGAAGAATCGGTCTCCCTAGAGGGGTGCATCAACCTGCTGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGTCGAGTTCGTGGTAGT
CGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAG
TGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGCGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGACAAGCAGCAG
GAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAAGTCGGTCCCGGTCG
AAATCCTAGACACTCCTTCAATCTTGGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAA
GAGCCGAAGGAGTAAAAGAAGATGACGAGAAGAGCAGCTTGGTTCACGCTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCCTGCCTCTCCTCAAGTGTTCGAGATC
GGCATACCAACAGACAGGATAAAACAGTATGAGCCAATGAAGAACGAGGAAGAGCTACTTCTTAACCTGGACTTGTTGGAAGGGAAAAGGGAAATGGCTCAGCTACGCTT
AGTAGAGTATCATAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTAAGAAAAATTCAGAGTCGTGTTGGCACCC
TTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATAGTCCGACCTGAAACTTATATGCTGGCCGACTTGGAAGGAAAAGTGCTTGCGCATCCATGGAACGCGGAG
CACTTGAAGCGCTATTACCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATCAAGAGAAGGAAGCAAGGCTTGATGATTTTGGTGGAGAAGTTGCTGGAATTTATAGCTGTGCAGAATTGGTTAGGCGGGAATGAGCTGAGGTTGAGCTCTTCGTC
TTGCCGGGATGAGCTGAGATTGAGCTCTTCCTCTTGGGGAGCTATCTTGATAGCTCTTCCTCTAGTCGGGGATGAGCTGAGATTGAGCTCTTCCTCTTGCCGGGATGACC
TCGGTGCAAGAATAGTCGATGACCAGGTCCGAGCAGGGCAAGGGGGAGATCTGCTGCGCAGATCTGCCCGTCATGCGAACCAAGAGCTACCACCTGCTCACCCGAAACCC
TTAAAAGCCAACAGAGGCCGAGGAGGGACATCGAGAAAAACCTCCCAAAGGGCCAACCAGGCAGCAGACCCTGAAGCTCTGTCTGCTCTTCAGCGCGAGTTGGATGGTAT
GCGCCATCGGTTGCGCACAATGGAAGAGATGTACGCCGAGGCAACGCGCGCTAACCGAACTGCATCTCCCTCTATGGCTCCGGGCGCACCCGGTGAAAAGGGAGTTCCAT
CTATCCAACCTGGCTATCGCGAGCCCATTCCTAACGATGGGGGAGTTGATTACAGCTTGCGAGACAACGATTTGAGAAAGCATCTCACTGAAAAGAAGAAGAGAGCATCT
CGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGGACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAAGA
GTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGGCGACTTGGGAGAATCGCCATTCA
TCGCGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTCAAGACTCCCGCCATGAAGCCCTATGACGGGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCCTC
ATGGACTTTCAAGCGGCGACGGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCAAAGACTGCCGGCCAGGTCGATCTCGAC
CTACTCCCAGTTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTCGACATTATGATAGAAAGACAACGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACGCTGA
GAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTCCTCACCGGCCTGGCCGATGAGGCCTTAACCGTA
AAACTTGGAGAGGAAGCTCCAGCCACTTTCGCCGAAGTTTTACAGAAGGCGAAGAAAGTCATTGATGGGCAGGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAACA
GATCGATCAGAAGAAACTTAGCCAGGAGAGGAGGAGGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCCTCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCG
GCTCCATCCGGAGCCGACCTTATGAGCGGTATACTCCAACCACCATCCCCATCTCCGAGATACTCACGAACATTGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGACCT
GAGAAGCTCCGAGGAGACCCAGAGAAGCGCAACAAAGATAAGTACTGCCGTTTTCACCGCGATAACGGCCATAATACGACAAGCTGCTGGGAACTGAAGCGCCAGATTAA
AGATCTCATTCAAGATGGCTACTTCAAAAAATTTGTGGGCAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAGGAAGCGTTCAAGAACGCCGCCTCGCCGGAATG
ACCGACCTGCAGTCATCAACACTATTTTCGGAGGCCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAAGCGAGGCGCAAGAAGAGTCCAACACCC
CTGGTTGGATTCTCTGAAGAATCGGTCTCCCTAGAGGGGTGCATCAACCTGCTGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGTCGAGTTCGTGGTAGT
CGACGGCAAATCGGCCTACAACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAG
TGGGCACGGTCCGAGGTGAACAAAAAACTTCAAGAGAGTGCTACGCGTCCGCGCTCAAGGGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGACAAGCAGCAG
GAGTCAGGGACCGACCTGCCAAAAGAAGGTAAAAGGCAGTTCTCCCCGCCAACAGAAGAGCTCGAGCTTGTTCCTTTACTTAGCCCCGAAAAACAAGTCGGTCCCGGTCG
AAATCCTAGACACTCCTTCAATCTTGGAACCAGATGTGATGAAGGTGGATACTCCGTCACCCACTTGGATGGACCCAATCGTGGAGTTCATCAAAGGAAGCCCACCGCAA
GAGCCGAAGGAGTAAAAGAAGATGACGAGAAGAGCAGCTTGGTTCACGCTCCGAGAAGGAGTGTTGTACCGACGTGGCTTCTCCCTGCCTCTCCTCAAGTGTTCGAGATC
GGCATACCAACAGACAGGATAAAACAGTATGAGCCAATGAAGAACGAGGAAGAGCTACTTCTTAACCTGGACTTGTTGGAAGGGAAAAGGGAAATGGCTCAGCTACGCTT
AGTAGAGTATCATAACAGAATGGCCAGACATTACAATGCCCGAGTTCGACCTCGAAGCTTCCAAGTTGGACATTTGGTCTTAAGAAAAATTCAGAGTCGTGTTGGCACCC
TTGACCCAAGTTGGGAGGGACCGTTCGAAGTCAAAGGCATAGTCCGACCTGAAACTTATATGCTGGCCGACTTGGAAGGAAAAGTGCTTGCGCATCCATGGAACGCGGAG
CACTTGAAGCGCTATTACCCCTGA
Protein sequenceShow/hide protein sequence
MIKRRKQGLMILVEKLLEFIAVQNWLGGNELRLSSSSCRDELRLSSSSWGAILIALPLVGDELRLSSSSCRDDLGARIVDDQVRAGQGGDLLRRSARHANQELPPAHPKP
LKANRGRGGTSRKTSQRANQAADPEALSALQRELDGMRHRLRTMEEMYAEATRANRTASPSMAPGAPGEKGVPSIQPGYREPIPNDGGVDYSLRDNDLRKHLTEKKKRAS
REPEDSPSYSREFSNSDLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDGDLGESPFIADILEAPIPPKFKTPAMKPYDGSKDPKDYVEVFEGL
MDFQAATDAIKCRAFQIALTGSARLWYQRLPARSISTYSQLRKEFISQFSSRHYDRKTTTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTV
KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRRIDVKSKDKGPSSSSGRTEYRRSEGGSIRSRPYERYTPTTIPISEILTNIEESGMEKLLKRP
EKLRGDPEKRNKDKYCRFHRDNGHNTTSCWELKRQIKDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRRNDRPAVINTIFGGPSGGQSGNKRKELAREARRKKSPTP
LVGFSEESVSLEGCINLLVTIGQDATQVTQMVEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASALKGSSVCALEEQINHDKQQ
ESGTDLPKEGKRQFSPPTEELELVPLLSPEKQVGPGRNPRHSFNLGTRCDEGGYSVTHLDGPNRGVHQRKPTARAEGVKEDDEKSSLVHAPRRSVVPTWLLPASPQVFEI
GIPTDRIKQYEPMKNEEELLLNLDLLEGKREMAQLRLVEYHNRMARHYNARVRPRSFQVGHLVLRKIQSRVGTLDPSWEGPFEVKGIVRPETYMLADLEGKVLAHPWNAE
HLKRYYP