; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g00640 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g00640
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:394543..396651
RNA-Seq ExpressionMoc07g00640
SyntenyMoc07g00640
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]9.6e-23379.02Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +D DLGESPF  D+LEAPIPPK K P +KPYD SKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA
        DAIKCRAF+IALTGSARLWYR LPA SISTYSQLR+EF++ FSS HYD+KT THLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA
Subjt:  DAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA

Query:  LTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI
        LTVKLGEE PATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG  SSGR EYRR+E G  RSRPYE++TPTTIPISEILTNI
Subjt:  LTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG
        EESGMEKLLKRPEKLRG P++R+KDKYC FHR++GHNT+  WELKRQIE+L QDGYFKKFV KPR++S EKKEE+KRS TPPRR DRP VINTIFGGPSG
Subjt:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
        GQSG KRKELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIA LIDH     VLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCINLPVTIGQDATQVTQMAEFV
        ESV PEG I+LPVT+GQD TQVTQMAEFV
Subjt:  ESVSPEGCINLPVTIGQDATQVTQMAEFV

XP_022150760.1 uncharacterized protein LOC111018823 [Momordica charantia]1.4e-22068.46Show/hide
Query:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +D DLGESPF  D+LEA        P +K YD SKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEIL
        DEALTVKLG+E PATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  K D+KSKDKG  SSGR E+RR+  G  RSRPYE++TPTTIPISEIL
Subjt:  DEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGG
        TNIEESGMEKLLKRPEKLRG P++RNKDKYC FHR++ HNT+  WELKRQIEDL QD YFKKFV KPR++S EKKEE+K S TP RR DRP VINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGG

Query:  PSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSG+KRKELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIA LIDH     VLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ
        FS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASA K SSVCALE  
Subjt:  FSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ

Query:  INHGKQQESGTDLPK
        ++     E   +LP+
Subjt:  INHGKQQESGTDLPK

XP_022152854.1 uncharacterized protein LOC111020479 [Momordica charantia]2.9e-26180.74Show/hide
Query:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITREEFD +K KFD QVEALKARCEKK+ SFDD DLGE  F  DILEA IPPK KTP MKPYD SKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWYR LPAR ISTYSQLRKEFISQFSS HYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGP-SSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI
        TVKL EE PATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  +++ K D KS+DKGP SSS R +YRRS     +SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGP-SSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG
        EE+GMEKLLKRPEKLRGDP+KRN DKYC FHRD+GHNT++ WELKRQIEDL QDGYFKKFV KPRSNSVEKKEE+KR  TPPRR+DRP VI         
Subjt:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLID-----HVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI F  ADL G+HLPHNDALVIA LID      +LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLID-----HVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ
        ES+S EGCI+LPV+I QD TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  KRSSVCALEEQ
Subjt:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ

XP_022155139.1 uncharacterized protein LOC111022280 [Momordica charantia]2.7e-25973.82Show/hide
Query:  PGAPGEKGVPSIQPIDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKG PSIQP +REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGVPSIQPIDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRK
        ARCEKK+  FDDDDLGESPF  DI+EAPIPPK KTP MKPYD SKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW R LPARSISTYSQLRK
Subjt:  ARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRK

Query:  EFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS  HYDRKT THLATIRQKE                                   DE LTVKLGEE PATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIDQKKLSQERRKIDVKSKDKGPSSSG-RTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNG
        PEKQIDQK+LSQ++RK D KSKDKG SSSG RTEYRRSE G  RSRPYE+                                                  
Subjt:  PEKQIDQKKLSQERRKIDVKSKDKGPSSSG-RTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNG

Query:  HNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDA
             CWELKRQIEDL QD YFKKFV KPRSNSVEKKEE+KRS TPPRR DRP VINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D 
Subjt:  HNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDA

Query:  DLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS
        DL G+HLPHNDALVIA LIDH     VLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+ 
Subjt:  DLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS

Query:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQINHGKQQESGTDLPKEGK
        AYNAIF RPIIHSF+AVPS LHQVLKYSTPNGVGTVRGEQKTSRECYASA KRSSVCALEE       Q S  DLP+E K
Subjt:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQINHGKQQESGTDLPKEGK

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]4.5e-23081.3Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWYR LPARSISTYSQLRKEFISQFSSWHYDRKT THLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSS-GRTEYRRSEGGSIRSRPYEQYTPTTIP
        T LADE LTVKLGEE P TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+RK D KS+DKG SSS  RTEYRR E G  RSRPYE+YT +TIP
Subjt:  TGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSS-GRTEYRRSEGGSIRSRPYEQYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVIN
        ISEILTNIEESGMEKLLKRPEKLRGD +KRNK+KYC FHRD+GHNTTSCWELKRQIEDL QDGYFKKFV KPRSNSVEKKEE+KRS TPPRR DRP VIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVIN

Query:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+GGQSGNKRKELAREARREVCIIRE KPTCSITFGDADL G+HLPHNDALVIA LIDH     VL+DG                          
Subjt:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVC
                      GCI+LPVTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASA K S+VC
Subjt:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVC

Query:  ALEEQINHGKQQESGTDLPKEGKR
        ALEEQ N GK QES  DLPKEGKR
Subjt:  ALEEQINHGKQQESGTDLPKEGKR

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088134.6e-23379.02Show/hide
Query:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAAT
        +KA+S   P  P  VITREEFD ++ + D QVEALKA+CE+K+   +D DLGESPF  D+LEAPIPPK K P +KPYD SKDPKDYVEVFE LMDFQAA+
Subjt:  LKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAAT

Query:  DAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA
        DAIKCRAF+IALTGSARLWYR LPA SISTYSQLR+EF++ FSS HYD+KT THLATIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA
Subjt:  DAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEA

Query:  LTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI
        LTVKLGEE PATFAEVLQKAKKVIDGQELLRTKTGRPE++I + +  ++    D KSKDKG  SSGR EYRR+E G  RSRPYE++TPTTIPISEILTNI
Subjt:  LTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG
        EESGMEKLLKRPEKLRG P++R+KDKYC FHR++GHNT+  WELKRQIE+L QDGYFKKFV KPR++S EKKEE+KRS TPPRR DRP VINTIFGGPSG
Subjt:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
        GQSG KRKELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIA LIDH     VLVDGG SANILSLPTYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCINLPVTIGQDATQVTQMAEFV
        ESV PEG I+LPVT+GQD TQVTQMAEFV
Subjt:  ESVSPEGCINLPVTIGQDATQVTQMAEFV

A0A6J1D9E1 uncharacterized protein LOC1110188237.0e-22168.46Show/hide
Query:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQ
        +SN +A+S + P  P+ VITREEFD ++ K + QVEALKA+CE+K+   +D DLGESPF  D+LEA        P +K YD SKDPKDYVEVFEGLMDFQ
Subjt:  NSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQ

Query:  AATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA
        AA+DAIKCRAFQIALTGSARLW                                                     FQE+QLKV   SDDSAMCYFLTGLA
Subjt:  AATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLA

Query:  DEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEIL
        DEALTVKLG+E PATFAEVLQKAKKVIDGQELLRTKTGRPE+ ID+ + S +  K D+KSKDKG  SSGR E+RR+  G  RSRPYE++TPTTIPISEIL
Subjt:  DEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEIL

Query:  TNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGG
        TNIEESGMEKLLKRPEKLRG P++RNKDKYC FHR++ HNT+  WELKRQIEDL QD YFKKFV KPR++S EKKEE+K S TP RR DRP VINTIFGG
Subjt:  TNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGG

Query:  PSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG
        PSGGQSG+KRKELAR ARREVCIIREQ+PTC ITF  ADL  +HLPHNDALVIA LIDH     VLVD G SANI+SL TYLALGWTRSQLKKS TPLVG
Subjt:  PSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVG

Query:  FSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ
        FS ESV PEGCI+LPVT+G D TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPNGVG VRGEQ  SRECYASA K SSVCALE  
Subjt:  FSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ

Query:  INHGKQQESGTDLPK
        ++     E   +LP+
Subjt:  INHGKQQESGTDLPK

A0A6J1DHB3 uncharacterized protein LOC1110204791.4e-26180.74Show/hide
Query:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATD
        KA+S Y P+ P  VITREEFD +K KFD QVEALKARCEKK+ SFDD DLGE  F  DILEA IPPK KTP MKPYD SKDPKDYVEVFE LMDFQAATD
Subjt:  KAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATD

Query:  AIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL
        AIKC AFQIALTGSARLWYR LPAR ISTYSQLRKEFISQFSS HYDRKT THLATIRQKEGETLREYVTRF EEQLKV HCSDDSAMCYFLTGLADE L
Subjt:  AIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGP-SSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI
        TVKL EE PATFAEVLQK KKVIDGQELLRTKTGRPEK IDQ +  +++ K D KS+DKGP SSS R +YRRS     +SRPYE YTPTTIPI EILTNI
Subjt:  TVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGP-SSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNI

Query:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG
        EE+GMEKLLKRPEKLRGDP+KRN DKYC FHRD+GHNT++ WELKRQIEDL QDGYFKKFV KPRSNSVEKKEE+KR  TPPRR+DRP VI         
Subjt:  EESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSG

Query:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLID-----HVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG
            NK+KELAREARREVCIIREQ+PT SI F  ADL G+HLPHNDALVIA LID      +LVDGGASANILSL TYLALGWTRSQLKKSPTPLVGFSG
Subjt:  GQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLID-----HVLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSG

Query:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ
        ES+S EGCI+LPV+I QD TQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYST NGVGTVRGE KTSRECYAS  KRSSVCALEEQ
Subjt:  ESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQ

A0A6J1DPC9 uncharacterized protein LOC1110222801.3e-25973.82Show/hide
Query:  PGAPGEKGVPSIQPIDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK
        PGAPGEKG PSIQP +REPIPND GVDYSLRDNDLRKHLT+KKK+AS EPEDS SYSREFSNSNLKAQSKYKPL PEAVI REEFDLMKH+FDEQVEALK
Subjt:  PGAPGEKGVPSIQPIDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITREEFDLMKHKFDEQVEALK

Query:  ARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRK
        ARCEKK+  FDDDDLGESPF  DI+EAPIPPK KTP MKPYD SKDPKDYVEVFEGLMDFQAATDAIKC AFQIALTGSARLW R LPARSISTYSQLRK
Subjt:  ARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRK

Query:  EFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGR
        EFI QFS  HYDRKT THLATIRQKE                                   DE LTVKLGEE PATFAEVLQ AKKVIDGQELLRTKT R
Subjt:  EFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGR

Query:  PEKQIDQKKLSQERRKIDVKSKDKGPSSSG-RTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNG
        PEKQIDQK+LSQ++RK D KSKDKG SSSG RTEYRRSE G  RSRPYE+                                                  
Subjt:  PEKQIDQKKLSQERRKIDVKSKDKGPSSSG-RTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNG

Query:  HNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDA
             CWELKRQIEDL QD YFKKFV KPRSNSVEKKEE+KRS TPPRR DRP VINTIFGGPSGGQ  NKRKELA EARR+V IIREQKPTCSITF D 
Subjt:  HNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDA

Query:  DLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS
        DL G+HLPHNDALVIA LIDH     VLVDGGASANILSLPTYLAL  TRSQLKKSPTPLVGFS ESVSPEGCI+LPVTIGQD+TQVTQMAEFVV+DG+ 
Subjt:  DLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKS

Query:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQINHGKQQESGTDLPKEGK
        AYNAIF RPIIHSF+AVPS LHQVLKYSTPNGVGTVRGEQKTSRECYASA KRSSVCALEE       Q S  DLP+E K
Subjt:  AYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVCALEEQINHGKQQESGTDLPKEGK

A0A6J1DZB9 uncharacterized protein LOC1110249042.2e-23081.3Show/hide
Query:  MDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL
        MDFQAATDAIKCRAFQIALTGSARLWYR LPARSISTYSQLRKEFISQFSSWHYDRKT THLATIRQKE ETLREYVTRFQEEQLKV HCSDDSAMCYFL
Subjt:  MDFQAATDAIKCRAFQIALTGSARLWYRTLPARSISTYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFL

Query:  TGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSS-GRTEYRRSEGGSIRSRPYEQYTPTTIP
        T LADE LTVKLGEE P TF EVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQE+RK D KS+DKG SSS  RTEYRR E G  RSRPYE+YT +TIP
Subjt:  TGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQERRKIDVKSKDKGPSSS-GRTEYRRSEGGSIRSRPYEQYTPTTIP

Query:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVIN
        ISEILTNIEESGMEKLLKRPEKLRGD +KRNK+KYC FHRD+GHNTTSCWELKRQIEDL QDGYFKKFV KPRSNSVEKKEE+KRS TPPRR DRP VIN
Subjt:  ISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIEDLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVIN

Query:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSP
        TIFGGP+GGQSGNKRKELAREARREVCIIRE KPTCSITFGDADL G+HLPHNDALVIA LIDH     VL+DG                          
Subjt:  TIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDH-----VLVDGGASANILSLPTYLALGWTRSQLKKSP

Query:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVC
                      GCI+LPVTIGQDATQVTQMAEFVV+DG+SAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPN VG VRGEQKTSRECYASA K S+VC
Subjt:  TPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGEQKTSRECYASAFKRSSVC

Query:  ALEEQINHGKQQESGTDLPKEGKR
        ALEEQ N GK QES  DLPKEGKR
Subjt:  ALEEQINHGKQQESGTDLPKEGKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCCATCGGTTGCGCACAATGGAAGAGATGTACGCCGAGGCAACGCACGCTAACCGAACTGCGTCTCCCTCTATGGCTCCGGGCGCACCCGGTGAAAAGGGAGTTCC
ATCTATCCAACCTATCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGATAACGATTTGAGAAAGCATCTTACTGAAAAGAAGAAGAGAGCAT
CTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAA
GAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGACGACTTGGGAGAATCGCCATT
CATCGTGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTGAAGACTCCCGCCATGAAGCCCTATGACGAGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCC
TCATGGACTTTCAAGCGGCGACGGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGAACACTGCCAGCCAGGTCGATCTCG
ACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTTGGCATTATGATAGAAAGACAGGGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACACT
GAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTCCTCACCGGTTTGGCCGATGAGGCCTTAACCG
TAAAACTTGGAGAGGAAACTCCAGCCACTTTCGCCGAAGTTTTACAGAAAGCGAAGAAAGTCATTGATGGGCAAGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAA
CAGATCGATCAGAAGAAACTTAGCCAGGAGAGGAGGAAGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGG
CTCCATCCGGAGCCGACCTTATGAGCAGTATACTCCAACTACCATCCCCATCTCAGAGATACTCACGAACATTGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGACCTG
AGAAGCTTCGAGGAGACCCAAAGAAGCGCAACAAAGATAAGTACTGCTGTTTTCACCGCGATAACGGCCATAATACGACAAGCTGCTGGGAATTGAAGCGCCAGATTGAA
GATCTCACTCAAGATGGCTACTTCAAAAAATTTGTGGACAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAAGAAGCGTTCAATAACGCCGCCTCGCCGGAATGA
CCGGCCTACAGTCATCAACACTATTTTCGGAGGCCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATCAGGG
AGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCCCTTCTCATTGATCACGTCCTGGTT
GATGGAGGCGCGTCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCAGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGAGA
ATCGGTCTCCCCAGAGGGGTGCATCAACCTACCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACA
ACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAGTGGGCACGGTCCGAGGTGAA
CAAAAAACTTCACGAGAGTGCTACGCGTCCGCGTTCAAGAGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCC
AAAAGAAGGTAAAAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCGCCATCGGTTGCGCACAATGGAAGAGATGTACGCCGAGGCAACGCACGCTAACCGAACTGCGTCTCCCTCTATGGCTCCGGGCGCACCCGGTGAAAAGGGAGTTCC
ATCTATCCAACCTATCGATCGCGAGCCCATTCCGAACGATGGGGGAGTGGATTACAGCTTGCGAGATAACGATTTGAGAAAGCATCTTACTGAAAAGAAGAAGAGAGCAT
CTCGGGAGCCGGAAGACTCTCCTTCCTACTCCCGAGAATTCTCCAACTCGAACCTAAAGGCTCAATCAAAATACAAGCCCCTGGCACCAGAAGCTGTGATCACGAGGGAA
GAGTTCGACCTGATGAAGCACAAGTTCGATGAGCAGGTCGAGGCGCTTAAGGCCAGGTGCGAGAAGAAAGACTGTTCGTTCGACGATGACGACTTGGGAGAATCGCCATT
CATCGTGGACATCCTGGAGGCTCCAATCCCTCCGAAGTTGAAGACTCCCGCCATGAAGCCCTATGACGAGTCTAAGGACCCTAAAGATTATGTTGAGGTTTTCGAGGGCC
TCATGGACTTTCAAGCGGCGACGGATGCGATCAAATGCCGCGCCTTCCAGATCGCGCTTACCGGTAGCGCGCGCCTGTGGTATCGAACACTGCCAGCCAGGTCGATCTCG
ACCTACTCCCAGCTGAGAAAGGAGTTCATTAGTCAATTCTCCTCTTGGCATTATGATAGAAAGACAGGGACTCACCTTGCCACCATCAGGCAGAAGGAAGGAGAGACACT
GAGAGAGTATGTCACACGGTTCCAGGAAGAGCAGCTTAAGGTCGTGCACTGCTCTGACGATTCGGCTATGTGCTACTTCCTCACCGGTTTGGCCGATGAGGCCTTAACCG
TAAAACTTGGAGAGGAAACTCCAGCCACTTTCGCCGAAGTTTTACAGAAAGCGAAGAAAGTCATTGATGGGCAAGAGCTCCTCCGAACCAAGACTGGCCGACCTGAAAAA
CAGATCGATCAGAAGAAACTTAGCCAGGAGAGGAGGAAGATTGATGTCAAGTCCAAAGATAAGGGACCATCCTCCAGTGGCAGAACAGAGTACCGAAGGTCGGAGGGCGG
CTCCATCCGGAGCCGACCTTATGAGCAGTATACTCCAACTACCATCCCCATCTCAGAGATACTCACGAACATTGAGGAAAGTGGGATGGAAAAGCTCCTCAAGCGACCTG
AGAAGCTTCGAGGAGACCCAAAGAAGCGCAACAAAGATAAGTACTGCTGTTTTCACCGCGATAACGGCCATAATACGACAAGCTGCTGGGAATTGAAGCGCCAGATTGAA
GATCTCACTCAAGATGGCTACTTCAAAAAATTTGTGGACAAACCGAGGTCTAACTCGGTCGAAAAGAAAGAAGAGAAGAAGCGTTCAATAACGCCGCCTCGCCGGAATGA
CCGGCCTACAGTCATCAACACTATTTTCGGAGGCCCGAGCGGGGGCCAGTCCGGAAACAAGAGGAAGGAGCTAGCTCGCGAAGCCAGGCGCGAGGTATGCATCATCAGGG
AGCAGAAACCTACTTGCTCCATCACTTTCGGCGATGCCGATTTGGGGGGGATTCATTTGCCCCACAATGACGCGCTCGTGATCGCCCTTCTCATTGATCACGTCCTGGTT
GATGGAGGCGCGTCTGCGAACATCTTGTCCCTCCCAACATATTTAGCATTGGGATGGACCAGGTCACAATTGAAGAAGAGTCCAACACCCCTGGTTGGATTCTCTGGAGA
ATCGGTCTCCCCAGAGGGGTGCATCAACCTACCGGTAACTATAGGGCAAGATGCCACCCAAGTAACGCAGATGGCCGAGTTCGTGGTAGTCGACGGCAAATCGGCCTACA
ACGCCATTTTTGGGAGACCCATTATCCACTCATTTCGAGCTGTTCCTTCCACGTTGCATCAAGTCTTGAAGTACTCAACCCCTAATGGAGTGGGCACGGTCCGAGGTGAA
CAAAAAACTTCACGAGAGTGCTACGCGTCCGCGTTCAAGAGATCGTCGGTATGTGCCCTGGAGGAACAAATCAATCATGGCAAGCAGCAGGAGTCAGGGACCGACCTGCC
AAAAGAAGGTAAAAGGTAG
Protein sequenceShow/hide protein sequence
MRHRLRTMEEMYAEATHANRTASPSMAPGAPGEKGVPSIQPIDREPIPNDGGVDYSLRDNDLRKHLTEKKKRASREPEDSPSYSREFSNSNLKAQSKYKPLAPEAVITRE
EFDLMKHKFDEQVEALKARCEKKDCSFDDDDLGESPFIVDILEAPIPPKLKTPAMKPYDESKDPKDYVEVFEGLMDFQAATDAIKCRAFQIALTGSARLWYRTLPARSIS
TYSQLRKEFISQFSSWHYDRKTGTHLATIRQKEGETLREYVTRFQEEQLKVVHCSDDSAMCYFLTGLADEALTVKLGEETPATFAEVLQKAKKVIDGQELLRTKTGRPEK
QIDQKKLSQERRKIDVKSKDKGPSSSGRTEYRRSEGGSIRSRPYEQYTPTTIPISEILTNIEESGMEKLLKRPEKLRGDPKKRNKDKYCCFHRDNGHNTTSCWELKRQIE
DLTQDGYFKKFVDKPRSNSVEKKEEKKRSITPPRRNDRPTVINTIFGGPSGGQSGNKRKELAREARREVCIIREQKPTCSITFGDADLGGIHLPHNDALVIALLIDHVLV
DGGASANILSLPTYLALGWTRSQLKKSPTPLVGFSGESVSPEGCINLPVTIGQDATQVTQMAEFVVVDGKSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNGVGTVRGE
QKTSRECYASAFKRSSVCALEEQINHGKQQESGTDLPKEGKR