CuGenDBv2

Sgr014974 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr014974
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: E3 ubiquitin-protein ligase complex slx8-rfp subunit slx8-like
Genome location: tig00002486:740558..750807
RNA-Seq Expression: Sgr014974
Synteny: Sgr014974
Gene Ontology terms: NA
InterPro domains:
IPR001841 - Zinc finger, RING-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017907 - Zinc finger, RING-type, conserved site
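
The genome location above follows a `contig:start..end` pattern. As a minimal sketch, the coordinates can be parsed and the span of the gene model computed as follows. The function names `parse_location` and `span_length` are hypothetical, and the calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00002486:740558..750807")
print(contig, start, end, span_length(start, end))
# prints: tig00002486 740558 750807 10250
```

Under that assumption the gene model spans 10,250 bp on contig tig00002486. This is the genomic span, which includes any introns, so it is expected to be longer than the CDS listed further down.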


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582484.1 hypothetical protein SDJN03_22486, partial [Cucurbita argyrosperma subsp. sororia] | e value 1.2e-125 | identity 69.11%
Query:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEA
        +KIMPESMEATPSVS +LDLQAVRSRI ELEELQRSL EDEA STDSLGSEKLLKECALHLESRL Q+LSECSNVDSFLGI+DLDAY+EH+KEELV VEA
Subjt:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEA

Query:  ESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE------------------------------
        ESSKISNEIEVLKRTNIE SN+L++DLE++ +SLDRFTSQD +K T N CS++GEDQ NVI   ECNAFE                              
Subjt:  ESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE------------------------------

Query:  ---VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------
           VEDTIGGLKVI VADNFIRLSLRTHIPNLED SSLQRLEGMI PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKS              
Subjt:  ---VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------

Query:  ------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
                          S+ F+Y DQDETI+C MIGGIDA IKVSQGWPL+DSPLKL+SLKSSDHYTKG SLSL+CKVE M
Subjt:  ------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

KAG7018870.1 hypothetical protein SDJN02_20743, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value 2.1e-181 | identity 59.08%
Query:  ARDVERRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVYGALSDVTSW
        AR VERRT +D DLNCPPPDE IDPTGPH+E AQY NH++ Q     D VDED+AIISPRKFAEARKNFRRNHFESS GV +R NG+TEVY AL+DV+SW
Subjt:  ARDVERRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVYGALSDVTSW

Query:  PPFTIWPALTINNNVSVRE-QTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSIPPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRS
        PPFTIW   T  N+VS++E QT HNLDL LS E+SSR     TD +  +  A +SSI PA   LRCAICIEPLVEETTTKCGHIFCRNCIE AIATQH  
Subjt:  PPFTIWPALTINNNVSVRE-QTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSIPPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRS

Query:  PGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRLRKEKIMPESMEATPSVSPNLDLQAVRSR---ISELEELQRSLEEDEASSTDSLG
                  K P+    Y  L S   F+H     +   I     L   +I     E    +  +  +   + R    S+LEELQRSL EDEA STDSLG
Subjt:  PGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRLRKEKIMPESMEATPSVSPNLDLQAVRSR---ISELEELQRSLEEDEASSTDSLG

Query:  SEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIE------VLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPD
        SEKLLKECALHLESRL Q+LSECSNVDSFLGI+DLDAY+EH+KEELV VEAESSKISNEIE      VL    I  SN+L++DLE++ +SLDRFTSQD +
Subjt:  SEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIE------VLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPD

Query:  KATSNCCSVDGEDQ-NVIDKRECNAFE-------------------------------------------------VEDTIGGLKVIDVADNFIRLSLRT
        K T N CS++GEDQ NVI   ECNAFE                                                 VEDTIGGLKVI VADNFIRLSLRT
Subjt:  KATSNCCSVDGEDQ-NVIDKRECNAFE-------------------------------------------------VEDTIGGLKVIDVADNFIRLSLRT

Query:  HIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------------------------SYCFEYFDQ
        HIPNLED SSLQRLEGMI PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKS                                S+ F+Y DQ
Subjt:  HIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------------------------SYCFEYFDQ

Query:  DETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
        DETI+C MIGGIDA IKVSQGWPL+DSPLKL+SLKSSDHYTKG SLSL+CKVE M
Subjt:  DETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

KGN56746.2 hypothetical protein Csa_011800 [Cucumis sativus] | e value 4.0e-201 | identity 63.72%
Query:  MSIQSTNDLREWSARDVERRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGN
        MSIQS+NDL +WS+R +ERRTV+DFDLNCPPPDE IDPTG  +E AQYYNHYQGQ     D +DED+AIISPRKFAEARKNFRRNHFES CG  IR NGN
Subjt:  MSIQSTNDLREWSARDVERRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGN

Query:  TEVYGALSDVTSWPPFTIWPALTINNNVSVRE-QTIHNLDLCLSSESSSR-TRTKATDDIPSAVLAQSSSIPPAAPSLRCAICIEPLVEETTTKCGHIFC
        TEVYGALSDVT+WPPFTIW  LTI+NNVS++E QTIHNLDL LS ESSSR T+ K   DIPS  LA SSSIPP   +LRCAICIEPLVEETTTKCGH   
Subjt:  TEVYGALSDVTSWPPFTIWPALTINNNVSVRE-QTIHNLDLCLSSESSSR-TRTKATDDIPSAVLAQSSSIPPAAPSLRCAICIEPLVEETTTKCGHIFC

Query:  RNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRLRKEKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLE
                                     ++  +  LS+F     RI              + K+MPESMEATPSV P+LDLQAVR   SELEELQRSLE
Subjt:  RNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRLRKEKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLE

Query:  EDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFT
        E+E S+TDSLGSEKLL+ECALHLESR+ Q+LSE SNVDSFLGI+DLDAY+EH+KEELV VEAESSKISNEIEVLKRTNIEDSN+L+MDLEV+KLSLDRF 
Subjt:  EDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFT

Query:  SQDPDKATSNCCSVDGED-QNVIDKRECNAFE---------------------------------VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSL
        SQDP++AT NC S++GED  NVI  RECNAFE                                 VE TIGG+KVIDVADN IRLSL THIPN+ED S+L
Subjt:  SQDPDKATSNCCSVDGED-QNVIDKRECNAFE---------------------------------VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSL

Query:  QRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSFS--------------------------------YCFEYFDQDETIICNMIG
        QRLEG+I  SELDHEL+IEVL+GTM     EIFP  VHLHDIINASKS S                                + FEY DQDE I+C+MIG
Subjt:  QRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSFS--------------------------------YCFEYFDQDETIICNMIG

Query:  GIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
        GIDA IKVSQGWPL+DSPLKLISLKSSDHYTKG+SLSLICKVE M
Subjt:  GIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

XP_022147070.1 uncharacterized protein LOC111016098 [Momordica charantia] | e value 6.4e-130 | identity 67.88%
Query:  MPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESS
        MPESMEATPSV P LDLQAVR RISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRL QILSE SNVDSFLGI+DLDAY+EH+KEELV VEAESS
Subjt:  MPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESS

Query:  KISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE---------------------------------
        KISNEIE +KR NIEDSNRL+MDLEV+KLSLDR TS+DP+KAT NC S DGEDQ N+IDKRECNAFE                                 
Subjt:  KISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE---------------------------------

Query:  VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF----------------
        VEDTIGGLKVIDVA+NFIRLSLRTHIPNLEDLSSLQRLEG+I PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKSF                
Subjt:  VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF----------------

Query:  ----------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQS
                        S+ FEY D+D TIIC MIGGI A+I+VSQGWPLSDSPLKLISLK+SDHYTKGISLSLICKVE M     ++         NL S
Subjt:  ----------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQS

Query:  PVDQIEILSTE
          D IE +  E
Subjt:  PVDQIEILSTE

XP_023528067.1 uncharacterized protein LOC111791098 isoform X1 [Cucurbita pepo subsp. pepo] | e value 8.6e-127 | identity 69.11%
Query:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEA
        EK MPESMEATPSVS +LDLQAVRSRISELEELQRSLEEDEA  TDSLGSEKLLKECALHLESRL Q+LSECSNVDSFL I+DLDAY+EH+KEELV VEA
Subjt:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEA

Query:  ESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE------------------------------
        ESSKISNEIEVLKRTNIE SN+L++DLE++ +SLDRFTSQDP+  T N CS++GEDQ NVI   ECNAFE                              
Subjt:  ESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE------------------------------

Query:  ---VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------
           VEDTIGGLKVIDVADNFIRLSLRTHIPNLED SSLQ+LEGMI PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKS              
Subjt:  ---VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------

Query:  ------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
                          S+ F+Y DQDETI+C+MIGGIDA IKVSQGWPL+DSPLKL+SLKSSDHYTKG SLSL+CKVE M
Subjt:  ------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

TrEMBL top hits (e value, % identity, alignment)
A0A5D3D0R5 Uncharacterized protein | e value 1.7e-125 | identity 77.06%
Query:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAES
        +MPESME TPSV P+LDLQAVR   SELEELQRSLEE+E SS DSLGSEKLL+ECALHLESR+ Q+LSE SNVDSFLGI+DLDAY+EH+KEELV VEAES
Subjt:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAES

Query:  SKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFEVEDTIGGLKVIDVADNFIRLSLRTHIPNLEDL
        SKISNEIEVLKRT IEDSN+L+MDLEV+KLSLDRF SQDP++AT NC S++GED+ NVI  RECNAFEVE TIGG+KVIDVADN IRLSL THIPN+ED 
Subjt:  SKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFEVEDTIGGLKVIDVADNFIRLSLRTHIPNLEDL

Query:  SSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-----------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSP
        S+LQRLEG+I  SELDHEL+IEV  GTM     EIFP  VHLHDIINASKS            S+ FEY DQDE I+C+MIGGIDA IKVSQGWPL+DSP
Subjt:  SSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-----------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSP

Query:  LKLISLKSSDHYTKGISLSLICKVELM
        LKLISLKSSDHYTKGISLSLICKVE M
Subjt:  LKLISLKSSDHYTKGISLSLICKVELM

A0A6J1CZ44 uncharacterized protein LOC111016098 | e value 3.1e-130 | identity 67.88%
Query:  MPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESS
        MPESMEATPSV P LDLQAVR RISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRL QILSE SNVDSFLGI+DLDAY+EH+KEELV VEAESS
Subjt:  MPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESS

Query:  KISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE---------------------------------
        KISNEIE +KR NIEDSNRL+MDLEV+KLSLDR TS+DP+KAT NC S DGEDQ N+IDKRECNAFE                                 
Subjt:  KISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE---------------------------------

Query:  VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF----------------
        VEDTIGGLKVIDVA+NFIRLSLRTHIPNLEDLSSLQRLEG+I PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKSF                
Subjt:  VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF----------------

Query:  ----------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQS
                        S+ FEY D+D TIIC MIGGI A+I+VSQGWPLSDSPLKLISLK+SDHYTKGISLSLICKVE M     ++         NL S
Subjt:  ----------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQS

Query:  PVDQIEILSTE
          D IE +  E
Subjt:  PVDQIEILSTE

A0A6J1E9M5 uncharacterized protein LOC111432106 isoform X1 | e value 1.3e-123 | identity 68.13%
Query:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIED----LDAYIEHLKEELV
        +KIMPESMEATPSVS +LDLQAVRSRI ELEELQRSL EDEA STDSLGSEKLLKECALHLESRL Q+LSECSNVDSFLGI+D    LDAY+EH+KEELV
Subjt:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIED----LDAYIEHLKEELV

Query:  TVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE--------------------------
         VEAESS+ISNEIEVLKRTNIE SN+L+++LE++ +SLDRFTSQDP+K T N CS++GEDQ NVI  RE NAFE                          
Subjt:  TVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE--------------------------

Query:  -------VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF---------
               VEDTIGGLKVI VADNFIRLSLRTHIPNLED SSLQRLEGMI PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKS          
Subjt:  -------VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF---------

Query:  ----------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
                              S+ F+Y DQDETI+C MIGGIDA IKVSQGWPL+DSPLKL+SLKSSDHYTKG SLSL+CKVE M
Subjt:  ----------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

A0A6J1E9V8 uncharacterized protein LOC111432106 isoform X3 | e value 2.3e-125 | identity 68.85%
Query:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEA
        +KIMPESMEATPSVS +LDLQAVRSRI ELEELQRSL EDEA STDSLGSEKLLKECALHLESRL Q+LSECSNVDSFLGI+DLDAY+EH+KEELV VEA
Subjt:  EKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEA

Query:  ESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE------------------------------
        ESS+ISNEIEVLKRTNIE SN+L+++LE++ +SLDRFTSQDP+K T N CS++GEDQ NVI  RE NAFE                              
Subjt:  ESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE------------------------------

Query:  ---VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------
           VEDTIGGLKVI VADNFIRLSLRTHIPNLED SSLQRLEGMI PSEL+HELLIEVLEGTM     EIFPG VHLHDIINASKS              
Subjt:  ---VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF-------------

Query:  ------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
                          S+ F+Y DQDETI+C MIGGIDA IKVSQGWPL+DSPLKL+SLKSSDHYTKG SLSL+CKVE M
Subjt:  ------------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

A0A6J1F0V9 uncharacterized protein LOC111441363 | e value 5.8e-121 | identity 67.89%
Query:  MPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESS
        M E MEATPSVSP++D+QAVRS ISELEELQRSLEEDEA +TDSLGS KLLKEC+L LESRL Q LSE SNVDSFLGI+DLDAY+E +KEEL+ VEAESS
Subjt:  MPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESS

Query:  KISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE---------------------------------
        KISNEIEVLKRT+IEDSN+L+MDLEV+KLSLDR  SQDP+KAT NC SV+GEDQ ++I +RECNAFE                                 
Subjt:  KISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQ-NVIDKRECNAFE---------------------------------

Query:  VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF----------------
        VEDTIGGLKVIDV DNFIRLSL +HIPNLE+ SSLQRLEGMI PSELDHELLIEVLEGTM     EIFPG VHLHDIINASKSF                
Subjt:  VEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTM-----EIFPGAVHLHDIINASKSF----------------

Query:  ----------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM
                        S+ FEY DQDETIIC MIGGIDA+IKV QGWPLSDSPLKLISLKSSDHY  G+SLSLICKVE M
Subjt:  ----------------SYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELM

SwissProt top hits (e value, % identity, alignment)
P38398 Breast cancer type 1 susceptibility protein | e value 1.5e-04 | identity 25.97%
Query:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK
        L C IC+E + E  +TKC HIFC+ C+            L+ + G  + P+  ++ +    Q      ++VEE L I+C F+L             +KE 
Subjt:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK

Query:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK
          PE ++   S+  ++  +    R+ + E    SL+E   S    +LG+ + L+
Subjt:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK

Q6J6I8 Breast cancer type 1 susceptibility protein homolog | e value 1.9e-04 | identity 25.97%
Query:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK
        L C IC+E + E  +TKC HIFC+ C+            L+ + G  + P+  ++ +    Q      ++VEE L I+C F+L             +KE 
Subjt:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK

Query:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK
          PE ++   S+  +   ++   R+ + E    SL+E   S    +LG+ + L+
Subjt:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK

Q6J6J0 Breast cancer type 1 susceptibility protein homolog | e value 1.5e-04 | identity 25.97%
Query:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK
        L C IC+E + E  +TKC HIFC+ C+            L+ + G  + P+  ++ +    Q      ++VEE L I+C F+L             +KE 
Subjt:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK

Query:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK
          PE ++   S+  ++  +    R+ + E    SL+E   S    +LG+ + L+
Subjt:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK

Q9GKK8 Breast cancer type 1 susceptibility protein homolog | e value 1.5e-04 | identity 25.97%
Query:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK
        L C IC+E + E  +TKC HIFC+ C+            L+ + G  + P+  ++ +    Q      ++VEE L I+C F+L             +KE 
Subjt:  LRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPGLHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRL-------------RKEK

Query:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK
          PE ++   S+  ++  +    R+ + E    SL+E   S    +LG+ + L+
Subjt:  IMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEAS-STDSLGSEKLLK

Arabidopsis top hits (e value, % identity, alignment)
AT3G23910.1 BEST Arabidopsis thaliana protein match is: RNA-directed DNA polymerase (reverse transcriptase)-related family protein (TAIR:AT3G24255.2) | e value 7.2e-55 | identity 36.6%
Query:  NLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIEVLKRTN
        +LDLQ +R R+ EL+   R+  E+   S  S     ++++  L  E ++ +I+ E  +VD  L +ED DAY+E+L+ EL +VEAES+K+S EIE L +++
Subjt:  NLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIEVLKRTN

Query:  IEDSNRLQMDLEVVKLSLDRFTSQDPDKATSN---------CCSVDG----------------------EDQNVIDKRECNAFEVEDTIGGLKVIDVADN
         +DS+RLQ DLE + LSLD  +SQD +K+  N         C  +D                       ED + + KR   A +VED + GLKV++   N
Subjt:  IEDSNRLQMDLEVVKLSLDRFTSQDPDKATSN---------CCSVDG----------------------EDQNVIDKRECNAFEVEDTIGGLKVIDVADN

Query:  FIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEI-----FPGAVHLHDIINA------------------------------------
        FIRL LRT+I  L+      + + +  PSEL HELLI + + T EI     FP  +++ DII A                                    
Subjt:  FIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEI-----FPGAVHLHDIINA------------------------------------

Query:  -------SKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQSPVDQI
               SK+  Y FEY+D+DETI+ ++ GGIDA +KVS GWPL ++PLKL SLK+SD+ +KGISLSLICKVE       + + L L    NL   +D I
Subjt:  -------SKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQSPVDQI

Query:  EILSTETATYIIVAEKSS
        E +  E     + + KSS
Subjt:  EILSTETATYIIVAEKSS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value 4.8e-43 | identity 36%
Query:  DAYIEHLKEELVTVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSN---------CCSVDG---------------------
        DAY+E+L+ EL +VEAES+K+S EIE L +++  DS+RLQ DLE + LSLD  +SQD +K+  N         C  +D                      
Subjt:  DAYIEHLKEELVTVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSN---------CCSVDG---------------------

Query:  -EDQNVIDKRECNAFEVEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEI-----FPGAVHLHDIINA----
         ED + + KR   A +VED + GLKV++   NFIRL LRT+I  L+      + + +  PSEL HELLI + + T EI     FP  +++ DII A    
Subjt:  -EDQNVIDKRECNAFEVEDTIGGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEI-----FPGAVHLHDIINA----

Query:  ---------------------------------------SKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSL
                                               SK+  Y FEY+D+DETI+ ++ GGIDA +KVS GWPL ++PLKL SLK+SD+ +KG SLSL
Subjt:  ---------------------------------------SKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSL

Query:  ICKVELMVLPTSIKSPLYLLGISNLQSPVDQIEILSTETATYIIVAEKSS
        I K+E       + + L L    NL   +D +E +  +     + + +SS
Subjt:  ICKVELMVLPTSIKSPLYLLGISNLQSPVDQIEILSTETATYIIVAEKSS

AT3G24255.2 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value 1.9e-47 | identity 33.65%
Query:  NLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIED-------LDAYIEHLKEELVTVEAESSKISNEI
        +LDLQ +R R+ E +   R+  E+   S  S     ++++  L  E ++ +I+ +  +VD  L ++         DAY+E+L+ EL +VEAES+K+S EI
Subjt:  NLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESRLLQILSECSNVDSFLGIED-------LDAYIEHLKEELVTVEAESSKISNEI

Query:  EVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSN---------CCSVDG----------------------EDQNVIDKRECNAFEVEDTIGGLK
        E L +++  DS+RLQ DLE + LSLD  +SQD +K+  N         C  +D                       ED + + KR   A +VED + GLK
Subjt:  EVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSN---------CCSVDG----------------------EDQNVIDKRECNAFEVEDTIGGLK

Query:  VIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEI-----FPGAVHLHDIINA-----------------------------
        V++   NFIRL LRT+I  L+      + + +  PSEL HELLI + + T EI     FP  +++ DII A                             
Subjt:  VIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEI-----FPGAVHLHDIINA-----------------------------

Query:  --------------SKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNL
                      SK+  Y FEY+D+DETI+ ++ GGIDA +KVS GWPL ++PLKL SLK+SD+ +KG SLSLI K+E       + + L L    NL
Subjt:  --------------SKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDSPLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNL

Query:  QSPVDQIEILSTETATYIIVAEKSS
           +D +E +  +     + + +SS
Subjt:  QSPVDQIEILSTETATYIIVAEKSS

AT5G48655.1 RING/U-box superfamily protein | e value 1.0e-08 | identity 27.27%
Query:  RRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVYGALSDVTSWPPFTI
        RR     DLN  P D+                     T+ + D +++DV   S   FAEA K+  RN       V +   G T                 
Subjt:  RRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVYGALSDVTSWPPFTI

Query:  WPALTINNNVSVREQTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSI---PPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSP
        +PA     N+S + + I + +  +  E +S       D++  +     S     PP  P   C IC+ P  EE +TKCGHIFC+ CI+ AI+ Q + P
Subjt:  WPALTINNNVSVREQTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSI---PPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSP

AT5G48655.2 RING/U-box superfamily protein | e value 1.0e-08 | identity 27.27%
Query:  RRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVYGALSDVTSWPPFTI
        RR     DLN  P D+                     T+ + D +++DV   S   FAEA K+  RN       V +   G T                 
Subjt:  RRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVYGALSDVTSWPPFTI

Query:  WPALTINNNVSVREQTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSI---PPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSP
        +PA     N+S + + I + +  +  E +S       D++  +     S     PP  P   C IC+ P  EE +TKCGHIFC+ CI+ AI+ Q + P
Subjt:  WPALTINNNVSVREQTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSI---PPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSP
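
Each hit above reports a percent identity alongside a BLAST-style alignment. In these alignments the unlabeled middle line shows a letter where the two sequences are identical and a `+` where the substitution is conservative. The sketch below shows how identity and positive percentages could be derived from the query and middle lines of one alignment block. The function name `alignment_stats` is hypothetical, and the denominator is taken to be the number of alignment columns, gaps included. The example uses only the final block of the A0A5D3D0R5 hit, so its result is not expected to match the identity reported for the whole hit.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    A letter in the midline marks an identical residue; '+' marks a
    conservative substitution. Trailing spaces stripped from the midline
    are restored by padding it to the query length.
    """
    length = len(query)
    mid = midline.ljust(length)[:length]
    identical = sum(1 for ch in mid if ch.isalpha())
    positives = identical + mid.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "pct_identity": 100.0 * identical / length,
        "pct_positives": 100.0 * positives / length,
    }


# Final alignment block of the A0A5D3D0R5 hit, copied from the page.
query = "LKLISLKSSDHYTKGISLSLICKVELM"
midline = "LKLISLKSSDHYTKGISLSLICKVE M"
stats = alignment_stats(query, midline)
print(stats["identical"], stats["length"])  # prints: 26 27
```

To reproduce a per-hit figure, the counts from all blocks of that hit would need to be summed before dividing.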


Sequences
CDS sequence
ATGAGAAAGGCTGTTGGAATGAGCATTCAAAGTACAAATGATCTACGTGAATGGAGTGCAAGAGATGTTGAGAGAAGAACAGTAATGGACTTTGACCTAAATTGTCCACC
TCCAGATGAGTACATTGATCCAACTGGCCCTCATGAGGAAGAAGCACAGTACTATAATCATTACCAAGGACAAACTATCGACAATCCTGACGTTGTCGATGAGGACGTTG
CTATAATCTCGCCAAGGAAATTTGCTGAAGCGAGGAAGAATTTTCGAAGAAACCATTTTGAGAGTAGCTGTGGTGTAACAATCAGACTTAATGGCAACACAGAAGTTTAT
GGTGCTCTCTCAGATGTAACAAGTTGGCCCCCTTTTACAATTTGGCCGGCTCTGACAATTAACAATAATGTATCTGTACGGGAACAAACAATTCACAACTTGGACCTTTG
CTTGAGCTCTGAAAGCAGTAGCAGGACCAGGACTAAGGCAACTGATGACATTCCTTCAGCAGTACTTGCACAAAGTAGTAGCATCCCACCTGCAGCCCCGAGTCTGCGGT
GCGCGATCTGCATAGAACCGTTGGTTGAGGAAACAACAACAAAATGTGGGCACATTTTCTGCAGGAATTGCATCGAGACAGCCATAGCTACGCAGCACAGATCGCCCGGA
CTCCATTTTCGGTCTGGTTCAAAGAAGTCCCCTGTGCGTAGTGACGAATACTCCCCACTTCTCTCTCAATTTGATTTTCTGCACAGACGGATAGTTGAGGAATTCCTTTC
GATTCTGTGCAGATTCCGGCTGCGGAAGGAGAAAATAATGCCAGAATCGATGGAAGCTACACCGTCTGTATCCCCAAACCTCGATCTCCAAGCAGTTCGCAGTCGCATAA
GCGAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAAGATGAAGCTTCTTCGACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTCTCCATCTCGAGAGCAGA
CTACTGCAGATTCTGTCAGAATGCTCTAACGTTGATAGTTTCTTGGGGATTGAAGATTTAGATGCGTATATTGAACATTTGAAAGAGGAACTCGTCACGGTGGAGGCTGA
AAGCAGCAAAATCTCCAATGAGATTGAGGTTCTTAAGAGAACCAATATAGAAGATTCTAATAGATTACAGATGGATCTTGAAGTAGTAAAATTGTCATTAGATCGTTTTA
CATCACAGGATCCAGACAAGGCAACATCTAATTGCTGCTCTGTGGATGGTGAAGATCAAAACGTGATAGACAAGCGTGAATGCAATGCTTTTGAGGTTGAGGACACAATT
GGTGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCGTTACGAACACATATTCCAAACTTGGAAGATTTATCAAGCTTACAGAGACTAGAAGGTATGAT
TGTGCCATCCGAATTGGATCACGAGTTGCTAATTGAAGTTTTGGAGGGGACAATGGAAATCTTTCCTGGTGCTGTCCACTTGCACGATATCATCAACGCTTCAAAGTCAT
TCAGTTATTGCTTTGAGTATTTCGATCAAGATGAAACAATAATATGTAATATGATTGGGGGAATTGACGCACTTATTAAGGTGTCTCAAGGGTGGCCATTATCTGATTCT
CCTTTGAAGCTTATATCTCTCAAGAGCTCAGACCATTATACAAAAGGAATTTCTTTAAGCCTCATTTGCAAGGTGGAGTTGATGGTTCTACCTACTTCAATCAAATCTCC
ATTGTATCTACTGGGTATTTCAAATTTACAATCACCGGTAGATCAAATCGAGATCCTTTCAACAGAGACAGCCACATATATTATTGTGGCGGAGAAGTCATCGTACCCAC
AACATCAGCTACAGTCTACACAATCAATTTTCGAAAAGAAGAAATCAACCATCACCTTCATCGTGGGGATTGAGCCTGAAATGTTGTAA
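
The CDS above and the protein sequence listed at the end of this page are related by translation. As a minimal sketch, assuming the standard genetic code, the first and last codons of the CDS can be translated and compared with the start (`MRKAVGM`) and end (`EML`) of the protein sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA coding sequence into a protein string."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGAAAGGCTGTTGGAATG"))  # first 21 nt of the CDS; prints: MRKAVGM
print(translate("GAAATGTTGTAA"))           # last 12 nt of the CDS; prints: EML
```

Passing the full CDS (with line breaks, which the function strips) should yield the complete protein sequence if the gene model is in frame throughout.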
mRNA sequence
ATGAGAAAGGCTGTTGGAATGAGCATTCAAAGTACAAATGATCTACGTGAATGGAGTGCAAGAGATGTTGAGAGAAGAACAGTAATGGACTTTGACCTAAATTGTCCACC
TCCAGATGAGTACATTGATCCAACTGGCCCTCATGAGGAAGAAGCACAGTACTATAATCATTACCAAGGACAAACTATCGACAATCCTGACGTTGTCGATGAGGACGTTG
CTATAATCTCGCCAAGGAAATTTGCTGAAGCGAGGAAGAATTTTCGAAGAAACCATTTTGAGAGTAGCTGTGGTGTAACAATCAGACTTAATGGCAACACAGAAGTTTAT
GGTGCTCTCTCAGATGTAACAAGTTGGCCCCCTTTTACAATTTGGCCGGCTCTGACAATTAACAATAATGTATCTGTACGGGAACAAACAATTCACAACTTGGACCTTTG
CTTGAGCTCTGAAAGCAGTAGCAGGACCAGGACTAAGGCAACTGATGACATTCCTTCAGCAGTACTTGCACAAAGTAGTAGCATCCCACCTGCAGCCCCGAGTCTGCGGT
GCGCGATCTGCATAGAACCGTTGGTTGAGGAAACAACAACAAAATGTGGGCACATTTTCTGCAGGAATTGCATCGAGACAGCCATAGCTACGCAGCACAGATCGCCCGGA
CTCCATTTTCGGTCTGGTTCAAAGAAGTCCCCTGTGCGTAGTGACGAATACTCCCCACTTCTCTCTCAATTTGATTTTCTGCACAGACGGATAGTTGAGGAATTCCTTTC
GATTCTGTGCAGATTCCGGCTGCGGAAGGAGAAAATAATGCCAGAATCGATGGAAGCTACACCGTCTGTATCCCCAAACCTCGATCTCCAAGCAGTTCGCAGTCGCATAA
GCGAGCTAGAAGAGTTGCAGAGATCTTTGGAGGAAGATGAAGCTTCTTCGACGGATTCATTAGGTTCTGAGAAGTTACTGAAGGAGTGTGCTCTCCATCTCGAGAGCAGA
CTACTGCAGATTCTGTCAGAATGCTCTAACGTTGATAGTTTCTTGGGGATTGAAGATTTAGATGCGTATATTGAACATTTGAAAGAGGAACTCGTCACGGTGGAGGCTGA
AAGCAGCAAAATCTCCAATGAGATTGAGGTTCTTAAGAGAACCAATATAGAAGATTCTAATAGATTACAGATGGATCTTGAAGTAGTAAAATTGTCATTAGATCGTTTTA
CATCACAGGATCCAGACAAGGCAACATCTAATTGCTGCTCTGTGGATGGTGAAGATCAAAACGTGATAGACAAGCGTGAATGCAATGCTTTTGAGGTTGAGGACACAATT
GGTGGTCTGAAGGTCATTGATGTTGCTGATAATTTCATTAGATTGTCGTTACGAACACATATTCCAAACTTGGAAGATTTATCAAGCTTACAGAGACTAGAAGGTATGAT
TGTGCCATCCGAATTGGATCACGAGTTGCTAATTGAAGTTTTGGAGGGGACAATGGAAATCTTTCCTGGTGCTGTCCACTTGCACGATATCATCAACGCTTCAAAGTCAT
TCAGTTATTGCTTTGAGTATTTCGATCAAGATGAAACAATAATATGTAATATGATTGGGGGAATTGACGCACTTATTAAGGTGTCTCAAGGGTGGCCATTATCTGATTCT
CCTTTGAAGCTTATATCTCTCAAGAGCTCAGACCATTATACAAAAGGAATTTCTTTAAGCCTCATTTGCAAGGTGGAGTTGATGGTTCTACCTACTTCAATCAAATCTCC
ATTGTATCTACTGGGTATTTCAAATTTACAATCACCGGTAGATCAAATCGAGATCCTTTCAACAGAGACAGCCACATATATTATTGTGGCGGAGAAGTCATCGTACCCAC
AACATCAGCTACAGTCTACACAATCAATTTTCGAAAAGAAGAAATCAACCATCACCTTCATCGTGGGGATTGAGCCTGAAATGTTGTAA
Protein sequence
MRKAVGMSIQSTNDLREWSARDVERRTVMDFDLNCPPPDEYIDPTGPHEEEAQYYNHYQGQTIDNPDVVDEDVAIISPRKFAEARKNFRRNHFESSCGVTIRLNGNTEVY
GALSDVTSWPPFTIWPALTINNNVSVREQTIHNLDLCLSSESSSRTRTKATDDIPSAVLAQSSSIPPAAPSLRCAICIEPLVEETTTKCGHIFCRNCIETAIATQHRSPG
LHFRSGSKKSPVRSDEYSPLLSQFDFLHRRIVEEFLSILCRFRLRKEKIMPESMEATPSVSPNLDLQAVRSRISELEELQRSLEEDEASSTDSLGSEKLLKECALHLESR
LLQILSECSNVDSFLGIEDLDAYIEHLKEELVTVEAESSKISNEIEVLKRTNIEDSNRLQMDLEVVKLSLDRFTSQDPDKATSNCCSVDGEDQNVIDKRECNAFEVEDTI
GGLKVIDVADNFIRLSLRTHIPNLEDLSSLQRLEGMIVPSELDHELLIEVLEGTMEIFPGAVHLHDIINASKSFSYCFEYFDQDETIICNMIGGIDALIKVSQGWPLSDS
PLKLISLKSSDHYTKGISLSLICKVELMVLPTSIKSPLYLLGISNLQSPVDQIEILSTETATYIIVAEKSSYPQHQLQSTQSIFEKKKSTITFIVGIEPEML