; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026149 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026149
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr10:30679393..30687474
RNA-Seq ExpressionLag0026149
SyntenyLag0026149
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022142953.1 uncharacterized protein LOC111012947 [Momordica charantia]1.4e-11636.02Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILE-RPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV
        NPI++A+ R+R +RDYA  + + LN  +    P DA  FE KP+M QMLN +GQF GL +EDP  H+K F++V N+F+L G+ +D L+L LF FSL G  
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILE-RPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV

Query:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK
         AWL++FP ++I + +D+ +KFL K+FP  +N+  + EII+FRQ  NE ++ AWERF+ L                            ++N +ANG F  
Subjt:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK

Query:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN-QTNIAPANAVSSPCCEICDEENIADSCPLNPASVF
        KS NE   ILD ++ +N  W   +P T  K  +P   +  +   +MQ QI  I  M+  M  +N    +APA    SP  +I +      +C +      
Subjt:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN-QTNIAPANAVSSPCCEICDEENIADSCPLNPASVF

Query:  YVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE-------
                NQ ++N YS +YNPGW+QHPNFSW GQ  SS +  +Q  KQ              PTPQQ  +QK + Q A Q   SN+E LM E       
Subjt:  YVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE-------

Query:  ---------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKIC
                             Y+ RND  VRNLE+QLGQ+A E++ RPQG+L S TE P R                              P  +REK  
Subjt:  ---------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKIC

Query:  EKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQ
               T++            P+ + E E  + +  +  N                  R P  +PQR    +QD  F+KFLD+LKQLHIN+P V+ALEQ
Subjt:  EKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQ

Query:  MPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSL
        MPTY KFLKDI+++K+                                        ++VGRA C LGA INLMP S+FK+L IG A PTTVTL LADRS+
Subjt:  MPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSL

Query:  THPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEI
        T P GKIED+LVKVDKF+F ADFIILDCEADK                               +++TFN++ AMKYP D +EC +I     ++   L ++
Subjt:  THPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEI

Query:  VRRETLEDILGGEELDEKG
        +  E   ++   EE +++G
Subjt:  VRRETLEDILGGEELDEKG

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia]9.2e-13236.56Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILER-PIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV
        NPI++A+ R+R +RDYA  + + LN  ++   P DA +FE KP+M QMLN +GQF GL +EDP  H+K F++V N+F+L G+ +D L+L LF FS+ G  
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILER-PIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV

Query:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK
         AWL++FP ++IT+W+D+ +KFL K+FP  +N+  + EII+FRQ  NE ++ AWERF+ L                            ++N +ANG F  
Subjt:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK

Query:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN--------QTNIAPANAVSSPCCEICDEENIADSCP
        KS NE   ILD ++ +N  W   +  T  K  +P   +  +   +MQ QI  I  M+  M  +N         TN +P   ++   C  C + + +++CP
Subjt:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN--------QTNIAPANAVSSPCCEICDEENIADSCP

Query:  LNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE
         NP+S++YV   G  NQ ++N YS  YNPGW+QHPNFSW GQ  S+ + ++Q  K+              PTP Q  +QK + Q A Q   SN+E LM E
Subjt:  LNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE

Query:  ----------------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPV
                                    Y+ RND  VR LE+QLGQ+  E++ RPQG+L S TE P R GKE C ++  RS L+Y+GP+ P  S      
Subjt:  ----------------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPV

Query:  VAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMP
                              E +    P+ + E    + +  +  N+                 RPP  +PQR    +QD  F+KFLD+LKQLHIN+P
Subjt:  VAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMP

Query:  LVKALEQMPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTL
         V+ALEQMPTY KF+KDI+++K+                                        ++VGRA C LGASINLMP S+FK+  IG A PTTVTL
Subjt:  LVKALEQMPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTL

Query:  QLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELS
        QLADRS+T P GKIED+LVKVDKF+F  DFIILDCEADK                               +++TFN++ AMKY  D +EC++I     ++
Subjt:  QLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELS

Query:  RSTLQEIVRRETLEDILGGEELDEKG
           L +++  E   ++   EE +++G
Subjt:  RSTLQEIVRRETLEDILGGEELDEKG

XP_024949903.1 uncharacterized protein LOC112496715 [Citrus sinensis]3.9e-11438.66Show/hide
Query:  RPEPENPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSL
        +P   N I++A DR++ IRDYA+   Q ++PGI+   + A  FELKPVMFQML T+GQF+GLP++D H H+KLFL V ++FK+ G  ++ L+L+LF FSL
Subjt:  RPEPENPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSL

Query:  QGDVEAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANG
        +    AWL+S PP+SIT+W+DLA+KFL K+FP  KN+K + EI +F Q  +E L  AWERF+ L                            +V+ASANG
Subjt:  QGDVEAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANG

Query:  SFLKKSANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASV
        + L KS NEA+ IL+ IA NN  W         +        +A   + AQ+ ++  M+  M+    T  A  N +S   C  C E ++ D+CP NP SV
Subjt:  SFLKKSANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASV

Query:  FYVRPQGNQN-QNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQQVAVQS-QESNLETLMNEYIARNDAAV-------RNLE
         YV   GN N QN+ N YS  YN GWRQH NFSW  Q   + + + Q      P   Q+  +Q+++ + Q S+LE L+ +YI RN+A V       RNLE
Subjt:  FYVRPQGNQN-QNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQQVAVQS-QESNLETLMNEYIARNDAAV-------RNLE

Query:  VQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEK-KSDNSTEIAKNKEENEKTKEPEVLAEKEEKL
         Q+GQ+A  + NRPQG+L S TE P REGKE CK + LRS          ++ E   P+ A++ +     S  S + A   EE       ++   KE+  
Subjt:  VQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEK-KSDNSTEIAKNKEENEKTKEPEVLAEKEEKL

Query:  KLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRRNVGRAQCYLGASINLM
                       +P +      +R P  + QRF+   QD QF K L VLKQLHIN+  ++ LEQM  Y KFLKDIL+KKRR        +    + M
Subjt:  KLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRRNVGRAQCYLGASINLM

Query:  PYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADKE-------------------------------EITFNVMRA
          S   Q  + D  PTTV+LQLADRS  +P G+I+D+LVKVDKF+F  DFI+LD EADKE                               ++TFNV+  
Subjt:  PYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADKE-------------------------------EITFNVMRA

Query:  MKYPGDFKECSMIDEIDELSRSTLQEIVRRETLEDILGGEELDEK
        MK P + ++C+ I+ +D +    L     +E + +++  EELD +
Subjt:  MKYPGDFKECSMIDEIDELSRSTLQEIVRRETLEDILGGEELDEK

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]3.9e-11445.61Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVE
        NPI +A+DR R IR+YA P+F  LNPGI+   I AP FELKPVMFQML T+GQF G P EDPH HI+ FL V +SFKL GV E+ L+LKLF FSL+    
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVE

Query:  AWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKK
        AWL++ PP+S+T+WNDLAEKFL K+FP  +N+K+++EI++F+QS +E    AWERF+ L                            +++ASANG+ L K
Subjt:  AWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKK

Query:  SANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASVFYVRP
        S NEA  IL+ IA+NN  W  N   T  K     +E +A   + AQ+ ++ +++  M+M      A A   +   C  C + +  ++CP N ASV YV  
Subjt:  SANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASVFYVRP

Query:  QGNQNQNRWNH-YSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQQVAVQSQESNLETLMNEYIARND-------AAVRNLEVQLGQI
         GNQN NR N+ YS  YNP W+ HPNFSWGGQ   S      F +QP PQQ  +        SQ S+LE+LM +Y+A+ND       A++RNLEVQLGQ+
Subjt:  QGNQNQNRWNH-YSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQQVAVQSQESNLETLMNEYIARND-------AAVRNLEVQLGQI

Query:  AQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQG
        A ++KNRPQGTL S TE P R+GKE CKAVTLRS       K  +S+ A              +  S E +  ++E E  K+P   A +   +     Q 
Subjt:  AQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQG

Query:  NATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKD
        +A   +  K           PP  +PQRFK    D QF++FLDVLKQLHIN+PLV+ALEQMPTYVKFLKD
Subjt:  NATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKD

XP_030505184.1 uncharacterized protein LOC115720166 [Cannabis sativa]2.4e-13241.36Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVE
        +PI + +DR R IR+YA P+F  LNPGI+   I APQFELKPVMFQML T+GQFS +P EDPH H++ FL + +SFK+ GV E+  +LKLF FSL+    
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVE

Query:  AWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKK
        +WL++  P+S+T+WND AEKFL K+FP  +N+K+++EI++F Q  +E    AWERF+ L                            +++ASANG+ L K
Subjt:  AWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKK

Query:  SANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSP--CCEICDEENIADSCPLNPASVFYV
        S NEA  IL+TIA+NN  W  N      +     +E +A   +  Q+ ++ +++  +S+ N  NI PA A+ S    C  C E +  + CP NP SV Y+
Subjt:  SANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSP--CCEICDEENIADSCPLNPASVFYV

Query:  RPQGNQNQNRWN-HYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQ--QKQKKFQQVAVQSQESNLETLMNEYIARNDAAV-------RNLEVQ
           GNQN NR N  +S  YN  W+ HPN SWG +   S   +   G+Q  P    +Q +  Q A  SQ S+LE+LM +Y+A+NDA +       RNLE+Q
Subjt:  RPQGNQNQNRWN-HYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQ--QKQKKFQQVAVQSQESNLETLMNEYIARNDAAV-------RNLEVQ

Query:  LGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLE
        LG +A E+K RPQG+L S TE P R+GKEQCK++ LRS       K+ ++SE EI          K S   T I  +++ ++KT +     E  +  +++
Subjt:  LGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLE

Query:  KKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRRNVGRAQCYLGASINLM---
           G  + +  + P    +    +PPL +PQRF+   QD QFKKFLDVLKQLHIN+PLV+ALEQMP YVKFLKDIL+KKRR +G  +  L      M   
Subjt:  KKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRRNVGRAQCYLGASINLM---

Query:  --------------PYSV-FKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK--------------------------
                      P S+  + LGIG+ARPTTVTLQLADRS+ HP GKIED+LV+VDKF+F ADFIILD E D+                          
Subjt:  --------------PYSV-FKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK--------------------------

Query:  -----EEITFNVMRAMKYPGDFKECSMIDEID----ELSRSTLQEIVRRE-TLEDILGGEELDE
             E+ TF V R ++ P    EC  I ++D    E S+  + + VR++  L++I    E  E
Subjt:  -----EEITFNVMRAMKYPGDFKECSMIDEID----ELSRSTLQEIVRRE-TLEDILGGEELDE

TrEMBL top hitse value%identityAlignment
A0A6A2WLX1 Reverse transcriptase3.2e-9833.55Show/hide
Query:  PVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVEAWLDSFPPNSITSWNDLA
        P+   LNPGI+   I A  FE+KPVMF MLN++GQF G+P ED  +HI+ FL VC+SF+  GV ED LKLKLF +SL+    AWL   P  S+ SW DL 
Subjt:  PVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVEAWLDSFPPNSITSWNDLA

Query:  EKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKKSANEAHTILDTIATNNRH
        + FL ++ P N N++ + EI +FRQ  +E +   W+R++ L                            L++ASANG+ L KS  EA  ILD IA N+  
Subjt:  EKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKKSANEAHTILDTIATNNRH

Query:  WGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASVFYVRPQGNQNQNRWNHYSAMYNP
        +  +   +  + P  A E EA  ++ AQ+  I +M+  +  S         +++   C +C   +    CP N  S+ +V   GN N+   N YS  YN 
Subjt:  WGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASVFYVRPQGNQNQNRWNHYSAMYNP

Query:  GWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQ------------------------QVAVQSQESNLETLMNEY-----------------IAR
        GWRQHPNFSW  Q G+ N+N      QPT QQ   + Q                        +  +Q   S  +T++ ++                 I  
Subjt:  GWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQ------------------------QVAVQSQESNLETLMNEY-----------------IAR

Query:  NDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEV
        + +++R LE Q+GQIA  ++ R QG L S TE     GKE C  +TLRS  + +                       + D   + AK  + + K KE  +
Subjt:  NDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEV

Query:  LAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKR---------
         A KE                             RPP  +PQR K  + +VQFKKF+D+L QLHIN+PL++A+EQMP Y KF+KDI +KKR         
Subjt:  LAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKR---------

Query:  ------------RN---------------VGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCE
                    RN               VG+A C LG+S+NL+P S+F +LGIGDARPT+V LQLAD+S     G++ED++V+VDKFVF  DF+ILDCE
Subjt:  ------------RN---------------VGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCE

Query:  ADKEE-------------------------------ITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEIVRRETLE
         D +                                +T NV R +KY  D +EC  I E++ +     + + +   ++
Subjt:  ADKEE-------------------------------ITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEIVRRETLE

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.0e-11236.25Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVE
        N I +  + NR +RDY +P+ Q L+  I    I+A  FE+KP   QM+ +  QFSGLP++DP+ H+  FL +C++FK  GV +D ++L+LF FSL+   +
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVE

Query:  AWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKK
        +WL+S P  SIT+W DLA+KFL KFFP  K +K + +I +F Q   E L  AWERF+ L                            +++A+A G+ + K
Subjt:  AWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLKK

Query:  SANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASVFYVRP
        +A +A+ +L+ +A+NN  W      +  +  V A E +A  T+  Q+ A+   +  + +    N       S   CE+C + +  D CP N  SV +V  
Subjt:  SANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCPLNPASVFYVRP

Query:  QGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQ---FGKQPTPQQKQKKFQQVAVQSQESNLETLMNEYIARND-------AAVRNLEVQLG
         GN N+ + N YS  YNPGWR HPNFSW   AG SN        F +Q  PQ  +KK          S LE L+ +YI++ D       A++RNLE Q+G
Subjt:  QGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQ---FGKQPTPQQKQKKFQQVAVQSQESNLETLMNEYIARND-------AAVRNLEVQLG

Query:  QIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKK
        Q+A  I NRPQG+L S T+  + +GKEQC+A+TLRS  E +G    ++ E+EI  V +E +CE    N  EI +  +                    +K 
Subjt:  QIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKK

Query:  QGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRR---------------------
        +   T+                PP  +PQR +    + QF+KFL+V K+LHIN+P  +ALEQMP+YVKFLKDILSKKR+                     
Subjt:  QGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRR---------------------

Query:  --------------NVG-----RAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK------
                       +G     +A   LGASINLMP+S+F++LG+G+ +PT+VTLQLADRS  +P G IED+LVKVDKF+F  DF+ILD E D+      
Subjt:  --------------NVG-----RAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK------

Query:  -------------------------EEITFNVMRAMKYPGDFKECSMIDEIDE
                                 E + FN+  A K+P     C  ++ IDE
Subjt:  -------------------------EEITFNVMRAMKYPGDFKECSMIDEIDE

A0A6J1CPJ3 uncharacterized protein LOC1110129473.4e-11636.02Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILE-RPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV
        NPI++A+ R+R +RDYA  + + LN  +    P DA  FE KP+M QMLN +GQF GL +EDP  H+K F++V N+F+L G+ +D L+L LF FSL G  
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILE-RPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV

Query:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK
         AWL++FP ++I + +D+ +KFL K+FP  +N+  + EII+FRQ  NE ++ AWERF+ L                            ++N +ANG F  
Subjt:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK

Query:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN-QTNIAPANAVSSPCCEICDEENIADSCPLNPASVF
        KS NE   ILD ++ +N  W   +P T  K  +P   +  +   +MQ QI  I  M+  M  +N    +APA    SP  +I +      +C +      
Subjt:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN-QTNIAPANAVSSPCCEICDEENIADSCPLNPASVF

Query:  YVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE-------
                NQ ++N YS +YNPGW+QHPNFSW GQ  SS +  +Q  KQ              PTPQQ  +QK + Q A Q   SN+E LM E       
Subjt:  YVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE-------

Query:  ---------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKIC
                             Y+ RND  VRNLE+QLGQ+A E++ RPQG+L S TE P R                              P  +REK  
Subjt:  ---------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKIC

Query:  EKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQ
               T++            P+ + E E  + +  +  N                  R P  +PQR    +QD  F+KFLD+LKQLHIN+P V+ALEQ
Subjt:  EKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQ

Query:  MPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSL
        MPTY KFLKDI+++K+                                        ++VGRA C LGA INLMP S+FK+L IG A PTTVTL LADRS+
Subjt:  MPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSL

Query:  THPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEI
        T P GKIED+LVKVDKF+F ADFIILDCEADK                               +++TFN++ AMKYP D +EC +I     ++   L ++
Subjt:  THPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEI

Query:  VRRETLEDILGGEELDEKG
        +  E   ++   EE +++G
Subjt:  VRRETLEDILGGEELDEKG

A0A6J1DY39 uncharacterized protein LOC1110256534.5e-13236.56Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILER-PIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV
        NPI++A+ R+R +RDYA  + + LN  ++   P DA +FE KP+M QMLN +GQF GL +EDP  H+K F++V N+F+L G+ +D L+L LF FS+ G  
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILER-PIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDV

Query:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK
         AWL++FP ++IT+W+D+ +KFL K+FP  +N+  + EII+FRQ  NE ++ AWERF+ L                            ++N +ANG F  
Subjt:  EAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASANGSFLK

Query:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN--------QTNIAPANAVSSPCCEICDEENIADSCP
        KS NE   ILD ++ +N  W   +  T  K  +P   +  +   +MQ QI  I  M+  M  +N         TN +P   ++   C  C + + +++CP
Subjt:  KSANEAHTILDTIATNNRHWGENEPTTILK--NPVKAVETEANYTMQAQIKAIHSMMMGMSMSN--------QTNIAPANAVSSPCCEICDEENIADSCP

Query:  LNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE
         NP+S++YV   G  NQ ++N YS  YNPGW+QHPNFSW GQ  S+ + ++Q  K+              PTP Q  +QK + Q A Q   SN+E LM E
Subjt:  LNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQ--------------PTPQQ--KQKKFQQVAVQSQESNLETLMNE

Query:  ----------------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPV
                                    Y+ RND  VR LE+QLGQ+  E++ RPQG+L S TE P R GKE C ++  RS L+Y+GP+ P  S      
Subjt:  ----------------------------YIARNDAAVRNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPV

Query:  VAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMP
                              E +    P+ + E    + +  +  N+                 RPP  +PQR    +QD  F+KFLD+LKQLHIN+P
Subjt:  VAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMP

Query:  LVKALEQMPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTL
         V+ALEQMPTY KF+KDI+++K+                                        ++VGRA C LGASINLMP S+FK+  IG A PTTVTL
Subjt:  LVKALEQMPTYVKFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTL

Query:  QLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELS
        QLADRS+T P GKIED+LVKVDKF+F  DFIILDCEADK                               +++TFN++ AMKY  D +EC++I     ++
Subjt:  QLADRSLTHPIGKIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEIDELS

Query:  RSTLQEIVRRETLEDILGGEELDEKG
           L +++  E   ++   EE +++G
Subjt:  RSTLQEIVRRETLEDILGGEELDEKG

A0A6J1EQ90 uncharacterized protein LOC1114364115.3e-10934.69Show/hide
Query:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRV-------CNSFKLTGVPEDTLKLKLFLF
        NPI++A+DR R IR YA P  + LNP I+   I    FELKPVMFQML T+GQF GLP EDPH H+K FL V        +SF+  GV +D ++L LF +
Subjt:  NPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKPVMFQMLNTMGQFSGLPNEDPHKHIKLFLRV-------CNSFKLTGVPEDTLKLKLFLF

Query:  SLQGDVEAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASA
         L+   ++WL++  P +I SWN LAE FL K+FP  +N+++K EI+ F+Q  +E L  A ERF+ +                            +V+ASA
Subjt:  SLQGDVEAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAAWERFQRL---------------------------ALVNASA

Query:  NGSFLKKSANEAHTILDTIATNNRHWGENEPTTILKNPVK----AVETEANYTMQAQIKAIHSMMMGMSMSNQTNI-------APANAVSSPCCEICDEE
        NG+ L K+ NEA+ IL+ IA+NN  W +     +  NP +     +E +A  ++ AQ+ ++ +++  +++   + I       A  N  ++  C  C EE
Subjt:  NGSFLKKSANEAHTILDTIATNNRHWGENEPTTILKNPVK----AVETEANYTMQAQIKAIHSMMMGMSMSNQTNI-------APANAVSSPCCEICDEE

Query:  NIADSCPLNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQ--------------AGSSNSNNHQFGKQPTPQQKQKKFQQVAVQSQESNLE
        +  D CP NPAS+FYV  Q +Q   + N +S  YNPGWR HPNFSW GQ              +G    N   +  Q    Q +   Q  A  + E+++E
Subjt:  NIADSCPLNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQ--------------AGSSNSNNHQFGKQPTPQQKQKKFQQVAVQSQESNLE

Query:  TLMNEYIARNDAAV-------RNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTE
        +L+ EY+A+NDA +       RNLEVQ+G                                                          EK  E+   +S E
Subjt:  TLMNEYIARNDAAV-------RNLEVQLGQIAQEIKNRPQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTE

Query:  IAKNKEENEKT----KEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYV
         A  ++ NE+     +  +  AE EE+ K++    +   + T           Y P   +PQR K   ++  F+KF+D+LK++HIN+PLV+AL+QMP YV
Subjt:  IAKNKEENEKT----KEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDPDYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYV

Query:  KFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIG
        KFLKD+L  +R                                        + +GRA C LGA+INLMP S++K+LGIG+ARPTTVTLQLADRS+T+P G
Subjt:  KFLKDILSKKR----------------------------------------RNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIG

Query:  KIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEI
        KIEDIL++VDKF+F ADFIILD E D                                +++ FN+  +MKYP   +ECS + E+
Subjt:  KIEDILVKVDKFVFHADFIILDCEADK-------------------------------EEITFNVMRAMKYPGDFKECSMIDEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAATTTTCAAGCAAGCTCTCTAAGGAAGACTTTTCTTCTATGGAAGCAGATCGCACACCACTCTCCTCCTCTTGGGGATACAATCTCCCTATGAAAAATAGAGGAGA
AGTTGGAATTCCCCAGAAATGCGACCGCATTTCTGGAAAAATGGAGGCCGTTCTGAGTCGTGCATGGGTCGTTGTTGACGAGTCTTCTTCGTACCTAACCGAGATTCTGA
CGGAATCTCGGTCTTCGCTTGGTTCCCATACTGTCGCGCCTCTTGAAGAGGCGCGACATCACCATCTGTTACCAGTCGTTCCCTTTCCGTCCAGTTCCGTCCAGTTGCAT
CTTCGTTCGCTCGGAACAGAGACGAAAGAAGAAGGAGCAACAAGAGTTGAACGCACGGGAATTTCTAGAGGAAGCATCTTACATTCAAGAGTTTCCAATGGTCTTGGAGT
CGATCCTCAAGTTGATCCACAAGATCGTGGAAGGGAGCAAAATGGCGGGAGAATTTCTCTAGTTCCTCCAATTCCACTGAGGCCAGAACCAGAAAACCCGATCTACATTG
CTGAAGATCGAAACCGAGAAATAAGGGATTACGCTTTACCAGTTTTCCAGACATTGAATCCAGGAATCTTGGAGCGACCGATTGATGCTCCACAATTTGAGCTTAAACCA
GTGATGTTCCAGATGTTAAATACGATGGGCCAATTCTCAGGACTTCCGAATGAGGATCCACACAAGCATATTAAACTGTTCTTGAGAGTTTGCAATTCTTTTAAACTAAC
AGGAGTGCCAGAAGATACTTTAAAGCTAAAACTTTTTCTGTTTTCTTTACAAGGAGATGTTGAAGCCTGGTTGGATTCATTTCCTCCTAATTCCATCACCTCTTGGAATG
ATTTGGCGGAGAAGTTCTTGGAGAAGTTCTTTCCCTCCAATAAAAATTCCAAGTACAAAGCAGAAATCATTGCTTTCAGACAGTCATACAATGAACCTTTGGATGCAGCT
TGGGAGAGATTCCAGAGGCTGGCACTTGTAAATGCATCTGCAAACGGATCTTTTCTAAAGAAGTCTGCTAATGAGGCGCACACTATTCTAGATACTATAGCCACCAATAA
CCGACATTGGGGAGAAAATGAACCAACAACCATTTTGAAAAATCCAGTGAAAGCAGTAGAAACAGAGGCTAACTATACGATGCAAGCCCAGATCAAGGCCATTCATAGCA
TGATGATGGGCATGTCCATGAGCAATCAAACCAATATAGCCCCTGCAAACGCTGTTTCTTCTCCCTGTTGTGAGATATGTGATGAGGAAAACATCGCTGATAGTTGTCCA
CTAAACCCTGCATCCGTTTTTTATGTACGTCCACAAGGAAATCAGAACCAAAATAGATGGAATCATTATTCGGCCATGTACAACCCAGGATGGAGACAACATCCGAACTT
TTCTTGGGGAGGACAAGCCGGATCGAGTAATTCCAATAATCATCAGTTTGGGAAGCAACCTACACCACAACAGAAGCAAAAAAAATTCCAACAAGTTGCAGTACAGAGCC
AAGAGTCAAATCTAGAGACTCTTATGAATGAGTACATAGCGAGGAACGATGCTGCAGTTAGGAATTTGGAAGTTCAGCTTGGACAAATTGCTCAAGAGATAAAGAATAGA
CCTCAAGGGACGTTGTCCAGCATGACCGAGACCCCACACAGAGAAGGAAAGGAACAATGCAAGGCAGTGACCCTAAGAAGTGAATTAGAATATGATGGACCAAAGTATCC
ACAAAGTTCAGAAGCAGAGATCCCAGTTGTAGCTCGAGAAAAGATTTGTGAAAAGAAGTCAGACAATTCCACAGAGATTGCAAAGAATAAAGAAGAAAACGAAAAAACCA
AGGAGCCTGAAGTCTTGGCAGAAAAAGAAGAAAAATTGAAGTTGGAGAAAAAACAGGGAAATGCTACTGCAAATGCGACCGCAAAACCCCTGAAATTTGTGATAGATCCA
GATTATAGACCACCTCTTTCGTACCCACAGAGATTCAAACATGCCTCACAAGATGTACAGTTCAAGAAGTTCTTAGATGTATTGAAGCAGTTGCACATTAACATGCCATT
AGTGAAGGCACTTGAACAAATGCCTACATATGTAAAGTTCTTAAAAGATATTCTGTCGAAAAAGAGAAGGAATGTAGGCAGAGCTCAGTGCTATTTGGGGGCAAGCATCA
ACTTAATGCCATATTCGGTTTTTAAACAGTTAGGAATAGGTGATGCACGACCAACAACTGTGACCTTGCAGTTGGCAGACCGATCACTTACACACCCCATTGGAAAGATT
GAAGACATATTGGTTAAAGTTGATAAGTTTGTTTTTCATGCAGATTTCATTATCTTAGATTGTGAAGCCGACAAGGAGGAAATCACGTTTAACGTGATGAGGGCAATGAA
GTACCCTGGTGATTTCAAGGAATGTTCAATGATCGATGAAATTGATGAACTCTCTCGATCTACTCTGCAGGAAATAGTAAGACGAGAAACCTTGGAGGATATTCTTGGAG
GTGAAGAATTGGATGAAAAAGGTGGGAGAAAGTTTGAAGTGGAAGAAAGCAATCAAACCATATTTTCGTGTCCACGGATATCGACCATCAACAACAAGTTAGCCCTTCAT
GTGTGTTTGTACCCCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAATTTTCAAGCAAGCTCTCTAAGGAAGACTTTTCTTCTATGGAAGCAGATCGCACACCACTCTCCTCCTCTTGGGGATACAATCTCCCTATGAAAAATAGAGGAGA
AGTTGGAATTCCCCAGAAATGCGACCGCATTTCTGGAAAAATGGAGGCCGTTCTGAGTCGTGCATGGGTCGTTGTTGACGAGTCTTCTTCGTACCTAACCGAGATTCTGA
CGGAATCTCGGTCTTCGCTTGGTTCCCATACTGTCGCGCCTCTTGAAGAGGCGCGACATCACCATCTGTTACCAGTCGTTCCCTTTCCGTCCAGTTCCGTCCAGTTGCAT
CTTCGTTCGCTCGGAACAGAGACGAAAGAAGAAGGAGCAACAAGAGTTGAACGCACGGGAATTTCTAGAGGAAGCATCTTACATTCAAGAGTTTCCAATGGTCTTGGAGT
CGATCCTCAAGTTGATCCACAAGATCGTGGAAGGGAGCAAAATGGCGGGAGAATTTCTCTAGTTCCTCCAATTCCACTGAGGCCAGAACCAGAAAACCCGATCTACATTG
CTGAAGATCGAAACCGAGAAATAAGGGATTACGCTTTACCAGTTTTCCAGACATTGAATCCAGGAATCTTGGAGCGACCGATTGATGCTCCACAATTTGAGCTTAAACCA
GTGATGTTCCAGATGTTAAATACGATGGGCCAATTCTCAGGACTTCCGAATGAGGATCCACACAAGCATATTAAACTGTTCTTGAGAGTTTGCAATTCTTTTAAACTAAC
AGGAGTGCCAGAAGATACTTTAAAGCTAAAACTTTTTCTGTTTTCTTTACAAGGAGATGTTGAAGCCTGGTTGGATTCATTTCCTCCTAATTCCATCACCTCTTGGAATG
ATTTGGCGGAGAAGTTCTTGGAGAAGTTCTTTCCCTCCAATAAAAATTCCAAGTACAAAGCAGAAATCATTGCTTTCAGACAGTCATACAATGAACCTTTGGATGCAGCT
TGGGAGAGATTCCAGAGGCTGGCACTTGTAAATGCATCTGCAAACGGATCTTTTCTAAAGAAGTCTGCTAATGAGGCGCACACTATTCTAGATACTATAGCCACCAATAA
CCGACATTGGGGAGAAAATGAACCAACAACCATTTTGAAAAATCCAGTGAAAGCAGTAGAAACAGAGGCTAACTATACGATGCAAGCCCAGATCAAGGCCATTCATAGCA
TGATGATGGGCATGTCCATGAGCAATCAAACCAATATAGCCCCTGCAAACGCTGTTTCTTCTCCCTGTTGTGAGATATGTGATGAGGAAAACATCGCTGATAGTTGTCCA
CTAAACCCTGCATCCGTTTTTTATGTACGTCCACAAGGAAATCAGAACCAAAATAGATGGAATCATTATTCGGCCATGTACAACCCAGGATGGAGACAACATCCGAACTT
TTCTTGGGGAGGACAAGCCGGATCGAGTAATTCCAATAATCATCAGTTTGGGAAGCAACCTACACCACAACAGAAGCAAAAAAAATTCCAACAAGTTGCAGTACAGAGCC
AAGAGTCAAATCTAGAGACTCTTATGAATGAGTACATAGCGAGGAACGATGCTGCAGTTAGGAATTTGGAAGTTCAGCTTGGACAAATTGCTCAAGAGATAAAGAATAGA
CCTCAAGGGACGTTGTCCAGCATGACCGAGACCCCACACAGAGAAGGAAAGGAACAATGCAAGGCAGTGACCCTAAGAAGTGAATTAGAATATGATGGACCAAAGTATCC
ACAAAGTTCAGAAGCAGAGATCCCAGTTGTAGCTCGAGAAAAGATTTGTGAAAAGAAGTCAGACAATTCCACAGAGATTGCAAAGAATAAAGAAGAAAACGAAAAAACCA
AGGAGCCTGAAGTCTTGGCAGAAAAAGAAGAAAAATTGAAGTTGGAGAAAAAACAGGGAAATGCTACTGCAAATGCGACCGCAAAACCCCTGAAATTTGTGATAGATCCA
GATTATAGACCACCTCTTTCGTACCCACAGAGATTCAAACATGCCTCACAAGATGTACAGTTCAAGAAGTTCTTAGATGTATTGAAGCAGTTGCACATTAACATGCCATT
AGTGAAGGCACTTGAACAAATGCCTACATATGTAAAGTTCTTAAAAGATATTCTGTCGAAAAAGAGAAGGAATGTAGGCAGAGCTCAGTGCTATTTGGGGGCAAGCATCA
ACTTAATGCCATATTCGGTTTTTAAACAGTTAGGAATAGGTGATGCACGACCAACAACTGTGACCTTGCAGTTGGCAGACCGATCACTTACACACCCCATTGGAAAGATT
GAAGACATATTGGTTAAAGTTGATAAGTTTGTTTTTCATGCAGATTTCATTATCTTAGATTGTGAAGCCGACAAGGAGGAAATCACGTTTAACGTGATGAGGGCAATGAA
GTACCCTGGTGATTTCAAGGAATGTTCAATGATCGATGAAATTGATGAACTCTCTCGATCTACTCTGCAGGAAATAGTAAGACGAGAAACCTTGGAGGATATTCTTGGAG
GTGAAGAATTGGATGAAAAAGGTGGGAGAAAGTTTGAAGTGGAAGAAAGCAATCAAACCATATTTTCGTGTCCACGGATATCGACCATCAACAACAAGTTAGCCCTTCAT
GTGTGTTTGTACCCCAGTTAG
Protein sequenceShow/hide protein sequence
MQFSSKLSKEDFSSMEADRTPLSSSWGYNLPMKNRGEVGIPQKCDRISGKMEAVLSRAWVVVDESSSYLTEILTESRSSLGSHTVAPLEEARHHHLLPVVPFPSSSVQLH
LRSLGTETKEEGATRVERTGISRGSILHSRVSNGLGVDPQVDPQDRGREQNGGRISLVPPIPLRPEPENPIYIAEDRNREIRDYALPVFQTLNPGILERPIDAPQFELKP
VMFQMLNTMGQFSGLPNEDPHKHIKLFLRVCNSFKLTGVPEDTLKLKLFLFSLQGDVEAWLDSFPPNSITSWNDLAEKFLEKFFPSNKNSKYKAEIIAFRQSYNEPLDAA
WERFQRLALVNASANGSFLKKSANEAHTILDTIATNNRHWGENEPTTILKNPVKAVETEANYTMQAQIKAIHSMMMGMSMSNQTNIAPANAVSSPCCEICDEENIADSCP
LNPASVFYVRPQGNQNQNRWNHYSAMYNPGWRQHPNFSWGGQAGSSNSNNHQFGKQPTPQQKQKKFQQVAVQSQESNLETLMNEYIARNDAAVRNLEVQLGQIAQEIKNR
PQGTLSSMTETPHREGKEQCKAVTLRSELEYDGPKYPQSSEAEIPVVAREKICEKKSDNSTEIAKNKEENEKTKEPEVLAEKEEKLKLEKKQGNATANATAKPLKFVIDP
DYRPPLSYPQRFKHASQDVQFKKFLDVLKQLHINMPLVKALEQMPTYVKFLKDILSKKRRNVGRAQCYLGASINLMPYSVFKQLGIGDARPTTVTLQLADRSLTHPIGKI
EDILVKVDKFVFHADFIILDCEADKEEITFNVMRAMKYPGDFKECSMIDEIDELSRSTLQEIVRRETLEDILGGEELDEKGGRKFEVEESNQTIFSCPRISTINNKLALH
VCLYPS